This is a submission to the W4A 2009 Web accessibility conference (http://www.w4a.info/). In the video, we explain the current status of video accessibility on the Web and ways forward for HTML5. We propose a solution for associating textual captions with video and illustrate it using Ogg Kate, SRT, and DFXP as examples. We then discuss further challenges such as sign language, audio annotations, and more general types of time-aligned text, e.g. karaoke, music lyrics, ticker text, transcripts, or annotations with hyperlinks.