Level A Perceivable WCAG 1.2.3

What This Criterion Requires

WCAG 1.2.3 requires that prerecorded synchronized media (video with audio) provides either an audio description of the visual content or a full text alternative that describes both the audio and visual information. Audio descriptions are narrated tracks that describe important visual details during natural pauses in dialogue, such as on-screen actions, scene changes, character expressions, and text that appears on screen. A full text alternative such as a screenplay or detailed transcript can serve as an alternative to audio descriptions. This criterion specifically addresses the visual information in videos that is not already conveyed by the existing audio track. For example, if a speaker says 'as you can see in this chart' but does not describe the chart data verbally, blind users need an audio description or text alternative that fills in that visual information. Either approach is acceptable at Level A, though audio descriptions are generally preferred as they allow users to stay in the media experience.

Why It Matters

Users who are blind or have low vision cannot see the visual content in videos. While they can hear the dialogue and narration, they miss critical visual information such as on-screen text, actions, facial expressions, scene changes, and demonstrations that are not described verbally in the existing audio track. Without audio descriptions or text alternatives, these users receive an incomplete and potentially confusing experience. Consider an instructional video where the presenter says 'click this button here' while pointing to a specific UI element -- a blind user has no idea which button is being referenced. Audio descriptions bridge this gap by narrating the visual elements during pauses in the existing audio. This criterion also benefits users with cognitive disabilities who may better understand content when visual information is explicitly described, and it provides a richer experience for anyone listening to video content without watching the screen.

Common Failures and How to Fix Them

Video with important visual information not described in audio

A tutorial or presentation video shows on-screen actions, charts, or demonstrations that are not described by the narrator. Blind users miss the visual content entirely because the audio track assumes the viewer can see what is happening.

Inaccessible
<video controls>
  <source src="/videos/quarterly-report.mp4" type="video/mp4">
  <track kind="captions" src="/captions/quarterly-report.vtt" srclang="en" label="English" default>
  <!-- Video shows charts and graphs while narrator says
       "As you can see, our numbers look great this quarter" -->
</video>
Accessible
<video controls>
  <source src="/videos/quarterly-report.mp4" type="video/mp4">
  <track kind="captions" src="/captions/quarterly-report.vtt" srclang="en" label="English" default>
  <track kind="descriptions" src="/descriptions/quarterly-report.vtt" srclang="en" label="Audio descriptions">
</video>
<!-- Audio description track includes:
     "Bar chart shows revenue increasing from $1.2M in Q1 to $1.8M in Q4,
      a 50% year-over-year increase" -->

No text alternative provided for video content

A video lacks both audio descriptions and a full text transcript or screenplay that describes the visual content alongside the dialogue. Users who cannot see the video have no way to access the visual information.

Inaccessible
<video controls>
  <source src="/videos/office-tour.mp4" type="video/mp4">
</video>
<p>Watch our virtual office tour!</p>
Accessible
<video controls>
  <source src="/videos/office-tour.mp4" type="video/mp4">
</video>
<details>
  <summary>Full text description of video</summary>
  <div class="media-alternative">
    <p>[Scene: Modern open-plan office with floor-to-ceiling windows overlooking a city skyline]</p>
    <p>Narrator: Welcome to our headquarters.</p>
    <p>[Camera pans to reveal rows of standing desks with dual monitors, plants on every desk, and a large communal kitchen area in the background]</p>
    <p>Narrator: We designed this space to foster collaboration.</p>
  </div>
</details>

How to Test

  1. Watch each prerecorded video and note all visual information that is not conveyed through the existing audio track, including on-screen text, actions, charts, scene changes, and speaker identification.
  2. Check whether the video provides an audio description track (via a track element with kind='descriptions' or a separate version of the video with audio descriptions mixed in).
  3. If no audio description is available, check for a full text alternative near the video that describes both the audio and visual content in sufficient detail.
  4. Verify that the audio descriptions or text alternative accurately and completely convey the visual information a sighted user would receive.

CMS-Specific Guidance

This criterion commonly causes issues on these platforms:

Further Reading

Related WCAG Criteria