Overview
Subtitle creation fails when it's treated as transcription rather than as editing for readability. A verbatim transcript displayed as subtitles produces either text that is too dense (long sentences that appear and disappear faster than most viewers read at regular reading speed) or too fragmented (every natural sentence broken into 2-word chunks that require rapid reading and produce no coherent phrase grouping). Subtitles are not a transcript — they are a visual presentation of spoken audio, edited for the reading speed and comprehension of the viewer watching at normal playback.
The Subtitle & Caption Framework builds subtitle timing to reading speed rather than to speaking speed, segments text into readable phrase groups, and produces the file formats each delivery platform requires.