TED-LIUM is an English ASR corpus derived from public TED talks, with the v3 release providing ~452 hours of audio aligned to verbatim transcripts. Long-form prepared speech with diverse speakers, accents, and topics.
Wer is the reported evaluation metric for TED-LIUM. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Lower is better