The Tedlium dataset is a corpus of English-language TED talks with transcriptions, sampled at 16kHz. It is used for automatic speech recognition (ASR) and comes in three releases, ranging from 118 to 452 hours of transcribed speech data. It was built during The International Workshop on Spoken Language Translation (IWSLT) 2011 Evaluation Campaign.
No results indexed yet — be the first to submit a score.
Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.