GigaSpeech is a 10,000-hour English ASR corpus pulled from audiobooks, podcasts, and YouTube. Released by SpeechColab and widely used as a high-volume training+evaluation set covering diverse speaking styles and noise conditions.
Wer is the reported evaluation metric for GigaSpeech. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Lower is better