The LibriSpeech dataset is a large corpus of approximately 1,000 hours of 16kHz read English speech, derived from public domain audiobooks from the LibriVox project. It is widely used for training and evaluating automatic speech recognition (ASR) systems, and the data has been meticulously segmented and aligned with corresponding text transcripts. This dataset is about classifying male vs. female voices
No results indexed yet — be the first to submit a score.
Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.