Speech

Working with voice and audio? Evaluate speech-to-text accuracy, voice synthesis quality, and speaker identification performance.

5 tasks, 4 datasets, 0 results

Speech Recognition

Converting spoken audio to text (LibriSpeech, Common Voice).

2 datasets, 0 results

LibriSpeech: LibriSpeech ASR Corpus (2015)

Approximately 1,000 hours of read English speech derived from audiobooks. Standard benchmark for automatic speech recognition (ASR).

Common Voice: Mozilla Common Voice (2019)

Large, crowdsourced multilingual dataset of transcribed speech covering diverse demographics and accents.
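
Speech recognition accuracy on corpora like these is commonly reported as word error rate (WER): the word-level edit distance between the reference transcript and the system output, divided by the number of reference words. The sketch below is a minimal illustration, not this site's scoring code. The function name `word_error_rate` and the normalization (lowercasing and whitespace splitting only) are assumptions; real evaluations usually apply benchmark-specific text normalization first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumed normalization: lowercase and split on whitespace only.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.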

Text-to-Speech

Generating natural-sounding speech from text.

2 datasets, 0 results

LJ Speech: The LJ Speech Dataset (2017)

13,100 short audio clips of a single speaker reading passages from non-fiction books. Standard benchmark for single-speaker TTS.

VCTK: CSTR VCTK Corpus (2019)

Speech data from 110 English speakers with various accents. Used for multi-speaker TTS.

Speaker Verification

Verifying speaker identity from voice samples.

0 datasets, 0 results
No datasets indexed yet. Contribute on GitHub

Speech Translation

Translating spoken audio directly to another language.

0 datasets, 0 results
No datasets indexed yet. Contribute on GitHub

Voice Cloning

Replicating a speaker's voice characteristics.

0 datasets, 0 results
No datasets indexed yet. Contribute on GitHub