Text-to-Speech
Generating natural-sounding speech from text.
2 datasets · 0 results
Text-to-Speech (TTS) is a core task in speech processing. The standard benchmarks used to evaluate TTS models are listed below; no state-of-the-art results have been tracked for them yet.
Benchmarks & SOTA
LJ Speech
The LJ Speech Dataset
2017 · 0 results
13,100 short audio clips of a single speaker reading passages from non-fiction books. Standard benchmark for single-speaker TTS.
No results tracked yet
VCTK
CSTR VCTK Corpus
2019 · 0 results
Speech data from 110 English speakers with various accents. Used for multi-speaker TTS.
No results tracked yet