Speech data from 110 English speakers with various accents. Used for multi-speaker TTS.
Mos is the reported evaluation metric for VCTK. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | NaturalSpeech 3 | verified | 4.36 | 2024 | Paper ↗ | Looks wrong? |
| 02 | Ground Truth (VCTK) | verified | 4.26 | 2022 | Source ↗ | Looks wrong? |
| 03 | VITS | verified | 4.21 | 2026 | Source ↗ | Looks wrong? |
| 04 | StyleTTS 2 | verified | 4.19 | 2023 | Paper ↗ | Looks wrong? |
| 05 | StyleTTS2 | verified | 4.19 | 2023 | Source ↗ | Looks wrong? |
| 06 | VALL-E 2 | verified | 4.18 | 2024 | Paper ↗ | Looks wrong? |
| 07 | XTTS v2 | verified | 4.14 | 2023 | Paper ↗ | Looks wrong? |
| 08 | YourTTS | verified | 4.07 | 2022 | Source ↗ | Looks wrong? |
| 09 | SC-GlowTTS | verified | 3.78 | 2022 | Source ↗ | Looks wrong? |
Sim Score is the reported evaluation metric for VCTK. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Ground Truth (VCTK) | verified | 4.19 | 2022 | Source ↗ | Looks wrong? |
| 02 | YourTTS | verified | 4.16 | 2022 | Source ↗ | Looks wrong? |
| 03 | SC-GlowTTS | verified | 3.99 | 2022 | Source ↗ | Looks wrong? |
| 04 | VITS2 | verified | 3.99 | 2023 | Source ↗ | Looks wrong? |
| 05 | VITS | verified | 3.79 | 2023 | Source ↗ | Looks wrong? |