Massive multilingual dataset of transcribed speech. Covers diverse demographics and accents.
Wer Vi is the reported evaluation metric for Common Voice. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Whisper base | verified | 44.07 | 2024 | Source ↗ | Looks wrong? |
| 02 | MMS 1B-L1107 | verified | 43.88 | 2024 | Source ↗ | Looks wrong? |
| 03 | Whisper large-v2 | verified | 18 | 2024 | Source ↗ | Looks wrong? |
| 04 | Whisper large-v3 | verified | 13.74 | 2024 | Source ↗ | Looks wrong? |
| 05 | Google USM Chirp v2 | verified | 12.46 | 2024 | Source ↗ | Looks wrong? |
| 06 | GigaSpeech 2 | verified | 11.47 | 2024 | Source ↗ | Looks wrong? |
| 07 | Azure Speech CLI 1.37 | verified | 10.21 | 2024 | Source ↗ | Looks wrong? |
Wer Id is the reported evaluation metric for Common Voice. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Whisper base | verified | 34.7 | 2024 | Source ↗ | Looks wrong? |
| 02 | MMS 1B-L1107 | verified | 20.72 | 2024 | Source ↗ | Looks wrong? |
| 03 | Azure Speech CLI 1.37 | verified | 10.33 | 2024 | Source ↗ | Looks wrong? |
| 04 | Google USM Chirp v2 | verified | 9.70 | 2024 | Source ↗ | Looks wrong? |
| 05 | Whisper large-v2 | verified | 8.93 | 2024 | Source ↗ | Looks wrong? |
| 06 | Whisper large-v3 | verified | 7.43 | 2024 | Source ↗ | Looks wrong? |
| 07 | GigaSpeech 2 | verified | 7.33 | 2024 | Source ↗ | Looks wrong? |
Wer Th is the reported evaluation metric for Common Voice. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Whisper base | verified | 32.59 | 2024 | Source ↗ | Looks wrong? |
| 02 | Google USM Chirp v2 | verified | 14.75 | 2024 | Source ↗ | Looks wrong? |
| 03 | MMS 1B-L1107 | verified | 14.49 | 2024 | Source ↗ | Looks wrong? |
| 04 | Azure Speech CLI 1.37 | verified | 10.2 | 2024 | Source ↗ | Looks wrong? |
| 05 | Whisper large-v2 | verified | 8.79 | 2024 | Source ↗ | Looks wrong? |
| 06 | Whisper large-v3 | verified | 6.02 | 2024 | Source ↗ | Looks wrong? |
| 07 | GigaSpeech 2 | verified | 4.15 | 2024 | Source ↗ | Looks wrong? |
Wer is the reported evaluation metric for Common Voice. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Lower is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Whisper Large v3 | verified | 8.40 | 2026 | Source ↗ | Looks wrong? |
| 02 | LUPET | verified | 9.15 | 2024 | Source ↗ | Looks wrong? |
| 03 | wav2vec 2.0 Large (960h) | verified | 10.5 | 2020 | Paper ↗ | Looks wrong? |
| 04 | Whisper Large v2 | verified | 11.2 | 2026 | Source ↗ | Looks wrong? |
Wer En is the reported evaluation metric for Common Voice. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Whisper large-v3 | verified | 11 | 2025 | Source ↗ | Looks wrong? |
| 02 | Whisper large-v2 | verified | 9.80 | 2023 | Source ↗ | Looks wrong? |
| 03 | Vicuna-13B + Whisper Q-Former | verified | 8.20 | 2023 | Source ↗ | Looks wrong? |
| 04 | B-Whisper | verified | 7.00 | 2025 | Source ↗ | Looks wrong? |
Wer En Accents is the reported evaluation metric for Common Voice. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | Accent-Specific Codebook ASR | verified | 6.43 | 2024 | Source ↗ | Looks wrong? |