Multilingual speech benchmark covering 100+ languages. Commonly used for ASR and speech-language model evaluation.
Word error rate (WER) is the reported evaluation metric for FLEURS. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Lower scores are better.
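For readers unfamiliar with the metric, here is a minimal sketch of how a word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a model's hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration. Published scores typically apply text normalization (casing, punctuation, language-specific rules) before scoring, and that step is not shown here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative only: splits on whitespace and applies no text
    normalization, which real evaluations usually do.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, this ratio can exceed 1.0 when the hypothesis is much longer than the reference. Leaderboards often report the value multiplied by 100.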
| Rank | Model | Trust | Score (WER) | Year | Links |
|---|---|---|---|---|---|
| 01 | Phi-4-Multimodal 5.6B | unverified | 4.00 | 2025 | Paper |