Benchmark Stats
SOTA History
overall-score
Higher is better
| Rank | Model | Source | Score | Year | Paper |
|---|---|---|---|---|---|
| 1 | Ultravox-GLM-4P7 VoiceBench overall-score. Rank #1 on VoiceBench leaderboard as of 2026-03-28. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 88.86 | 2026 | Source |
| 2 | Whisper-v3-large + GPT-4o (cascade) VoiceBench overall-score. Cascade baseline: Whisper-large-v3 + GPT-4o. Rank #3. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 87.8 | 2026 | Source |
| 3 | GPT-4o-Audio VoiceBench overall-score. Rank #5 on VoiceBench leaderboard as of 2026-03-28. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 86.75 | 2026 | Source |
| 4 | Whisper-v3-large + LLaMA-3.1-8B (cascade) VoiceBench overall-score. Cascade baseline from original paper. Rank #9. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 77.48 | 2026 | Source |
| 5 | Kimi-Audio VoiceBench overall-score. Rank #10 on VoiceBench leaderboard as of 2026-03-28. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 76.91 | 2026 | Source |
| 6 | MiniCPM-o VoiceBench overall-score. Rank #15 on VoiceBench leaderboard as of 2026-03-28. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 71.23 | 2026 | Source |
| 7 | VITA-1.5 VoiceBench overall-score. Rank #19 on VoiceBench leaderboard as of 2026-03-28. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 64.53 | 2026 | Source |
| 8 | Qwen2-Audio VoiceBench overall-score. Rank #27. From original paper Table 3 and leaderboard. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 55.8 | 2026 | Source |
| 9 | LLaMA-Omni VoiceBench overall-score. Rank #34. From original paper Table 3 and leaderboard. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 41.12 | 2026 | Source |
| 10 | VITA-1.0 VoiceBench overall-score. Rank #35. From original paper Table 3 (as VITA) and leaderboard. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 36.43 | 2026 | Source |
| 11 | Mini-Omni2 VoiceBench overall-score. Rank #37. From original paper Table 3 and leaderboard. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 33.49 | 2026 | Source |
| 12 | Mini-Omni VoiceBench overall-score. Rank #38. From original paper Table 3 and leaderboard. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 30.42 | 2026 | Source |
| 13 | Moshi VoiceBench overall-score. Rank #39 (last). From original paper Table 3 and leaderboard. Source: matthewcym.github.io/VoiceBench/ (accessed 2026-03-28) and arXiv:2410.17196v2. | Community | 29.51 | 2026 | Source |