voicebench.

Voice assistant benchmark (listed under OCR)

§ 01 · overall-score

Higher is better

#   Model                                      Score  Source
1   Ultravox-GLM-4P7                           88.86  codesota-api
2   Whisper-v3-large + GPT-4o (cascade)        87.8   codesota-api
3   GPT-4o-Audio                               86.75  codesota-api
4   Whisper-v3-large + LLaMA-3.1-8B (cascade)  77.48  codesota-api
5   Kimi-Audio                                 76.91  codesota-api
6   MiniCPM-o                                  71.23  codesota-api
7   VITA-1.5                                   64.53  codesota-api
8   Qwen2-Audio                                55.8   codesota-api
9   LLaMA-Omni                                 41.12  codesota-api
10  VITA-1.0                                   36.43  codesota-api
11  Mini-Omni2                                 33.49  codesota-api
12  Mini-Omni                                  30.42  codesota-api
13  Moshi                                      29.51  codesota-api

All rows fetched from CodeSOTA API on 2026-04-20.