Codesota · OCR · Benchmarks · audiobench
audiobench.
OCR benchmark
§ 01 · avg-score
avg-score.
Higher is better
| #   | Model                     | Score | Source       |
|-----|---------------------------|-------|--------------|
| 1 ★ | WavLLM                    | 50.25 | codesota-api |
| 2   | SALMONN                   | 43.99 | codesota-api |
| 3   | Qwen2-Audio-Instruct      | 42.12 | codesota-api |
| 4   | Whisper+LLaMA-3 (cascade) | 40.9  | codesota-api |
| 5   | Qwen-Audio-Chat           | 38.59 | codesota-api |

All scores fetched from CodeSOTA API on 2026-04-20.