Codesota · Benchmark · KITAB-Bench
MBZUAI

KITAB-Bench.

8,809 Arabic text samples across 9 domains. Tests Arabic script recognition.

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.


Character Error Rate

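Character-level Levenshtein (edit) distance between the predicted text and the ground truth, divided by the ground-truth length. Lower is better, and scores above 1.00 are possible when a model emits far more characters than the reference contains.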
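To make the metric concrete, here is a minimal sketch of Character Error Rate in Python. It assumes the common definition (edit distance divided by reference length) and counts characters as Unicode code points. The function names `levenshtein` and `cer` are illustrative, and the page does not say how KITAB-Bench normalizes text (diacritics, whitespace, per-sample versus corpus-level averaging), so this should not be read as the benchmark's own scoring code.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `ref` into `hyp`."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty string
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from ref
                curr[j - 1] + 1,     # insert h into ref
                prev[j - 1] + cost,  # substitute r with h (or match)
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character Error Rate: edit distance normalized by reference length.
    Assumption: characters are Python code points, with no text normalization."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return levenshtein(ref, hyp) / len(ref)


if __name__ == "__main__":
    print(cer("كتاب", "كتب"))       # one missing letter out of four
    print(cer("ab", "abxxxxxx"))    # heavy over-generation pushes CER above 1
```

Because the denominator is the reference length only, a model that hallucinates long output can score well above 1.00, which is consistent with the largest values in the table below.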

Trust tiers for Character Error Rate: verified, paper, vendor, community, unverified
| Rank | Model | Trust | Score | Year | Notes |
|---|---|---|---|---|---|
| 01 | Gemini 2.0 Flash | unverified | 0.13 | 2025 | 8,809 samples, 9 domains |
| 02 | gemini-20-flash | paper | 0.13 | 2025 | 8,809 samples, 9 domains |
| 03 | ain-7b | paper | 0.20 | 2025 | |
| 04 | AIN 7B | unverified | 0.20 | 2025 | |
| 05 | GPT-4o | unverified | 0.31 | 2025 | |
| 06 | GPT-4o mini | unverified | 0.43 | 2025 | |
| 07 | gpt-4o-mini | paper | 0.43 | 2025 | |
| 08 | azure-ocr | paper | 0.52 | 2025 | |
| 09 | Azure OCR | unverified | 0.52 | 2025 | |
| 10 | tesseract | paper | 0.54 | 2025 | |
| 11 | easyocr | paper | 0.58 | 2025 | |
| 12 | PaddleOCR | unverified | 0.79 | 2025 | |
| 13 | Gemma 3 | verified | 1.05 | 2026 | Gemma 3 on KITAB-Bench |
| 14 | qwen2.5-vl-7b | verified | 1.20 | 2026 | Qwen2.5-VL-7B on KITAB-Bench |
| 15 | Qwen2-VL 7B | verified | 1.48 | 2026 | Qwen2-VL-7B on KITAB-Bench, 8,809 samples, 9 domains |
| 16 | Qaari | verified | 1.80 | 2026 | Specialized Arabic OCR model |
| 17 | ArabicNougat | verified | 4.37 | 2026 | Specialized Arabic document model |
| 18 | Surya | verified | 4.95 | 2026 | Surya OCR on KITAB-Bench |
Lineage

KITAB-Bench in context.

This benchmark (1)
KITAB-Bench (active, 2025-02)
None yet — this is the current frontier.
