
KITAB-Bench

MBZUAI

Arabic OCR and document understanding benchmark with 8,809 samples across 9 domains.

Total Results: 8
Models Tested: 8
Metrics: 1
Last Updated: 2025-12-21

Character Error Rate

Character-level Levenshtein (edit) distance between the predicted and ground-truth text, normalized by the length of the ground truth (lower is better)

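As a rough illustration of the metric described above, the following is a minimal sketch of a Character Error Rate computation in Python. The function names `levenshtein` and `cer` are hypothetical, and the sketch assumes plain per-sample normalization by reference length; the benchmark's own evaluation may normalize text differently (for example, in its handling of diacritics or whitespace) or aggregate across samples in another way.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where each insertion, deletion, or substitution costs 1."""
    # Keep the shorter string in the inner loop to limit row size.
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(prediction: str, reference: str) -> float:
    """Character Error Rate: edit distance divided by reference length.

    Assumption: characters are Unicode code points and no text
    normalization is applied before comparison.
    """
    if not reference:
        raise ValueError("reference must be non-empty")
    return levenshtein(prediction, reference) / len(reference)


if __name__ == "__main__":
    # One missing character out of a four-character reference.
    print(cer("كتب", "كتاب"))
```

Under this definition a score can exceed 1.0 when the prediction is much longer than the reference, so a CER is not strictly bounded like an accuracy.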

Arabic OCR - Character Error Rate (lower is better). 8,809 samples, 9 domains.

| Rank | Model           | Score | Source               |
|------|-----------------|-------|----------------------|
| 1    | gemini-20-flash | 0.13% | alphaxiv-leaderboard |
| 2    | ain-7b          | 0.20% | alphaxiv-leaderboard |
| 3    | gpt-4o          | 0.31% | alphaxiv-leaderboard |
| 4    | gpt-4o-mini     | 0.43% | alphaxiv-leaderboard |
| 5    | azure-ocr       | 0.52% | alphaxiv-leaderboard |
| 6    | tesseract       | 0.54% | alphaxiv-leaderboard |
| 7    | easyocr         | 0.58% | alphaxiv-leaderboard |
| 8    | paddleocr       | 0.79% | alphaxiv-leaderboard |
