8,809 Arabic text samples across 9 domains. Tests Arabic script recognition.
14 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.
| # | Model | Org | Submitted | Paper / code | cer |
|---|---|---|---|---|---|
| 01 | Gemini 2.0 FlashAPI | Dec 2025 | alphaxiv-leaderboard | 0.130 | |
| 02 | AIN 7BOSS | Research | Dec 2025 | alphaxiv-leaderboard | 0.200 |
| 03 | GPT-4oAPI | OpenAI | Dec 2025 | alphaxiv-leaderboard | 0.310 |
| 04 | GPT-4o mini | OpenAI | Dec 2025 | alphaxiv-leaderboard | 0.430 |
| 05 | Azure OCR | Microsoft | Dec 2025 | alphaxiv-leaderboard | 0.520 |
| 06 | TesseractOSS | Google (Open Source) | Dec 2025 | alphaxiv-leaderboard | 0.540 |
| 07 | EasyOCROSS | JaidedAI | Dec 2025 | alphaxiv-leaderboard | 0.580 |
| 08 | PaddleOCROSS | Baidu | Dec 2025 | alphaxiv-leaderboard | 0.790 |
| 09 | Gemma 3 | Apr 2026 | kitab-bench-leaderboard | 1.05 | |
| 10 | qwen2.5-vl-7b | — | Apr 2026 | kitab-bench-leaderboard | 1.20 |
| 11 | Qwen2-VL 7B | Alibaba | Apr 2026 | kitab-bench-leaderboard | 1.48 |
| 12 | Qaari | MBZUAI | Apr 2026 | kitab-bench-leaderboard | 1.80 |
| 13 | ArabicNougat | community | Apr 2026 | kitab-bench-leaderboard | 4.37 |
| 14 | Surya | VikParuchuri | Apr 2026 | kitab-bench-leaderboard | 4.95 |
Each row below marks a model that broke the previous record on cer. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.
Lower scores win. Each subsequent entry improved upon the previous best.
Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.