OCR benchmark
Higher is better
| # | Model | Score | Source |
|---|---|---|---|
| ★ | Claude Opus 4.7 | 87.6 | vendor |
| 2 | Claude Opus 4.5 | 80.9 | |
| 3 | Claude Opus 4.6 | 80.8 | |
| 4 | Gemini 3.1 Pro | 80.6 | |
| 5 | MiniMax M2.5 | 80.2 | |
| 6 | GPT-5.2 Thinking | 80 | |
| 7 | Claude Sonnet 4.6 | 79.6 | |
| 8 | Gemini 3 Flash | 78 | |
| 9 | Claude Sonnet 4.5 | 77.2 | |
| 10 | Kimi K2.5 | 76.8 | |
| 11 | GPT-5.1 | 76.3 | |
| 12 | Gemini 3 Pro | 76.2 | |
| 13 | GPT-5 | 74.9 | |
| 14 | MiniMax M2.1 | 74 | |
| 15 | Claude Haiku 4.5 | 73.3 | |
| 16 | Claude Sonnet 4 | 72.7 | |
| 17 | Claude Opus 4 | 72.5 | |
| 18 | Devstral 2 | 72.2 | |
| 19 | Qwen3-Coder-480B | 69.6 | |
| 20 | MiniMax M2 | 69.4 | |
| 21 | o3 | 69.1 | |
| 22 | o4-mini | 68.1 | |
| 23 | DeepSeek V3.1 | 66 | |
| 24 | Kimi K2 | 65.8 | |
| 25 | Grok 3 | 63.8 | |
| 26 | Gemini 2.5 Pro | 63.8 | |
| 27 | Claude 3.7 Sonnet | 63.7 | |
| 28 | Gemini 2.5 Flash | 60.4 | |
| 29 | DeepSeek R1-0528 | 57.6 | |
| 30 | o3-mini | 55.8 | |
| 31 | GPT-4.1 | 54.6 | |
| 32 | Claude 3.5 Sonnet | 50.8 | |
| 33 | DeepSeek-R1 | 49.2 | |
| 34 | o1 | 48.9 | |
| 35 | Devstral Small 2505 | 46.8 | |
| 36 | DeepSeek V3 | 42 | |
| 37 | GPT-4o | 41.2 | |
| 38 | Claude 3.5 Haiku | 40.6 | |
| 39 | DeepSeek V2.5 | 37 | |