OCR benchmark
Higher is better.

| # | Model | Score | Source |
|---|---|---|---|
| ★ | o3 | 92.9 | |
| 2 | o1 | 91.8 | |
| 3 | gpt-45-preview | 90.8 | |
| 4 | o1-preview | 90.8 | |
| 5 | gpt-41 | 90.2 | |
| 6 | o4-mini | 90.0 | |
| 7 | llama-31-405b | 88.6 | |
| 8 | deepseek-v3 | 88.5 | |
| 9 | claude-35-sonnet | 88.3 | |
| 10 | grok-2 | 87.5 | |
| 11 | gpt-4o | 87.2 | |
| 12 | claude-3-opus | 86.8 | |
| 13 | gpt-4-turbo | 86.7 | |
| 14 | gemini-15-pro | 85.9 | |
| 15 | o3-mini | 85.9 | |
| 16 | o1-mini | 85.2 | |
| 17 | llama-31-70b | 82.0 | |
| 18 | gpt-4o-mini | 82.0 | |
| 19 | llama-3-70b | 82.0 | |