Document Parsing, 2024, en
olmOCR-Bench
7,010 unit tests across 1,402 PDF documents. Tests parsing of tables, math, multi-column layouts, old scans, and more.
Samples: 1,402
Metrics: pass-rate, tables, old-scans-math, long-tiny-text, base, headers-footers, multi-column, arxiv, old-scans
Current State of the Art
Chandra v0.1.0 (datalab-to): 83.1 pass-rate
Top Models Performance Comparison
Top 10 models ranked by pass-rate
Best Score: 83.1
Top Model: Chandra v0.1.0
Models Compared: 10
Score Range: 7.7
arxiv
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Marker 1.10.0, Open Source, VikParuchuri | 83.8 | | Dec 2025 |
| 2 | Chandra v0.1.0, Open Source, datalab-to | 82.2 | | Dec 2025 |
base
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Chandra v0.1.0, Open Source, datalab-to | 99.9 | | Dec 2025 |
headers-footers
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | olmOCR v0.3.0, Open Source, Allen AI | 95.1 | | Dec 2025 |
| 2 | Chandra v0.1.0, Open Source, datalab-to | 90.8 | | Dec 2025 |
long-tiny-text
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Chandra v0.1.0, Open Source, datalab-to | 92.3 | | Dec 2025 |
multi-column
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Chandra v0.1.0, Open Source, datalab-to | 81.2 | | Dec 2025 |
old-scans
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Chandra v0.1.0, Open Source, datalab-to | 50.4 | | Dec 2025 |
| 2 | GPT-4o, API, OpenAI | 40.7 | | Dec 2025 |
old-scans-math
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Chandra v0.1.0, Open Source, datalab-to | 80.3 | | Dec 2025 |
| 2 | olmOCR v0.3.0, Open Source, Allen AI | 79.9 | | Dec 2025 |
pass-rate (Primary)
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Chandra v0.1.0, Open Source, datalab-to | 83.1 | | Dec 2025 |
| 2 | Infinity-Parser 7B, Open Source | 82.5 | | Dec 2025 |
| 3 | olmOCR v0.4.0, Open Source, Allen AI | 82.4 | | Dec 2025 |
| 4 | PaddleOCR-VL, Open Source, Baidu | 80 | | Dec 2025 |
| 5 | dots.ocr 3B, Open Source, RedNote HILab | 79.1 | | Dec 2025 |
| 6 | Mistral OCR 3, API, Mistral | 78 | | Dec 2025 |
| 7 | Marker 1.10.0, Open Source, VikParuchuri | 76.5 | | Dec 2025 |
| 8 | Marker 1.10.1, Open Source, VikParuchuri | 76.1 | | Dec 2025 |
| 9 | DeepSeek OCR, Open Source, DeepSeek | 75.7 | | Dec 2025 |
| 10 | DeepSeek OCR, Open Source, DeepSeek | 75.4 | | Dec 2025 |
| 11 | MinerU 2.5, Open Source, OpenDataLab | 75.2 | | Dec 2025 |
| 12 | Mistral OCR 2, API, Mistral | 72 | | Dec 2025 |
| 13 | GPT-4o (Anchored), OpenAI | 69.9 | | Dec 2025 |
| 14 | Nanonets OCR2 3B, Nanonets | 69.5 | | Dec 2025 |
| 15 | Gemini Flash 2, Google | 63.8 | | Dec 2025 |
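
The summary figures near the top of this page (best score 83.1, 10 models compared, score range 7.7) appear to be derived from the first ten rows of the pass-rate table. A minimal Python sketch of that arithmetic, assuming "Score Range" means the top score minus the tenth-ranked score; the variable names and the rounding step are illustrative and not part of any benchmark tooling:

```python
# Pass-rate scores for ranks 1-10, copied from the table above.
top10_pass_rate = [83.1, 82.5, 82.4, 80.0, 79.1, 78.0, 76.5, 76.1, 75.7, 75.4]

best_score = max(top10_pass_rate)
models_compared = len(top10_pass_rate)

# Assumption: the range is best minus worst among the compared models.
# Rounded to one decimal to match the precision of the published scores.
score_range = round(best_score - min(top10_pass_rate), 1)

print(best_score, models_compared, score_range)
```

Under this reading, the three computed values agree with the summary figures, which suggests rows 11 to 15 are listed in the table but excluded from the comparison summary.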
tables
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | dots.ocr 3B, Open Source, RedNote HILab | 88.3 | | Dec 2025 |
| 2 | Chandra v0.1.0, Open Source, datalab-to | 88 | | Dec 2025 |