7,010 unit tests across 1,402 PDF documents. Tests parsing of tables, math, multi-column layouts, old scans, and more.
74 results indexed across 10 metrics. Shaded row marks current SOTA; ties broken by submission date.
| # | Model | Org | Submitted | Paper / code | arxiv |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 89.60 |
| 02 | Marker 1.10.0Open | VikParuchuri | Dec 2025 | github-readme | 83.80 |
| 03 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 83 |
| 04 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 82.20 |
| 05 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 80.10 |
| # | Model | Org | Submitted | Paper / code | base |
|---|---|---|---|---|---|
| 01 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 99.90 |
| 02 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 99.70 |
| 03 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 99.60 |
| 04 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 99.60 |
| # | Model | Org | Submitted | Paper / code | headers-footers |
|---|---|---|---|---|---|
| 01 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 96.10 |
| 02 | olmOCR v0.3.0Open | Allen AI | Dec 2025 | github-readme | 95.10 |
| 03 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 90.80 |
| 04 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 42 |
| # | Model | Org | Submitted | Paper / code | long-tiny-text |
|---|---|---|---|---|---|
| 01 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 92.30 |
| 02 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 91.40 |
| 03 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 81.90 |
| 04 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 80.40 |
| # | Model | Org | Submitted | Paper / code | multi-column |
|---|---|---|---|---|---|
| 01 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 92.20 |
| 02 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 84.80 |
| 03 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 83.70 |
| 04 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 81.20 |
| # | Model | Org | Submitted | Paper / code | old-scans |
|---|---|---|---|---|---|
| 01 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 73.10 |
| 02 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 50.40 |
| 03 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 47.70 |
| 04 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 42.20 |
| 05 | GPT-4oAPI | OpenAI | Dec 2025 | github-readme | 40.70 |
| # | Model | Org | Submitted | Paper / code | old-scans-math |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 85.60 |
| 02 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 82.30 |
| 03 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 80.30 |
| 04 | olmOCR v0.3.0Open | Allen AI | Dec 2025 | github-readme | 79.90 |
| # | Model | Org | Submitted | Paper / code | pass-rate |
|---|---|---|---|---|---|
| 01 | dots.mocrOpen | RedNote | Mar 2026 | github-readme | 83.90 |
| 02 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 83.20 |
| 03 | Chandra v0.1.0Open | datalab-to | Dec 2025 | alphaxiv-leaderboard | 83.10 |
| 04 | Infinity-Parser 7BOpen | — | Dec 2025 | alphaxiv-leaderboard | 82.50 |
| 05 | olmOCR v0.4.0Open | Allen AI | Dec 2025 | alphaxiv-leaderboard | 82.40 |
| 06 | PaddleOCR-VLOpen | Baidu | Dec 2025 | alphaxiv-leaderboard | 80 |
| 07 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 79.80 |
| 08 | Qwen3-VL-4BOpen | Alibaba Qwen | Mar 2026 | paper | 79.20 |
| 09 | dots.ocr 3BOpen | RedNote HILab | Dec 2025 | github-readme | 79.10 |
| 10 | PaddleOCR-VL-1.5Open | Baidu PaddlePaddle | Mar 2026 | paper | 79.10 |
| 11 | Mistral OCR 3API | Mistral | Dec 2025 | mistral-announcement | 78 |
| 12 | Marker 1.10.0Open | VikParuchuri | Dec 2025 | github-readme | 76.50 |
| 13 | Marker 1.10.1Open | VikParuchuri | Dec 2025 | alphaxiv-leaderboard | 76.10 |
| 14 | MonkeyOCR-pro-3BOpen | — | Jun 2025 | paper | 75.80 |
| 15 | DeepSeek-OCROpen | DeepSeek | Dec 2025 | alphaxiv-leaderboard | 75.70 |
| 16 | DeepSeek-OCROpen | DeepSeek | Dec 2025 | github-readme | 75.40 |
| 17 | MinerU 2.5Open | OpenDataLab | Dec 2025 | alphaxiv-leaderboard | 75.20 |
| 18 | Mistral OCR 2API | Mistral | Dec 2025 | alphaxiv-leaderboard | 72 |
| 19 | GPT-4o (Anchored) | OpenAI | Dec 2025 | github-readme | 69.90 |
| 20 | Nanonets OCR2 3B | Nanonets | Dec 2025 | alphaxiv-leaderboard | 69.50 |
| 21 | Gemini Flash 2 | Dec 2025 | github-readme | 63.80 |
| # | Model | Org | Submitted | Paper / code | tables |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1BOpen | LightOn | Jan 2026 | paper | 89 |
| 02 | dots.ocr 3BOpen | RedNote HILab | Dec 2025 | github-readme | 88.30 |
| 03 | Chandra v0.1.0Open | datalab-to | Dec 2025 | github-readme | 88 |
| 04 | olmOCR v0.4.0Open | Allen AI | Oct 2025 | paper | 84.90 |
| 05 | Qianfan-OCROpen | Baidu Qianfan | Mar 2026 | paper | 81.60 |
Each row below marks a model that broke the previous record on pass-rate. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.
Higher scores win. Each subsequent entry improved upon the previous best.
Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.
Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.