olmOCR-Bench comprises 7,010 unit tests across 1,402 PDF documents. It tests parsing of tables, math, multi-column layouts, old scans, and more.
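To make the scoring concrete: a benchmark built from unit tests can be summarized as the share of tests a model passes. The sketch below is a minimal illustration, assuming each test yields a binary pass/fail outcome and carries a category tag. The `UnitOutcome` type, the `pass_rates` helper, and the sample data are hypothetical and are not taken from the olmOCR-Bench codebase.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class UnitOutcome:
    """One unit-test outcome (hypothetical structure, for illustration only)."""
    category: str  # e.g. "tables" or "multi_column"
    passed: bool


def pass_rates(results):
    """Return {category: percentage of tests passed}, rounded to one decimal."""
    totals = defaultdict(int)
    passes = defaultdict(int)
    for r in results:
        totals[r.category] += 1
        passes[r.category] += int(r.passed)
    return {c: round(100.0 * passes[c] / totals[c], 1) for c in totals}


# Made-up outcomes, not real benchmark data.
sample = [
    UnitOutcome("tables", True),
    UnitOutcome("tables", True),
    UnitOutcome("tables", False),
    UnitOutcome("tables", True),
    UnitOutcome("multi_column", True),
    UnitOutcome("multi_column", False),
]
rates = pass_rates(sample)
print(rates)  # {'tables': 75.0, 'multi_column': 50.0}
```

Whether a published score is a per-category rate like this or an aggregate across categories depends on the reporting source, so check the linked source before comparing numbers.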
Base is one of the reported evaluation metrics for olmOCR-Bench. Codesota tracks published model scores on each metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | Chandra v0.1.0 | unverified | 99.9 | 2025 | Source ↗ |
| 02 | chandra-ocr-0.1.0 | paper | 99.9 | 2025 | Source ↗ |
| 03 | olmOCR v0.4.0 | paper | 99.7 | 2025 | Source ↗ |
| 04 | olmocr-v0.4.0 | paper | 99.7 | 2025 | Source ↗ |
| 05 | LightOnOCR-2-1B | paper | 99.6 | 2026 | Source ↗ |
| 06 | Qianfan-OCR | paper | 99.6 | 2026 | Source ↗ |
Headers Footers is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | olmOCR v0.4.0 | paper | 96.1 | 2025 | Source ↗ |
| 02 | olmocr-v0.4.0 | paper | 96.1 | 2025 | Source ↗ |
| 03 | olmOCR v0.3.0 | unverified | 95.1 | 2025 | Source ↗ |
| 04 | olmocr-v0.3.0 | paper | 95.1 | 2025 | Source ↗ |
| 05 | chandra-ocr-0.1.0 | paper | 90.8 | 2025 | Source ↗ |
| 06 | Chandra v0.1.0 | unverified | 90.8 | 2025 | Source ↗ |
Long Tiny Text is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | Chandra v0.1.0 | unverified | 92.3 | 2025 | Source ↗ |
| 02 | chandra-ocr-0.1.0 | paper | 92.3 | 2025 | Source ↗ |
| 03 | LightOnOCR-2-1B | paper | 91.4 | 2026 | Source ↗ |
| 04 | olmocr-v0.4.0 | paper | 81.9 | 2025 | Source ↗ |
| 05 | olmOCR v0.4.0 | paper | 81.9 | 2025 | Source ↗ |
| 06 | Qianfan-OCR | paper | 80.4 | 2026 | Source ↗ |
Multi Column is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | Qianfan-OCR | paper | 92.2 | 2026 | Source ↗ |
| 02 | LightOnOCR-2-1B | paper | 84.8 | 2026 | Source ↗ |
| 03 | olmocr-v0.4.0 | paper | 83.7 | 2025 | Source ↗ |
| 04 | olmOCR v0.4.0 | paper | 83.7 | 2025 | Source ↗ |
| 05 | Chandra v0.1.0 | unverified | 81.2 | 2025 | Source ↗ |
| 06 | chandra-ocr-0.1.0 | paper | 81.2 | 2025 | Source ↗ |
Arxiv is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1B | paper | 89.6 | 2026 | Source ↗ |
| 02 | marker-1.10.0 | paper | 83.8 | 2025 | Source ↗ |
| 03 | Marker 1.10.0 | unverified | 83.8 | 2025 | Source ↗ |
| 04 | olmOCR v0.4.0 | paper | 83.0 | 2025 | Source ↗ |
| 05 | olmocr-v0.4.0 | paper | 83.0 | 2025 | Source ↗ |
| 06 | chandra-ocr-0.1.0 | paper | 82.2 | 2025 | Source ↗ |
| 07 | Chandra v0.1.0 | unverified | 82.2 | 2025 | Source ↗ |
| 08 | Qianfan-OCR | paper | 80.1 | 2026 | Source ↗ |
Tables is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1B | paper | 89.0 | 2026 | Source ↗ |
| 02 | dots.ocr 3B | unverified | 88.3 | 2025 | Source ↗ |
| 03 | dots-ocr-3b | paper | 88.3 | 2025 | Source ↗ |
| 04 | Chandra v0.1.0 | unverified | 88.0 | 2025 | Source ↗ |
| 05 | chandra-ocr-0.1.0 | paper | 88.0 | 2025 | Source ↗ |
| 06 | olmocr-v0.4.0 | paper | 84.9 | 2025 | Source ↗ |
| 07 | olmOCR v0.4.0 | paper | 84.9 | 2025 | Source ↗ |
| 08 | Qianfan-OCR | paper | 81.6 | 2026 | Source ↗ |
Accuracy is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | Infinity-Parser2-Pro | unverified | 87.6 | 2026 | Paper ↗ |
| 02 | Chandra 2 | unverified | 85.9 | 2026 | Paper ↗, Code ↗ |
| 03 | dots.mocr | unverified | 83.9 | 2026 | Paper ↗, Code ↗ |
| 04 | LightOnOCR-2-1B | unverified | 83.2 | 2026 | Paper ↗, Source ↗ |
| 05 | Chandra | unverified | 83.1 | 2025 | Paper ↗ |
| 06 | Infinity-Parser 7B | unverified | 82.5 | 2025 | Paper ↗, Code ↗ |
| 07 | olmOCR-2-7B-1025 (7B) | unverified | 82.4 | 2025 | Paper ↗ |
| 08 | Falcon-OCR | unverified | 80.3 | 2026 | Paper ↗, Code ↗ |
| 09 | PaddleOCR-VL | unverified | 80.0 | 2025 | Paper ↗, Code ↗ |
| 10 | Qianfan-OCR | unverified | 79.8 | 2026 | Paper ↗, Code ↗ |
| 11 | dots.ocr | unverified | 79.1 | 2025 | Paper ↗, Code ↗ |
| 12 | MinerU2.5 | unverified | 77.5 | 2025 | Paper ↗, Code ↗ |
| 13 | DeepSeek-OCR-2 | unverified | 76.3 | 2026 | Paper ↗, Code ↗ |
| 14 | LightOnOCR-1B-1025 | unverified | 76.1 | 2026 | Paper ↗ |
Old Scans Math is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1B | paper | 85.6 | 2026 | Source ↗ |
| 02 | olmocr-v0.4.0 | paper | 82.3 | 2025 | Source ↗ |
| 03 | olmOCR v0.4.0 | paper | 82.3 | 2025 | Source ↗ |
| 04 | chandra-ocr-0.1.0 | paper | 80.3 | 2025 | Source ↗ |
| 05 | Chandra v0.1.0 | unverified | 80.3 | 2025 | Source ↗ |
| 06 | olmocr-v0.3.0 | paper | 79.9 | 2025 | Source ↗ |
| 07 | olmOCR v0.3.0 | unverified | 79.9 | 2025 | Source ↗ |
Pass Rate is one of the reported evaluation metrics for olmOCR-Bench.
Higher is better