Codesota · Computer Vision · Document Parsing · olmOCR-Bench
Document Parsing · benchmark dataset · 2024 · EN

olmOCR-Bench.

7,010 unit tests across 1,402 PDF documents. Tests parsing of tables, math, multi-column layouts, old scans, and more.
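
As a rough illustration of how a pass-rate could be derived from per-document unit tests, the sketch below tallies boolean test outcomes by category and reports two aggregates: a micro average over all tests and a macro average over categories. The page does not say which aggregation its scores use, so both are shown. The function name `pass_rates` and the toy data are assumptions made for this example, not benchmark results.

```python
from collections import defaultdict


def pass_rates(results):
    """Aggregate (category, passed) pairs, one pair per unit test.

    Returns per-category pass rates plus micro and macro averages,
    all expressed as percentages. Assumes at least one result.
    """
    totals = defaultdict(int)
    passes = defaultdict(int)
    for category, passed in results:
        totals[category] += 1
        passes[category] += int(passed)

    per_category = {c: 100.0 * passes[c] / totals[c] for c in totals}
    # Micro: every unit test counts equally.
    micro = 100.0 * sum(passes.values()) / sum(totals.values())
    # Macro: every category counts equally, regardless of its size.
    macro = sum(per_category.values()) / len(per_category)
    return per_category, micro, macro


# Toy outcomes, invented for illustration only.
toy = [
    ("tables", True), ("tables", False), ("tables", True), ("tables", True),
    ("old-scans", True), ("old-scans", False),
]
per_category, micro, macro = pass_rates(toy)
```

The two averages diverge whenever categories differ in size, which is worth keeping in mind when comparing a headline score against the per-category columns.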

§ 01 · Leaderboard

Best published scores.

74 results indexed across 10 metrics. Shaded row marks current SOTA; ties broken by submission date.
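
The ordering rule stated above (score first, submission date as tie-breaker) can be sketched as a single sort with a composite key. The page does not say whether the earlier or the later submission wins a tie, so this sketch assumes the earlier one ranks first. The `Result` type and the day-of-month values are hypothetical, since the leaderboard shows month-level dates only.

```python
from datetime import date
from typing import NamedTuple


class Result(NamedTuple):
    model: str
    score: float
    submitted: date  # hypothetical day precision; the page lists months only


def rank(results):
    """Sort by score descending; on equal scores, earlier submission first (assumed)."""
    return sorted(results, key=lambda r: (-r.score, r.submitted))


# Three pass-rate rows from the leaderboard; the day of month is a placeholder.
rows = [
    Result("PaddleOCR-VL-1.5", 79.10, date(2026, 3, 1)),
    Result("dots.ocr 3B", 79.10, date(2025, 12, 1)),
    Result("Qwen3-VL-4B", 79.20, date(2026, 3, 1)),
]
ranked = rank(rows)
```

Negating the score inside the key keeps the sort to one pass and avoids mixing `reverse=True` with an ascending date.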


Primary metric: pass-rate · higher is better
All metrics: accuracy, arxiv, base, headers-footers, long-tiny-text, multi-column, old-scans, old-scans-math, pass-rate, tables
accuracy
18 rows
| # | Model | Org | Submitted | Paper / code | accuracy |
|---|---|---|---|---|---|
| 01 | Infinity-Parser2-Pro | | May 2026 | pwc-dump | 87.60 |
| 02 | Chandra 2 | | Mar 2026 | pwc-dump · code | 85.90 |
| 03 | dots.mocr | | Mar 2026 | Multimodal OCR: Parse Anything from Documents · code | 83.90 |
| 04 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | LightOnOCR: A 1B End-to-End Multilingual Vision-Language… | 83.20 |
| 05 | Chandra | | Oct 2025 | pwc-dump | 83.10 |
| 06 | Infinity-Parser 7B · Open | | Jun 2025 | Infinity Parser: Layout Aware Reinforcement Learning for… · code | 82.50 |
| 07 | olmOCR-2-7B-1025 (7B) | | Oct 2025 | olmOCR 2: Unit Test Rewards for Document OCR | 82.40 |
| 08 | Falcon-OCR | | Mar 2026 | Falcon Perception · code | 80.30 |
| 09 | PaddleOCR-VL · Open | Baidu | Oct 2025 | PaddleOCR-VL: Boosting Multilingual Document Parsing via… · code | 80 |
| 10 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | Qianfan-OCR: A Unified End-to-End Model for Document Int… · code | 79.80 |
| 11 | dots.ocr | | Dec 2025 | dots.ocr: Multilingual Document Layout Parsing in a Sing… · code | 79.10 |
| 12 | MinerU2.5 | | Sep 2025 | MinerU2.5: A Decoupled Vision-Language Model for Efficie… · code | 77.50 |
| 13 | DeepSeek-OCR-2 | | Jan 2026 | DeepSeek-OCR 2: Visual Causal Flow · code | 76.30 |
| 14 | LightOnOCR-1B-1025 | | Jan 2026 | LightOnOCR: A 1B End-to-End Multilingual Vision-Language… | 76.10 |
| 15 | DeepSeek-OCR · Open | DeepSeek | Oct 2025 | DeepSeek-OCR: Contexts Optical Compression · code | 75.70 |
| 16 | olmOCR-7B · Open | Allen AI | Feb 2025 | olmOCR: Unlocking Trillions of Tokens in PDFs with Visio… · code | 75.50 |
| 17 | GLM-OCR · Open | Zhipu AI | Mar 2026 | GLM-OCR Technical Report | 75.20 |
| 18 | FireRed-OCR | | Mar 2026 | FireRed-OCR Technical Report · code | 70.20 |
arxiv
5 rows
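| # | Model | Org | Submitted | Paper / code | arxiv |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 89.60 |
| 02 | Marker 1.10.0 · Open | VikParuchuri | Dec 2025 | github-readme | 83.80 |
| 03 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 83 |
| 04 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 82.20 |
| 05 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 80.10 |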
base
4 rows
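| # | Model | Org | Submitted | Paper / code | base |
|---|---|---|---|---|---|
| 01 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 99.90 |
| 02 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 99.70 |
| 03 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 99.60 |
| 04 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 99.60 |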
headers-footers
4 rows
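| # | Model | Org | Submitted | Paper / code | headers-footers |
|---|---|---|---|---|---|
| 01 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 96.10 |
| 02 | olmOCR v0.3.0 · Open | Allen AI | Dec 2025 | github-readme | 95.10 |
| 03 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 90.80 |
| 04 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 42 |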
long-tiny-text
4 rows
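| # | Model | Org | Submitted | Paper / code | long-tiny-text |
|---|---|---|---|---|---|
| 01 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 92.30 |
| 02 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 91.40 |
| 03 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 81.90 |
| 04 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 80.40 |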
multi-column
4 rows
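| # | Model | Org | Submitted | Paper / code | multi-column |
|---|---|---|---|---|---|
| 01 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 92.20 |
| 02 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 84.80 |
| 03 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 83.70 |
| 04 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 81.20 |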
old-scans
5 rows
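| # | Model | Org | Submitted | Paper / code | old-scans |
|---|---|---|---|---|---|
| 01 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 73.10 |
| 02 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 50.40 |
| 03 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 47.70 |
| 04 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 42.20 |
| 05 | GPT-4o · API | OpenAI | Dec 2025 | github-readme | 40.70 |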
old-scans-math
4 rows
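| # | Model | Org | Submitted | Paper / code | old-scans-math |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 85.60 |
| 02 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 82.30 |
| 03 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 80.30 |
| 04 | olmOCR v0.3.0 · Open | Allen AI | Dec 2025 | github-readme | 79.90 |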
pass-rate · primary
21 rows
| # | Model | Org | Submitted | Paper / code | pass-rate |
|---|---|---|---|---|---|
| 01 | dots.mocr · Open | RedNote | Mar 2026 | github-readme | 83.90 |
| 02 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 83.20 |
| 03 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | alphaxiv-leaderboard | 83.10 |
| 04 | Infinity-Parser 7B · Open | | Dec 2025 | alphaxiv-leaderboard | 82.50 |
| 05 | olmOCR v0.4.0 · Open | Allen AI | Dec 2025 | alphaxiv-leaderboard | 82.40 |
| 06 | PaddleOCR-VL · Open | Baidu | Dec 2025 | alphaxiv-leaderboard | 80 |
| 07 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 79.80 |
| 08 | Qwen3-VL-4B · Open | Alibaba Qwen | Mar 2026 | paper | 79.20 |
| 09 | dots.ocr 3B · Open | RedNote HILab | Dec 2025 | github-readme | 79.10 |
| 10 | PaddleOCR-VL-1.5 · Open | Baidu PaddlePaddle | Mar 2026 | paper | 79.10 |
| 11 | Mistral OCR 3 · API | Mistral | Dec 2025 | mistral-announcement | 78 |
| 12 | Marker 1.10.0 · Open | VikParuchuri | Dec 2025 | github-readme | 76.50 |
| 13 | Marker 1.10.1 · Open | VikParuchuri | Dec 2025 | alphaxiv-leaderboard | 76.10 |
| 14 | MonkeyOCR-pro-3B · Open | | Jun 2025 | paper | 75.80 |
| 15 | DeepSeek-OCR · Open | DeepSeek | Dec 2025 | alphaxiv-leaderboard | 75.70 |
| 16 | DeepSeek-OCR · Open | DeepSeek | Dec 2025 | github-readme | 75.40 |
| 17 | MinerU 2.5 · Open | OpenDataLab | Dec 2025 | alphaxiv-leaderboard | 75.20 |
| 18 | Mistral OCR 2 · API | Mistral | Dec 2025 | alphaxiv-leaderboard | 72 |
| 19 | GPT-4o (Anchored) | OpenAI | Dec 2025 | github-readme | 69.90 |
| 20 | Nanonets OCR2 3B | Nanonets | Dec 2025 | alphaxiv-leaderboard | 69.50 |
| 21 | Gemini Flash 2 | Google | Dec 2025 | github-readme | 63.80 |
tables
5 rows
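| # | Model | Org | Submitted | Paper / code | tables |
|---|---|---|---|---|---|
| 01 | LightOnOCR-2-1B · Open | LightOn | Jan 2026 | paper | 89 |
| 02 | dots.ocr 3B · Open | RedNote HILab | Dec 2025 | github-readme | 88.30 |
| 03 | Chandra v0.1.0 · Open | datalab-to | Dec 2025 | github-readme | 88 |
| 04 | olmOCR v0.4.0 · Open | Allen AI | Oct 2025 | paper | 84.90 |
| 05 | Qianfan-OCR · Open | Baidu Qianfan | Mar 2026 | paper | 81.60 |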
Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
§ 03 · Progress

4 steps of state of the art.

Each row below marks a model that broke the previous record on pass-rate. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · pass-rate
  1. Jun 5, 2025 · MonkeyOCR-pro-3B · 75.80
  2. Dec 16, 2025 · Chandra v0.1.0 · datalab-to · 83.10
  3. Jan 20, 2026 · LightOnOCR-2-1B · LightOn · 83.20
  4. Mar 19, 2026 · dots.mocr · RedNote · 83.90
Fig 3 · SOTA-setting models only. 4 entries span Jun 2025 to Mar 2026.
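
The SOTA line described in this section amounts to a running maximum over submissions in chronological order. A minimal sketch follows, assuming that a submission sets a new record only when it strictly exceeds the previous best. The function name `sota_steps` is hypothetical, and the Qianfan-OCR day of month is a placeholder because the leaderboard gives only "Mar 2026" for it.

```python
from datetime import date


def sota_steps(submissions):
    """Return the (date, model, score) tuples that strictly beat the best so far."""
    steps = []
    best = float("-inf")
    # Walk submissions oldest first so each one is compared to its predecessors.
    for when, model, score in sorted(submissions, key=lambda s: s[0]):
        if score > best:
            steps.append((when, model, score))
            best = score
    return steps


submissions = [
    (date(2026, 3, 19), "dots.mocr", 83.90),
    (date(2025, 6, 5), "MonkeyOCR-pro-3B", 75.80),
    (date(2026, 1, 20), "LightOnOCR-2-1B", 83.20),
    (date(2025, 12, 16), "Chandra v0.1.0", 83.10),
    # Non-record row included to show filtering; the day is a placeholder.
    (date(2026, 3, 1), "Qianfan-OCR", 79.80),
]
steps = sota_steps(submissions)
```

Run on these five rows, the function keeps the four record-setting entries listed above and drops the intermediate one.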
§ 04 · Literature

14 papers tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

§ 06 · Contribute

Have a score that beats this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.

What a submission needs
  • 01 · A public checkpoint or API endpoint
  • 02 · A reproduction script with frozen commit + seed
  • 03 · Declared evaluation environment (Python, deps)
  • 04 · One row per metric declared by this dataset
  • 05 · A contact so we can follow up on discrepancies