LlamaIndex's 2026 benchmark for document parsing evaluated against agent-facing semantic correctness. ~2,078 human-verified pages from ~1,211 enterprise documents (insurance, finance, government) with 169,011 rule-based tests across five dimensions: tables (GTRM: GriTS + TableRecordMatch), charts (ChartDataPointMatch), content faithfulness, semantic formatting, and visual grounding (element pass rate). Purely rule-based — no LLM-as-judge. Overall score is the unweighted mean across the five dimensions.
14 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.
| # | Model | Org | Submitted | Paper / code | accuracy |
|---|---|---|---|---|---|
| 01 | LlamaParse Agentic | LlamaIndex | Apr 2026 | blog-post | 84.90 |
| 02 | LlamaParse Cost Effective | LlamaIndex | Apr 2026 | blog-post | 71.90 |
| 03 | Gemini 3 FlashAPI | Apr 2026 | blog-post | 71 | |
| 04 | Reducto | Reducto | Apr 2026 | blog-post | 67.80 |
| 05 | Qwen3-VL-4BOSS | Alibaba Qwen | Apr 2026 | blog-post | 62 |
| 06 | Azure Document Intelligence | Microsoft | Apr 2026 | blog-post | 59.60 |
| 07 | Extend | Extend | Apr 2026 | blog-post | 55.80 |
| 08 | Dots OCR 1.5OSS | RedNote HILab | Apr 2026 | blog-post | 55.80 |
| 09 | DoclingOSS | IBM Research | Apr 2026 | blog-post | 50.60 |
| 10 | Google Cloud Document AI | Google Cloud | Apr 2026 | blog-post | 50.40 |
| 11 | AWS Textract | Amazon Web Services | Apr 2026 | blog-post | 47.90 |
| 12 | GPT-5-mini | OpenAI | Apr 2026 | blog-post | 46.80 |
| 13 | LandingAI | LandingAI | Apr 2026 | blog-post | 45.20 |
| 14 | Anthropic Haiku 4.5 | Anthropic | Apr 2026 | blog-post | 45.20 |
Each row below marks a model that broke the previous record on accuracy. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.
Higher scores win. Each subsequent entry improved upon the previous best.
Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.