Multi-scene text reading, key information extraction, multilingual text, and document parsing benchmark.
28 results indexed across 4 metrics. Shaded row marks current SOTA; ties broken by submission date.
| # | Model | Org | Submitted | Paper / code | document-parsing |
|---|---|---|---|---|---|
| 01 | Gemini 1.5 ProAPI | Dec 2025 | alphaxiv-leaderboard | 62.37 | |
| 02 | Qwen2-VL 72BOSS | Alibaba | Dec 2024 | cc-ocr-paper | 53.78 |
| 03 | GPT-4oAPI | OpenAI | Dec 2024 | cc-ocr-paper | 53.30 |
| 04 | Claude 3.5 SonnetAPI | Anthropic | Dec 2024 | cc-ocr-paper | 47.79 |
| 05 | GOT-OCR2.0 | Alibaba | Dec 2024 | cc-ocr-paper | 39.18 |
| 06 | InternVL2-76BOSS | Shanghai AI Lab | Dec 2024 | cc-ocr-paper | 35.33 |
| # | Model | Org | Submitted | Paper / code | kie-f1 |
|---|---|---|---|---|---|
| 01 | Qwen2-VL 72BOSS | Alibaba | Dec 2025 | alphaxiv-leaderboard | 71.76 |
| 02 | Gemini 1.5 ProAPI | Dec 2025 | alphaxiv-leaderboard | 67.28 | |
| 03 | Claude 3.5 SonnetAPI | Anthropic | Dec 2025 | alphaxiv-leaderboard | 64.58 |
| 04 | GPT-4oAPI | OpenAI | Dec 2025 | alphaxiv-leaderboard | 63.45 |
| 05 | InternVL2-76BOSS | Shanghai AI Lab | Dec 2024 | cc-ocr-paper | 61.60 |
| # | Model | Org | Submitted | Paper / code | multi-scene-f1 |
|---|---|---|---|---|---|
| 01 | Gemini 1.5 ProAPI | Dec 2025 | alphaxiv-leaderboard | 83.25 | |
| 02 | Qwen2-VL 72BOSS | Alibaba | Dec 2025 | alphaxiv-leaderboard | 77.95 |
| 03 | InternVL2-76BOSS | Shanghai AI Lab | Dec 2025 | alphaxiv-leaderboard | 76.92 |
| 04 | GPT-4oAPI | OpenAI | Dec 2025 | alphaxiv-leaderboard | 76.40 |
| 05 | Claude 3.5 SonnetAPI | Anthropic | Dec 2025 | alphaxiv-leaderboard | 72.87 |
| 06 | GOT-OCR2.0 | Alibaba | Dec 2024 | cc-ocr-paper | 61 |
| 07 | TextMonkey | Huawei | Dec 2024 | cc-ocr-paper | 56.88 |
| 08 | Florence-2-Large | Microsoft | Dec 2024 | cc-ocr-paper | 49.24 |
| 09 | KOSMOS-2.5 | Microsoft | Dec 2024 | cc-ocr-paper | 47.55 |
| # | Model | Org | Submitted | Paper / code | multilingual-f1 |
|---|---|---|---|---|---|
| 01 | Gemini 1.5 ProAPI | Dec 2025 | alphaxiv-leaderboard | 78.97 | |
| 02 | GPT-4oAPI | OpenAI | Dec 2025 | alphaxiv-leaderboard | 73.44 |
| 03 | Qwen2-VL 72BOSS | Alibaba | Dec 2024 | cc-ocr-paper | 71.14 |
| 04 | Claude 3.5 SonnetAPI | Anthropic | Dec 2024 | cc-ocr-paper | 65.68 |
| 05 | Florence-2-Large | Microsoft | Dec 2024 | cc-ocr-paper | 49.70 |
| 06 | InternVL2-76BOSS | Shanghai AI Lab | Dec 2024 | cc-ocr-paper | 46.57 |
| 07 | KOSMOS-2.5 | Microsoft | Dec 2024 | cc-ocr-paper | 36.23 |
| 08 | GOT-OCR2.0 | Alibaba | Dec 2024 | cc-ocr-paper | 24.95 |
Each row below marks a model that broke the previous record on multi-scene-f1. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.
Higher scores win. Each subsequent entry improved upon the previous best.
Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.