Multi-scene text reading, key information extraction, multilingual text, and document parsing benchmark.
Multi Scene F1 is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Edit |
|---|---|---|---|---|---|---|
| 01 | Gemini 1.5 Pro | unverified | 83.25 | 2025 | Source ↗ | Edit result |
| 02 | gemini-15-pro | paper | 83.25 | 2025 | Source ↗ | Edit result |
| 03 | qwen2-vl-72b | paper | 77.95 | 2025 | Source ↗ | Edit result |
| 04 | Qwen2-VL 72B | unverified | 77.95 | 2025 | Source ↗ | Edit result |
| 05 | InternVL2-76B | unverified | 76.92 | 2025 | Source ↗ | Edit result |
| 06 | gpt-4o | paper | 76.4 | 2025 | Source ↗ | Edit result |
| 07 | Claude 3.5 Sonnet | unverified | 72.87 | 2025 | Source ↗ | Edit result |
| 08 | claude-35-sonnet | paper | 72.87 | 2025 | Source ↗ | Edit result |
| 09 | GOT-OCR2.0 | verified | 61 | 2024 | Source ↗ | Edit result |
| 10 | TextMonkey | verified | 56.88 | 2024 | Source ↗ | Edit result |
| 11 | Florence-2-Large | verified | 49.24 | 2024 | Source ↗ | Edit result |
| 12 | KOSMOS-2.5 | verified | 47.55 | 2024 | Source ↗ | Edit result |
Multilingual F1 is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Edit |
|---|---|---|---|---|---|---|
| 01 | gemini-15-pro | paper | 78.97 | 2025 | Source ↗ | Edit result |
| 02 | Gemini 1.5 Pro | unverified | 78.97 | 2025 | Source ↗ | Edit result |
| 03 | gpt-4o | paper | 73.44 | 2025 | Source ↗ | Edit result |
| 04 | Qwen2-VL 72B | verified | 71.14 | 2024 | Source ↗ | Edit result |
| 05 | Claude 3.5 Sonnet | verified | 65.68 | 2024 | Source ↗ | Edit result |
| 06 | Florence-2-Large | verified | 49.7 | 2024 | Source ↗ | Edit result |
| 07 | InternVL2-76B | verified | 46.57 | 2024 | Source ↗ | Edit result |
| 08 | KOSMOS-2.5 | verified | 36.23 | 2024 | Source ↗ | Edit result |
| 09 | GOT-OCR2.0 | verified | 24.95 | 2024 | Source ↗ | Edit result |
Kie F1 is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Edit |
|---|---|---|---|---|---|---|
| 01 | qwen2-vl-72b | paper | 71.76 | 2025 | Source ↗ | Edit result |
| 02 | Qwen2-VL 72B | unverified | 71.76 | 2025 | Source ↗ | Edit result |
| 03 | gemini-15-pro | paper | 67.28 | 2025 | Source ↗ | Edit result |
| 04 | Gemini 1.5 Pro | unverified | 67.28 | 2025 | Source ↗ | Edit result |
| 05 | Claude 3.5 Sonnet | unverified | 64.58 | 2025 | Source ↗ | Edit result |
| 06 | claude-35-sonnet | paper | 64.58 | 2025 | Source ↗ | Edit result |
| 07 | GPT-4o | unverified | 63.45 | 2025 | Source ↗ | Edit result |
| 08 | InternVL2-76B | verified | 61.6 | 2024 | Source ↗ | Edit result |
Document Parsing is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Edit |
|---|---|---|---|---|---|---|
| 01 | Gemini 1.5 Pro | unverified | 62.37 | 2025 | Source ↗ | Edit result |
| 02 | gemini-15-pro | paper | 62.37 | 2025 | Source ↗ | Edit result |
| 03 | Qwen2-VL 72B | verified | 53.78 | 2024 | Source ↗ | Edit result |
| 04 | GPT-4o | verified | 53.3 | 2024 | Source ↗ | Edit result |
| 05 | Claude 3.5 Sonnet | verified | 47.79 | 2024 | Source ↗ | Edit result |
| 06 | GOT-OCR2.0 | verified | 39.18 | 2024 | Source ↗ | Edit result |
| 07 | InternVL2-76B | verified | 35.33 | 2024 | Source ↗ | Edit result |