Who leads the CC-OCR benchmark?

Gemini 1.5 Pro currently leads CC-OCR with a score of 83.25 on Multi-Scene F1.

What is the state-of-the-art score on CC-OCR?

The state-of-the-art result on CC-OCR is 83.25 (Multi-Scene F1), achieved by Gemini 1.5 Pro as of 2025.

How many models are tracked on CC-OCR?

Codesota tracks 13 models on CC-OCR across 4 metrics.

When was the CC-OCR leaderboard last updated?

The CC-OCR leaderboard on Codesota includes results through 2025, with the earliest tracked result from 2024.

Codesota · Benchmark · CC-OCRHome/Leaderboards/CC-OCR

South China University of Technology

CC-OCR.

Name: CC-OCR Benchmark Results
Creator: South China University of Technology
Published: 2024-01-01
License: https://creativecommons.org/licenses/by/4.0/

Multi-scene text reading, key information extraction, multilingual text, and document parsing benchmark.

Paper ↗Leaderboard ↓

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?

Use row edits to send a sourced correction into moderation.

Add / edit result ↗Report issue ↗

Multi-Scene F1

Multi Scene F1 is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Multi-Scene F1verifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	Gemini 1.5 Pro Multi-Scene Text Reading - Overall F1 score	unverified	83.25	2025	Source ↗	Looks wrong?
02	gemini-15-pro Multi-Scene Text Reading - Overall F1 score	paper	83.25	2025	Source ↗	Looks wrong?
03	qwen2-vl-72b	paper	77.95	2025	Source ↗	Looks wrong?
04	Qwen2-VL 72B	unverified	77.95	2025	Source ↗	Looks wrong?
05	InternVL2-76B	unverified	76.92	2025	Source ↗	Looks wrong?
06	gpt-4o	paper	76.4	2025	Source ↗	Looks wrong?
07	Claude 3.5 Sonnet	unverified	72.87	2025	Source ↗	Looks wrong?
08	claude-35-sonnet	paper	72.87	2025	Source ↗	Looks wrong?
09	GOT-OCR2.0	verified	61	2024	Source ↗	Looks wrong?
10	TextMonkey	verified	56.88	2024	Source ↗	Looks wrong?
11	Florence-2-Large	verified	49.24	2024	Source ↗	Looks wrong?
12	KOSMOS-2.5	verified	47.55	2024	Source ↗	Looks wrong?

Multilingual F1

Multilingual F1 is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Multilingual F1verifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	gemini-15-pro Multilingual Text Reading - 10 languages	paper	78.97	2025	Source ↗	Looks wrong?
02	Gemini 1.5 Pro Multilingual Text Reading - 10 languages	unverified	78.97	2025	Source ↗	Looks wrong?
03	gpt-4o	paper	73.44	2025	Source ↗	Looks wrong?
04	Qwen2-VL 72B Multilingual Text Reading - 10 languages	verified	71.14	2024	Source ↗	Looks wrong?
05	Claude 3.5 Sonnet Multilingual Text Reading - 10 languages	verified	65.68	2024	Source ↗	Looks wrong?
06	Florence-2-Large Multilingual Text Reading - 10 languages	verified	49.7	2024	Source ↗	Looks wrong?
07	InternVL2-76B Multilingual Text Reading - 10 languages	verified	46.57	2024	Source ↗	Looks wrong?
08	KOSMOS-2.5 Multilingual Text Reading - 10 languages	verified	36.23	2024	Source ↗	Looks wrong?
09	GOT-OCR2.0 Multilingual Text Reading - 10 languages	verified	24.95	2024	Source ↗	Looks wrong?

KIE F1

Kie F1 is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for KIE F1verifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	qwen2-vl-72b Key Information Extraction - Overall F1 score	paper	71.76	2025	Source ↗	Looks wrong?
02	Qwen2-VL 72B Key Information Extraction - Overall F1 score	unverified	71.76	2025	Source ↗	Looks wrong?
03	gemini-15-pro	paper	67.28	2025	Source ↗	Looks wrong?
04	Gemini 1.5 Pro	unverified	67.28	2025	Source ↗	Looks wrong?
05	Claude 3.5 Sonnet	unverified	64.58	2025	Source ↗	Looks wrong?
06	claude-35-sonnet	paper	64.58	2025	Source ↗	Looks wrong?
07	GPT-4o	unverified	63.45	2025	Source ↗	Looks wrong?
08	InternVL2-76B Key Information Extraction - Overall F1 score	verified	61.6	2024	Source ↗	Looks wrong?

Document Parsing

Document Parsing is the reported evaluation metric for CC-OCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Document Parsingverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	Gemini 1.5 Pro Document Parsing - Average Score	unverified	62.37	2025	Source ↗	Looks wrong?
02	gemini-15-pro Document Parsing - Average Score	paper	62.37	2025	Source ↗	Looks wrong?
03	Qwen2-VL 72B Document Parsing - Average Score	verified	53.78	2024	Source ↗	Looks wrong?
04	GPT-4o Document Parsing - Average Score	verified	53.3	2024	Source ↗	Looks wrong?
05	Claude 3.5 Sonnet Document Parsing - Average Score	verified	47.79	2024	Source ↗	Looks wrong?
06	GOT-OCR2.0 Document Parsing - Average Score	verified	39.18	2024	Source ↗	Looks wrong?
07	InternVL2-76B Document Parsing - Average Score	verified	35.33	2024	Source ↗	Looks wrong?

§ 04 · Submit a result

Add to the leaderboard.

← Back to Leaderboards