Codesota · Tasks · General OCR Capabilities

General OCR Capabilities.

Comprehensive benchmarks covering multiple aspects of OCR performance.

4 datasets · 70 results · Canonical metric: overall-en-private
§ 02 · Canonical benchmark

The reference dataset.

OCRBench v2

Tests 8 core OCR capabilities across 23 tasks. Evaluates large multimodal models (LMMs) on capabilities including text recognition, referring, and extraction.

Primary metric: overall-en-private
§ 03 · Top 10

Leading models.

Leading models on OCRBench v2.

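| #  | Model               | english-score | Year | Source  |
|----|---------------------|---------------|------|---------|
| 1  | Ovis2.5-9B          | 63.4          | 2025 | paper ↗ |
| 2  | Gemini 2.5 Pro      | 62.2          | 2025 | paper ↗ |
| 3  | Seed1.6-vision      | 62.2          | 2025 | paper ↗ |
| 4  | Qwen3-Omni-30B      | 61.3          | 2025 | paper ↗ |
| 5  | Nemotron Nano V2 VL | 61.2          | 2025 | paper ↗ |
| 6  | Intern-S1-Pro       | 60.6          | 2026 | paper ↗ |
| 7  | Intern-S1-Pro       | 60.1          | 2026 | paper ↗ |
| 8  | Gemini 2.5 Pro      | 59.3          | 2025 | paper ↗ |
| 9  | minicpm-v-4.5-8b    | 58.8          | 2025 | paper ↗ |
| 10 | Ovis2.5-9B          | 58.0          | 2025 | paper ↗ |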
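
Several model names appear more than once in the Top 10 (Ovis2.5-9B, Gemini 2.5 Pro, Intern-S1-Pro), and the page does not state what distinguishes the repeated entries. For readers who want one row per model, the following is a minimal sketch that keeps each model's highest listed english-score. The rows are transcribed from the listing above. The helper name `best_per_model` is hypothetical and not part of any Codesota tooling.

```python
# Rows transcribed from the Top 10 listing: (model, english-score, year).
rows = [
    ("Ovis2.5-9B", 63.4, 2025),
    ("Gemini 2.5 Pro", 62.2, 2025),
    ("Seed1.6-vision", 62.2, 2025),
    ("Qwen3-Omni-30B", 61.3, 2025),
    ("Nemotron Nano V2 VL", 61.2, 2025),
    ("Intern-S1-Pro", 60.6, 2026),
    ("Intern-S1-Pro", 60.1, 2026),
    ("Gemini 2.5 Pro", 59.3, 2025),
    ("minicpm-v-4.5-8b", 58.8, 2025),
    ("Ovis2.5-9B", 58.0, 2025),
]


def best_per_model(rows):
    """Hypothetical helper: keep the highest score listed for each model."""
    best = {}
    for model, score, year in rows:
        # Replace the stored entry only when this row scores strictly higher.
        if model not in best or score > best[model][0]:
            best[model] = (score, year)
    # Sort by score, highest first; tied scores keep their listing order.
    return sorted(best.items(), key=lambda item: item[1][0], reverse=True)


unique = best_per_model(rows)
for rank, (model, (score, year)) in enumerate(unique, start=1):
    print(f"{rank:>2}  {model:<20} {score:>5.1f}  {year}")
```

Collapsing duplicates this way discards whatever setting or source distinguishes the repeated entries, so it suits a quick overview rather than a like-for-like comparison.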

§ 04 · All datasets

Tracked datasets.

4 datasets tracked for this task.

OCRBench v2 (CANONICAL) · 36 results · overall-en-private · Top: Ovis2.5-9B 63.4
CC-OCR · 28 results · multi-scene-f1 · Top: Gemini 1.5 Pro 83.3
MME-VideoOCR · 6 results · total-accuracy · Top: Gemini 2.5 Pro 73.7
reVISION · 0 results · accuracy
§ 05 · Related tasks

Other tasks in Computer Vision.

Document Image Classification · Document Layout Analysis · Document Parsing · Document Understanding · Handwriting Recognition · Image Feature Extraction · Image-to-3D · Image-to-Image