General OCR Capabilities

Comprehensive benchmarks covering multiple aspects of OCR performance.

Datasets: 4
Results: 52
Canonical metric: overall-en-private

Canonical Benchmark

OCRBench v2

Tests 8 core OCR capabilities across 23 tasks. Evaluates large multimodal models (LMMs) on capabilities including text recognition, referring, and extraction.

Primary metric: overall-en-private

Top 10

Leading models on OCRBench v2.

Rank  Model                          overall-en-private  Year  Source
1     seed-1.6-vision                62.2                2025  paper
2     gemini-25-pro                  62.2                2025  paper
3     qwen3-omni-30b                 61.3                2025  paper
4     nemotron-nano-v2-vl            61.2                2025  paper
5     Qianfan-OCR                    60.8                2026  paper
6     gemini-25-pro                  59.3                2025  paper
7     minicpm-v-4.5-8b               58.8                2025  paper
8     sail-vl2-8b                    57.6                2025  paper
9     llama-3.1-nemotron-nano-vl-8b  56.4                2025  paper
10    Qianfan-OCR                    56.0                2026  paper

All datasets

4 datasets tracked for this task.
