General OCR Capabilities2024multilingual

OCRBench v2

Tests 8 core OCR capabilities across 23 tasks. Evaluates LMMs on text recognition, referring, extraction.

Metrics:overall-en-private, overall-zh-private
Paper / Website
Current State of the Art

Seed1.6-vision

ByteDance

62.2

overall-en-private

Top Models Performance Comparison

Top 10 models ranked by overall-en-private

overall-en-private1Seed1.6-vision62.2100.0%2Qwen3-Omni-30B61.398.6%3Nemotron Nano V2 VL61.298.4%4Gemini 2.5 Pro59.395.3%5llama-3.1-nemotron-nano-v...56.490.7%6GPT-4o55.589.2%7ovis2.5-8b54.187.0%8gemini-1.5-pro51.683.0%9sail-vl2-8b49.379.3%10minicpm-v-4.5-8b48.477.8%0%25%50%75%100%% of best
Best Score
62.2
Top Model
Seed1.6-vision
Models Compared
10
Score Range
13.8

overall-en-privatePrimary

#ModelScorePaper / CodeDate
1
Seed1.6-visionAPI
ByteDance
62.2Dec 2025
2
Qwen3-Omni-30BOpen Source
Alibaba
61.3Dec 2025
3
Nemotron Nano V2 VLOpen Source
NVIDIA
61.2Dec 2025
4
Gemini 2.5 ProAPI
Google
59.3Dec 2025
5
llama-3.1-nemotron-nano-vl-8b
56.4Dec 2025
6
GPT-4oAPI
OpenAI
55.5Dec 2025
7
ovis2.5-8b
54.1Dec 2025
8
gemini-1.5-pro
51.6Dec 2025
9
sail-vl2-8b
49.3Dec 2025
10
minicpm-v-4.5-8b
48.4Dec 2025
11
gpt-4o-2024
47.6Dec 2025
12
claude-3.5-sonnet
47.5Dec 2025
13
internvl3.5-14b
47.1Dec 2025
14
step-1v
46.8Dec 2025
15
grok4
45Dec 2025
16
GPT-4o Mini
OpenAI
44.1Dec 2025
17
Claude Sonnet 4API
Anthropic
42.4Dec 2025
18
qwen2.5-vl-7b
41.8Dec 2025
19
deepseek-vl2-small
41Dec 2025
20
pixtral-12b
38.4Dec 2025
21
phi-4-multimodal
38.1Dec 2025
22
glm-4v-9b
37.1Dec 2025
23
molmo-7b
33.9Dec 2025
24
llava-ov-7b
33.7Dec 2025
25
idefics3-8b
26Dec 2025
26
mistral-ocr-2512
25.2Dec 2025
27
docowl2
23.4Dec 2025

overall-zh-private

#ModelScorePaper / CodeDate
1
Gemini 2.5 ProAPI
Google
62.2Dec 2025
2
minicpm-v-4.5-8b
58.8Dec 2025
3
sail-vl2-8b
57.6Dec 2025
4
claude-3.5-sonnet
48.4Dec 2025
5
gpt-4o-2024
45.7Dec 2025

Other General OCR Capabilities Datasets