Home/OCR/Benchmarks/OCRBench v2

OCRBench v2

South China University of Technology

Comprehensive benchmark evaluating 8 OCR capabilities across 23 tasks in 31 scenarios.

32
Total Results
27
Models Tested
2
Metrics
2025-12-21
Last Updated

Overall (English)

Average score on English private test set

Higher is better

RankModelScoreSource
1seed-1.6-vision

English, Private split. #1 on OCRBench v2

62.2alphaxiv-leaderboard
2qwen3-omni-30b61.3alphaxiv-leaderboard
3nemotron-nano-v2-vl61.2alphaxiv-leaderboard
4gemini-25-pro59.3alphaxiv-leaderboard
5llama-3.1-nemotron-nano-vl-8b56.4ocrbench-v2-leaderboard
6gpt-4o

Listed as GPT5-2025-08-07 on leaderboard

55.5alphaxiv-leaderboard
7ovis2.5-8b54.1ocrbench-v2-leaderboard
8gemini-1.5-pro51.6ocrbench-v2-leaderboard
9sail-vl2-8b49.3ocrbench-v2-leaderboard
10minicpm-v-4.5-8b48.4ocrbench-v2-leaderboard
11gpt-4o-2024

GPT-4o baseline (not GPT5-2025-08-07)

47.6ocrbench-v2-leaderboard
12claude-3.5-sonnet47.5ocrbench-v2-leaderboard
13internvl3.5-14b47.1ocrbench-v2-leaderboard
14step-1v46.8ocrbench-v2-leaderboard
15grok445ocrbench-v2-leaderboard
16gpt-4o-mini44.1ocrbench-v2-leaderboard
17claude-sonnet-4

Claude-sonnet-4-20250514

42.4ocrbench-v2-leaderboard
18qwen2.5-vl-7b41.8ocrbench-v2-leaderboard
19deepseek-vl2-small41ocrbench-v2-leaderboard
20pixtral-12b38.4ocrbench-v2-leaderboard
21phi-4-multimodal38.1ocrbench-v2-leaderboard
22glm-4v-9b37.1ocrbench-v2-leaderboard
23molmo-7b33.9ocrbench-v2-leaderboard
24llava-ov-7b33.7ocrbench-v2-leaderboard
25idefics3-8b26ocrbench-v2-leaderboard
26mistral-ocr-2512

Verified via CodeSOTA benchmark. 7,400 English samples. Mistral OCR is a pure OCR model (text extraction only) - not designed for VQA, chart parsing, or structured extraction tasks. Strong on full-page OCR (79.1%) and document parsing (55.2%).

25.2codesota-verified
27docowl223.4ocrbench-v2-leaderboard

Overall (Chinese)

Average score on Chinese private test set

Higher is better

RankModelScoreSource
1gemini-25-pro

Chinese, Private split. #1 on Chinese

62.2alphaxiv-leaderboard
2minicpm-v-4.5-8b

Chinese, Private split. #4 overall

58.8ocrbench-v2-leaderboard
3sail-vl2-8b57.6ocrbench-v2-leaderboard
4claude-3.5-sonnet48.4ocrbench-v2-leaderboard
5gpt-4o-202445.7ocrbench-v2-leaderboard

Explore More OCR Content