Codesota · OCR · Benchmarks · vqa-v2Home/OCR/Benchmarks/vqa-v2
Unknown

vqa-v2.

OCR benchmark

§ 01 · accuracy

accuracy.

Higher is better

#ModelScoreSource
Qwen2-VL 72B
Fetched from CodeSOTA API on 2026-04-20
87.6codesota-api
2
InternVL2-76B
Fetched from CodeSOTA API on 2026-04-20
87.2codesota-api
3
Gemini 1.5 Pro
Fetched from CodeSOTA API on 2026-04-20
86.5codesota-api
4
PaLI-X 55B
Fetched from CodeSOTA API on 2026-04-20
86.1codesota-api
5
NVLM-D 1.0 72B
Fetched from CodeSOTA API on 2026-04-20
85.4codesota-api
6
NVLM-X 1.0 72B
Fetched from CodeSOTA API on 2026-04-20
85.2codesota-api
7
NVLM-H 1.0 72B
Fetched from CodeSOTA API on 2026-04-20
85.2codesota-api
8
VILA-1.5 40B
Fetched from CodeSOTA API on 2026-04-20
84.3codesota-api
9
LLaVA-NeXT 34B
Fetched from CodeSOTA API on 2026-04-20
83.7codesota-api
10
LLaVA-NeXT 13B
Fetched from CodeSOTA API on 2026-04-20
82.8codesota-api
11
CogVLM-17B
Fetched from CodeSOTA API on 2026-04-20
82.3codesota-api
12
LLaVA-NeXT 7B (Mistral)
Fetched from CodeSOTA API on 2026-04-20
82.2codesota-api
13
BLIP-2
Fetched from CodeSOTA API on 2026-04-20
82.19codesota-api
14
LLaVA-NeXT 7B (Vicuna)
Fetched from CodeSOTA API on 2026-04-20
81.8codesota-api
15
Pixtral Large
Fetched from CodeSOTA API on 2026-04-20
80.9codesota-api
16
Llama 3-V 405B
Fetched from CodeSOTA API on 2026-04-20
80.2codesota-api
17
LLaVA-1.5 13B
Fetched from CodeSOTA API on 2026-04-20
80codesota-api
18
LLaVA-1.5
Fetched from CodeSOTA API on 2026-04-20
80codesota-api
19
Llama 3-V 70B
Fetched from CodeSOTA API on 2026-04-20
79.1codesota-api
20
Pixtral-12B
Fetched from CodeSOTA API on 2026-04-20
78.6codesota-api
21
GPT-4o
Fetched from CodeSOTA API on 2026-04-20
78.5codesota-api
22
Llama 3.2 90B Vision Instruct
Fetched from CodeSOTA API on 2026-04-20
78.1codesota-api
23
GPT-4V
Fetched from CodeSOTA API on 2026-04-20
77.2codesota-api
§ Related · Explore

More OCR content.

Verified Model Reviews
Comparisons & Guides
View all OCR benchmarks → Back to All Benchmarks