Codesota · Models · Gemini 2.5 ProGoogle16 results · 15 benchmarks
Model card

Gemini 2.5 Pro.

GoogleapiMultimodal LLMProprietary3 current SOTA

#1 on OCRBench v2 Chinese, MME-VideoOCR

§ 01 · Benchmarks

Every benchmark Gemini 2.5 Pro has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01OCRBench v2Computer Vision · General OCR Capabilitiesoverall-zh-private62.2%#1/52025-03-25source ↗
02ARC-AGI-2Reasoning · Logical Reasoningaccuracy5.0%#1/3source ↗
03MME-VideoOCRComputer Vision · General OCR Capabilitiestotal-accuracy73.7%#1/6source ↗
04AIME 2025Reasoning · Mathematical Reasoningaccuracy86.7%#2/5source ↗
05ARC-ChallengeReasoning · Commonsense Reasoningaccuracy97.8%#2/10source ↗
06ThaiOCRBenchComputer Vision · Optical Character Recognitionted-score0.8%#2/5source ↗
07AIME 2024Reasoning · Mathematical Reasoningaccuracy92.0%#3/8source ↗
08GSM8KReasoning · Mathematical Reasoningaccuracy99.0%#3/32source ↗
09OCRBench v2Computer Vision · General OCR Capabilitiesoverall-en-private59.3%#4/272025-03-25source ↗
10ARC-AGI-1Reasoning · Logical Reasoningaccuracy56.1%#4/5source ↗
11MATHReasoning · Mathematical Reasoningaccuracy97.3%#6/34source ↗
12GPQAReasoning · Multi-step Reasoningaccuracy84.0%#7/33source ↗
13OmniDocBenchComputer Vision · Document Parsingcomposite88.0%#13/33source ↗
14MMLUReasoning · Commonsense Reasoningaccuracy89.8%#16/412025-06-17source ↗
15SWE-Bench VerifiedComputer Code · Code Generationresolve-rate63.8%#25/39source ↗
16SWE-bench VerifiedAgentic AI · SWE-benchresolve-rate63.2%#53/81source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Gemini 2.5 Pro actually performs.

Computer Vision
4
benchmarks
avg rank #4.2 · 2 SOTA
Reasoning
9
benchmarks
avg rank #4.9 · 1 SOTA
Computer Code
1
benchmark
avg rank #25.0
Agentic AI
1
benchmark
avg rank #53.0
§ 04 · Related models

Other Google models scored on Codesota.

Gemini 3 Pro
Undisclosed params · 13 results · 2 SOTA
Gemini 1.5 Pro
12 results · 1 SOTA
Gemini 3.1 Pro
3 results · 1 SOTA
ViT-H/14
632M params · 2 results · 1 SOTA
CoCa (finetuned)
2.1B params · 1 result · 1 SOTA
Gemini 2.0 Flash
1 result · 1 SOTA
Gemini 3.1 Pro Preview
1 result · 1 SOTA
Noise2Music
Unknown params · 1 result · 1 SOTA
§ 05 · Sources & freshness

Where these numbers come from.

google-technical-report
7
results
alphaxiv-leaderboard
4
results
arcprize-leaderboard
1
result
artificialanalysis
1
result
AlphaXiv
1
result
google-blog
1
result
editorial
1
result
9 of 16 rows marked verified. · first result 2025-03-25, latest 2025-06-17.