Codesota · Models · Gemini 1.5 Pro · Google · 21 results · 17 benchmarks
Model card

Gemini 1.5 Pro.

Google · API · Multimodal LLM · Proprietary · 3 current SOTA

1M token context window. Released February 2024.

§ 02 · Benchmarks

Every benchmark Gemini 1.5 Pro has a recorded score for.

| # | Benchmark | Area · Task | Metric | Value | Rank | Date |
|---|---|---|---|---|---|---|
| 01 | CC-OCR | Computer Vision · General OCR Capabilities | multilingual-f1 | 79.0% | #1/8 | |
| 02 | CC-OCR | Computer Vision · General OCR Capabilities | document-parsing | 62.4% | #1/6 | |
| 03 | CC-OCR | Computer Vision · General OCR Capabilities | multi-scene-f1 | 83.3% | #1/9 | |
| 04 | BIG-Bench Hard | Reasoning · Multi-step Reasoning | accuracy | 89.2% | #2/11 | |
| 05 | CC-OCR | Computer Vision · General OCR Capabilities | kie-f1 | 67.3% | #2/5 | |
| 06 | HellaSwag | Reasoning · Commonsense Reasoning | accuracy | 92.5% | #2/17 | |
| 07 | CNN/DailyMail | Natural Language Processing · Text Summarization | rouge-1 | 45.8% | #3/6 | 2024-02-15 |
| 08 | CNN/DailyMail | Natural Language Processing · Text Summarization | rouge-l | 43.0% | #3/7 | 2024-02-15 |
| 09 | VQA v2.0 | Multimodal · Visual Question Answering | accuracy | 86.5% | #3/16 | 2024-02-15 |
| 10 | SQuAD v2.0 | Natural Language Processing · Question Answering | f1 | 90.5% | #4/26 | 2024-02-15 |
| 11 | MME-VideoOCR | Computer Vision · General OCR Capabilities | total-accuracy | 64.9% | #5/6 | |
| 12 | ARC-Challenge | Reasoning · Commonsense Reasoning | accuracy | 94.8% | #9/10 | |
| 13 | TextVQA | Multimodal · Visual Question Answering | accuracy | 82.2% | #12/23 | 2024-02-15 |
| 14 | MMBench | Multimodal · Visual Question Answering | accuracy | 73.9% | #19/20 | 2024-02-15 |
| 15 | MMMU | Multimodal · Visual Question Answering | accuracy | 62.2% | #21/30 | 2024-02-15 |
| 16 | GSM8K | Reasoning · Mathematical Reasoning | accuracy | 91.7% | #34/48 | |
| 17 | HumanEval | Computer Code · Code Generation | pass@1 | 71.9% | #38/42 | |
| 18 | MATH | Reasoning · Mathematical Reasoning | accuracy | 67.7% | #38/46 | |
| 19 | MMLU | Reasoning · Commonsense Reasoning | accuracy | 85.9% | #42/64 | |
| 20 | HLE | Reasoning · Multi-step Reasoning | accuracy | 4.6% | #67/74 | |
| 21 | GPQA Diamond | Reasoning · Multi-step Reasoning | accuracy | 46.2% | #69/74 | |
The Rank column shows this model's position among all models scored on the same benchmark and metric, with the number of competitors after the slash. A rank of #1 marks the current SOTA. Rows are sorted by rank, then by newest result.
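The rank notation can be read mechanically. The following is a minimal sketch, assuming every rank string has the form `#position/count`; `Rank`, `parse_rank`, and `is_sota` are hypothetical helper names introduced for illustration and are not part of any Codesota tooling. The sample rows reuse three rank values from the table.

```python
import re
from typing import NamedTuple


class Rank(NamedTuple):
    position: int  # this model's place on the benchmark + metric
    count: int     # the number shown after the slash


def parse_rank(text: str) -> Rank:
    """Parse a rank string such as '#1/8' into its two integers."""
    match = re.fullmatch(r"#(\d+)/(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized rank string: {text!r}")
    return Rank(int(match.group(1)), int(match.group(2)))


def is_sota(text: str) -> bool:
    """A rank of #1 marks the current SOTA."""
    return parse_rank(text).position == 1


# Three rows from the table, deliberately out of order.
rows = [
    ("MMLU", "#42/64"),
    ("CC-OCR multilingual-f1", "#1/8"),
    ("HellaSwag", "#2/17"),
]

# Sort by rank position, as the table does.
ordered = sorted(rows, key=lambda row: parse_rank(row[1]).position)
for name, rank in ordered:
    marker = " (SOTA)" if is_sota(rank) else ""
    print(f"{rank:>7}  {name}{marker}")
```

The sketch sorts on rank position only; the secondary "newest result" ordering would need the Date column, which is empty for most rows.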
§ 03 · Strengths by area

Where Gemini 1.5 Pro actually performs.

Computer Vision · 2 benchmarks · avg rank #2.0 · 3 SOTA
Natural Language Processing · 2 benchmarks · avg rank #3.3
Multimodal · 4 benchmarks · avg rank #13.8
Reasoning · 8 benchmarks · avg rank #32.9
Computer Code · 1 benchmark · avg rank #38.0
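
The per-area averages can be reproduced from the rank positions in the benchmark table. The sketch below assumes the average is taken over result rows (one per benchmark and metric pair) rather than over distinct benchmarks; that assumption is inferred from the numbers, not stated by the page. `RESULTS` and `average_rank_by_area` are hypothetical names used only for this illustration.

```python
from collections import defaultdict

# (area, rank position) for each of the 21 rows in the benchmark table.
RESULTS = [
    ("Computer Vision", 1), ("Computer Vision", 1), ("Computer Vision", 1),
    ("Computer Vision", 2), ("Computer Vision", 5),
    ("Natural Language Processing", 3), ("Natural Language Processing", 3),
    ("Natural Language Processing", 4),
    ("Multimodal", 3), ("Multimodal", 12), ("Multimodal", 19), ("Multimodal", 21),
    ("Reasoning", 2), ("Reasoning", 2), ("Reasoning", 9), ("Reasoning", 34),
    ("Reasoning", 38), ("Reasoning", 42), ("Reasoning", 67), ("Reasoning", 69),
    ("Computer Code", 38),
]


def average_rank_by_area(results):
    """Return the mean rank position per area, averaged over rows."""
    ranks = defaultdict(list)
    for area, position in results:
        ranks[area].append(position)
    return {area: sum(values) / len(values) for area, values in ranks.items()}


averages = average_rank_by_area(RESULTS)
for area, avg in averages.items():
    print(f"{area}: avg rank #{avg:.1f}")
```

Under this assumption the Computer Vision figure comes from five rows (ranks 1, 1, 1, 2, 5) across two benchmarks, which is why the benchmark count and the number of averaged ranks differ.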
§ 04 · Papers

1 paper with results for Gemini 1.5 Pro.

  1. 2024-02-15 · Natural Language Processing · 7 results

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

§ 05 · Related models

Other Google models scored on Codesota.

Gemini 2.5 Pro · 16 results · 2 SOTA
Gemini-3.1-Pro · 7 results · 2 SOTA
Gemini 3 Pro · Undisclosed params · 12 results · 1 SOTA
ViT-H/14 · 632M params · 2 results · 1 SOTA
CoCa (finetuned) · 2.1B params · 1 result · 1 SOTA
Gemini 2.0 Flash · 1 result · 1 SOTA
Noise2Music · Unknown params · 1 result · 1 SOTA
Gemini 3 Flash · Undisclosed params · 6 results
§ 06 · Sources & freshness

Where these numbers come from.

arxiv · 7 results
alphaxiv-leaderboard · 5 results
google-blog · 4 results
openai-simple-evals · 3 results
llm-stats-bbh · 1 result
scale-hle-official · 1 result
9 of 21 rows marked verified.
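
As a quick consistency check, the per-source counts should add up to the 21 rows in the benchmark table. This is a minimal sketch using only the figures listed in this section; the variable names are illustrative.

```python
# Result counts per source, as listed in this section.
SOURCE_COUNTS = {
    "arxiv": 7,
    "alphaxiv-leaderboard": 5,
    "google-blog": 4,
    "openai-simple-evals": 3,
    "llm-stats-bbh": 1,
    "scale-hle-official": 1,
}
TOTAL_ROWS = 21     # rows in the benchmark table
VERIFIED_ROWS = 9   # rows marked verified

total = sum(SOURCE_COUNTS.values())
verified_share = VERIFIED_ROWS / TOTAL_ROWS

print(f"{total} results across {len(SOURCE_COUNTS)} sources")
print(f"verified: {verified_share:.1%}")  # prints "verified: 42.9%"
```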