Codesota · Models · Gemini 1.5 ProGoogle20 results · 16 benchmarks
Model card

Gemini 1.5 Pro.

GoogleapiMultimodal LLMProprietary3 current SOTA

1M token context window. Released February 2024.

§ 01 · Benchmarks

Every benchmark Gemini 1.5 Pro has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01CC-OCRComputer Vision · General OCR Capabilitiesmultilingual-f179.0%#1/8source ↗
02CC-OCRComputer Vision · General OCR Capabilitiesdocument-parsing62.4%#1/6source ↗
03CC-OCRComputer Vision · General OCR Capabilitiesmulti-scene-f183.3%#1/9source ↗
04BIG-Bench HardReasoning · Multi-step Reasoningaccuracy89.2%#2/5source ↗
05CC-OCRComputer Vision · General OCR Capabilitieskie-f167.3%#2/5source ↗
06HellaSwagReasoning · Commonsense Reasoningaccuracy92.5%#2/5source ↗
07CNN/DailyMailNatural Language Processing · Text Summarizationrouge-145.8%#3/62024-02-15source ↗
08CNN/DailyMailNatural Language Processing · Text Summarizationrouge-l43.0%#3/62024-02-15source ↗
09SQuAD v2.0Natural Language Processing · Question Answeringf190.5%#3/222024-02-15source ↗
10VQA v2.0Multimodal · Visual Question Answeringaccuracy86.5%#3/72024-02-15source ↗
11TextVQAMultimodal · Visual Question Answeringaccuracy82.2%#5/92024-02-15source ↗
12MME-VideoOCRComputer Vision · General OCR Capabilitiestotal-accuracy64.9%#5/6source ↗
13MMBenchMultimodal · Visual Question Answeringaccuracy73.9%#7/82024-02-15source ↗
14ARC-ChallengeReasoning · Commonsense Reasoningaccuracy94.8%#9/10source ↗
15MMMUMultimodal · Visual Question Answeringaccuracy62.2%#15/182024-02-15source ↗
16GSM8KReasoning · Mathematical Reasoningaccuracy91.7%#27/32source ↗
17GPQAReasoning · Multi-step Reasoningaccuracy46.2%#31/33source ↗
18MATHReasoning · Mathematical Reasoningaccuracy67.7%#33/34source ↗
19MMLUReasoning · Commonsense Reasoningaccuracy85.9%#35/41source ↗
20HumanEvalComputer Code · Code Generationpass@171.9%#38/42source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Gemini 1.5 Pro actually performs.

Computer Vision
2
benchmarks
avg rank #2.0 · 3 SOTA
Natural Language Processing
2
benchmarks
avg rank #3.0
Multimodal
4
benchmarks
avg rank #7.5
Reasoning
7
benchmarks
avg rank #19.9
Computer Code
1
benchmark
avg rank #38.0
§ 03 · Papers

1 paper with results for Gemini 1.5 Pro.

  1. 2024-02-15· Natural Language Processing· 7 results

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

§ 04 · Related models

Other Google models scored on Codesota.

Gemini 2.5 Pro
16 results · 3 SOTA
Gemini 3 Pro
Undisclosed params · 13 results · 2 SOTA
Gemini 3.1 Pro
3 results · 1 SOTA
ViT-H/14
632M params · 2 results · 1 SOTA
CoCa (finetuned)
2.1B params · 1 result · 1 SOTA
Gemini 2.0 Flash
1 result · 1 SOTA
Gemini 3.1 Pro Preview
1 result · 1 SOTA
Noise2Music
Unknown params · 1 result · 1 SOTA
§ 05 · Sources & freshness

Where these numbers come from.

arxiv
7
results
alphaxiv-leaderboard
5
results
google-blog
4
results
openai-simple-evals
3
results
llm-stats-bbh
1
result
8 of 20 rows marked verified.