Codesota · Models · Gemini-3.1-Pro · Google · 11 results · 3 benchmarks
Model card

Gemini-3.1-Pro.

Google · API · 4 current SOTA

Imported from https://raw.githubusercontent.com/GAIR-NLP/AcademiClaw/main/README.md

§ 02 · Benchmarks

Every benchmark Gemini-3.1-Pro has a recorded score for.

# | Benchmark | Area · Task | Metric | Value | Rank | Date
01 | AcademiClaw | Agentic AI · Task agents | avg-time-sec | 822.00 | #1/5 | 2026-05-04
02 | AcademiClaw | Agentic AI · Task agents | avg-tokens-per-task-k | 2857.00 | #1/6 | 2026-05-04
03 | AcademiClaw | Agentic AI · Task agents | tool-calls-per-task | 57.0% | #1/6 | 2026-05-04
04 | MMMU-Pro | Multimodal · Visual Question Answering | accuracy | 82.0% | #1/31 | 2026-03-18
05 | AcademiClaw | Agentic AI · Task agents | pass | 43.8% | #3/6 | 2026-05-04
06 | React Native Evals | Mobile Development · React Native Code Generation | navigation-satisfaction | 94.4% | #4/10 | -
07 | AcademiClaw | Agentic AI · Task agents | avg-score | 64.3% | #5/6 | 2026-05-04
08 | React Native Evals | Mobile Development · React Native Code Generation | async-state-satisfaction | 80.8% | #5/10 | -
09 | React Native Evals | Mobile Development · React Native Code Generation | requirement-satisfaction | 78.9% | #5/10 | -
10 | AcademiClaw | Agentic AI · Task agents | safety-score | 74.9% | #6/6 | 2026-05-04
11 | React Native Evals | Mobile Development · React Native Code Generation | animation-satisfaction | 64.2% | #6/10 | -
The Rank column shows this model's position among all models scored on the same benchmark and metric (competitors after the slash). A rank of #1 means current SOTA. Rows are sorted by rank, then by newest result.
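The ordering described above can be sketched in a few lines of Python. This is only an illustration: the `Row` type, `parse_rank`, and `sort_key` are hypothetical names, not part of Codesota, and placing undated rows last within a rank is an assumption that happens to match the listing.

```python
from datetime import date
from typing import NamedTuple, Optional


class Row(NamedTuple):
    benchmark: str
    metric: str
    rank: str            # as displayed, e.g. "#3/6"
    day: Optional[date]  # None when the listing shows no date


def parse_rank(rank: str) -> tuple[int, int]:
    """Split '#3/6' into (position, number after the slash)."""
    position, after_slash = rank.lstrip("#").split("/")
    return int(position), int(after_slash)


def sort_key(row: Row) -> tuple[int, int]:
    position, _ = parse_rank(row.rank)
    # Negate the ordinal so newer dates come first within a rank;
    # undated rows get 0 and therefore sort after dated ones (assumption).
    recency = -row.day.toordinal() if row.day else 0
    return position, recency


rows = [
    Row("React Native Evals", "navigation-satisfaction", "#4/10", None),
    Row("MMMU-Pro", "accuracy", "#1/31", date(2026, 3, 18)),
    Row("AcademiClaw", "pass", "#3/6", date(2026, 5, 4)),
    Row("AcademiClaw", "avg-time-sec", "#1/5", date(2026, 5, 4)),
]
ordered = sorted(rows, key=sort_key)
```

With these four sample rows, the two #1 results come first (the 2026-05-04 one ahead of the 2026-03-18 one), followed by the #3 and #4 rows.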
§ 03 · Strengths by area

Where Gemini-3.1-Pro actually performs.

Agentic AI · 1 benchmark · avg rank #2.8 · 3 SOTA
Multimodal · 1 benchmark · avg rank #1.0 · 1 SOTA
Mobile Development · 1 benchmark · avg rank #5.0
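The per-area figures can be reproduced from the rank positions in the benchmark table. The sketch below assumes that "avg rank" is the plain arithmetic mean of those positions rounded to one decimal and that a SOTA is any #1 rank; the page does not state the method, and `summarize` is a hypothetical helper.

```python
from collections import defaultdict

# (area, rank position) pairs copied from the benchmark table, rows 01 to 11.
RESULTS = [
    ("Agentic AI", 1), ("Agentic AI", 1), ("Agentic AI", 1),
    ("Multimodal", 1),
    ("Agentic AI", 3),
    ("Mobile Development", 4),
    ("Agentic AI", 5),
    ("Mobile Development", 5), ("Mobile Development", 5),
    ("Agentic AI", 6),
    ("Mobile Development", 6),
]


def summarize(results):
    """Return {area: (average rank rounded to 1 decimal, count of #1 ranks)}."""
    by_area = defaultdict(list)
    for area, position in results:
        by_area[area].append(position)
    return {
        area: (round(sum(positions) / len(positions), 1), positions.count(1))
        for area, positions in by_area.items()
    }


summary = summarize(RESULTS)
```

Under that assumption the summary matches the listing: Agentic AI averages 17/6, which rounds to 2.8, with three #1 ranks; Multimodal is 1.0 with one; Mobile Development averages 20/4 = 5.0 with none.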
§ 04 · Papers

1 paper with results for Gemini-3.1-Pro.

  1. 2026-05-04 · Agentic AI · 6 results

    AcademiClaw: When Students Set Challenges for AI Agents

    Junjie Yu, Pengrui Lu, Weiye Si, Hongliang Lu et al.
§ 05 · Related models

Other Google models scored on Codesota.

Gemini 2.5 Pro · 16 results · 2 SOTA
Gemini 1.5 Pro · 15 results · 1 SOTA
Gemini 3 Pro · Undisclosed params · 12 results · 1 SOTA
ViT-H/14 · 632M params · 2 results · 1 SOTA
CoCa (finetuned) · 2.1B params · 1 result · 1 SOTA
Gemini 2.0 Flash · 1 result · 1 SOTA
Noise2Music · Unknown params · 1 result · 1 SOTA
Gemini 3 Flash · Undisclosed params · 6 results
§ 06 · Sources & freshness

Where these numbers come from.

paper · 6 results
Callstack Incubator · 4 results
artificialanalysis.ai · 1 result
11 of 11 rows marked verified · first result 2026-03-18, latest 2026-05-04.