Who leads the BrowseComp benchmark?

DeepSeek-V4-Pro Max currently leads BrowseComp with a score of 83.4 on Accuracy.

What is the state-of-the-art score on BrowseComp?

The state-of-the-art result on BrowseComp is 83.4 (Accuracy), achieved by DeepSeek-V4-Pro Max as of 2026.

How many models are tracked on BrowseComp?

Codesota tracks 16 models on BrowseComp.

When was the BrowseComp leaderboard last updated?

The BrowseComp leaderboard on Codesota includes results through 2026, with the earliest tracked result from 2025.



BrowseComp.

Name: BrowseComp Benchmark Results
Creator: Unknown
Published: 2025-01-01
License: https://creativecommons.org/licenses/by/4.0/

Hard web-browsing QA benchmark with short factual answers that require persistent search over many online sources.


§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.


Accuracy

Accuracy is the reported evaluation metric for BrowseComp. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Accuracy: verified, paper, vendor, community, unverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links
01	DeepSeek-V4-Pro Max	unverified	83.4	2026	Paper, Code
02	Kimi K2.6	unverified	83.2	2026	Paper
03	MiniMax-M2.5	unverified	76.3	2026	Paper, Code
04	DeepSeek-V4-Flash Max	unverified	73.2	2026	Paper, Code
05	Qwen3.5-397B-A17B	unverified	69	2026	Paper, Code
06	GLM-5.1	unverified	68	2026	Paper, Code
07	Qwen3.5-122B-A10B	unverified	63.8	2026	Paper, Code, Source
08	GLM-5	unverified	62	2026	Paper, Code, Source
09	Qwen3.5-35B-A3B	unverified	61	2026	Paper, Code, Source
10	Qwen3.5-27B	unverified	61	2026	Paper, Code, Source
11	Kimi-K2.5	unverified	60.6	2026	Paper, Code
12	Step-3.5-Flash	unverified	51.6	2026	Paper, Code
13	DeepSeek-V3.2	unverified	51.4	2025	Paper, Source
14	NVIDIA-Nemotron-3-Super-120B-A12B-BF16	unverified	31.28	2025	Paper, Source
15	GLM-4.5	unverified	26.4	2025	Paper, Code
16	GLM-4.5-Air	unverified	21.3	2025	Paper, Code, Source
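
The muting rule stated above the table (a row is muted when an earlier or same-year result already scored better) can be checked against the listed scores. The following is a minimal sketch, not Codesota's actual implementation: the names `Result`, `is_muted`, and `sota_rows` are hypothetical, and it assumes comparison at year granularity with ties left unmuted, since the table gives only a year for each result.

```python
from typing import NamedTuple


class Result(NamedTuple):
    """One leaderboard row (hypothetical structure for illustration)."""
    model: str
    score: float
    year: int


# Scores and years copied from the Accuracy table above.
RESULTS = [
    Result("DeepSeek-V4-Pro Max", 83.4, 2026),
    Result("Kimi K2.6", 83.2, 2026),
    Result("MiniMax-M2.5", 76.3, 2026),
    Result("DeepSeek-V4-Flash Max", 73.2, 2026),
    Result("Qwen3.5-397B-A17B", 69.0, 2026),
    Result("GLM-5.1", 68.0, 2026),
    Result("Qwen3.5-122B-A10B", 63.8, 2026),
    Result("GLM-5", 62.0, 2026),
    Result("Qwen3.5-35B-A3B", 61.0, 2026),
    Result("Qwen3.5-27B", 61.0, 2026),
    Result("Kimi-K2.5", 60.6, 2026),
    Result("Step-3.5-Flash", 51.6, 2026),
    Result("DeepSeek-V3.2", 51.4, 2025),
    Result("NVIDIA-Nemotron-3-Super-120B-A12B-BF16", 31.28, 2025),
    Result("GLM-4.5", 26.4, 2025),
    Result("GLM-4.5-Air", 21.3, 2025),
]


def is_muted(row: Result, rows: list[Result]) -> bool:
    """A row is muted if some earlier or same-year result scored strictly higher.

    Assumption: ties do not mute a row; the page does not say how ties are handled.
    """
    return any(o.year <= row.year and o.score > row.score for o in rows)


def sota_rows(rows: list[Result]) -> list[Result]:
    """Return the rows that were state of the art when published, in input order."""
    return [r for r in rows if not is_muted(r, rows)]


if __name__ == "__main__":
    for r in sota_rows(RESULTS):
        print(f"{r.year}  {r.model}  {r.score}")
```

Under these assumptions, only the best result of each year that also beats every earlier year stays unmuted. With finer-grained publication dates the outcome could differ, because a model released early in a year might have been state of the art before a later same-year result overtook it.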
