Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overview speakleash · 253 OpenAI · 85 Google · 71 Qwen · 52 Alibaba · 47 Anthropic · 44 Microsoft · 35 Meta · 30 Mistral · 30 DeepSeek · 28 google · 19 meta-llama · 19 mistralai · 19 Meta AI · 15 CYFRAGOVPL · 14 Zhipu AI · 13 NVIDIA · 10 SpeakLeash · 10 internlm · 10 xAI · 10 ByteDance · 9 Baidu · 8 PLLuM · 8 ibm-granite · 8 microsoft · 8 Amazon · 7 Google DeepMind · 7 MiniMax · 7 Mistral AI · 7 Remek · 7 Shanghai AI Lab · 7 allenai · 7 utter-project · 7 CohereForAI · 6 Microsoft Research · 6 Salesforce · 6 01-ai · 5 Alibaba Cloud · 5 Cohere · 5 Moonshot AI · 5 NousResearch · 5 THUML · 5 deepseek-ai · 5 DeepMind · 4 Facebook AI · 4 IBM · 4 Meituan · 4 Stanford · 4 THUDM · 4 UC San Diego · 4 VikParuchuri · 4 gguf-iq · 4 nvidia · 4 openchat · 4 tiiuae · 4 Allen AI · 3 BAAI · 3 Du et al. · 3 ForgeCode · 3 Fudan University · 3 IDEA Research · 3 Liao et al. · 3 Moonshot.AI · 3 Nam Tuan Ly / NII · 3 OPI-PG · 3 OpenDataLab · 3 ViCoS Lab Ljubljana · 3 Xiaomi · 3 Zhao et al. · 3 gguf · 3 gguf11bv30 · 3 gguf7bv30 · 3 upstage · 3+ 247 smaller vendors (291 models)

§ 01 · Agentic AI models

164 models in Agentic AI · page 3 of 4.

#	Model	Vendor	Parameters	Architecture	SOTA	Benchmarks	Results
101	Capy / Claude Opus 4.6	Capy	—	—	—	1	1
102	Claude Code (Haiku 4.5)	Anthropic	—	—	—	1	1
103	Claude Code (Opus 4.7)	Anthropic	—	—	—	1	1
104	Claude Code (Sonnet 4.6)	Anthropic	—	—	—	1	1
105	CoAct-1	Salesforce	—	—	—	1	1
106	CodeBrain-1 / GPT-5.3-Codex	CodeBrain	—	—	—	1	1
107	Codex CLI (GPT-5.4-mini)	OpenAI	—	—	—	1	1
108	Crux / Claude Opus 4.6	Crux	—	—	—	1	1
109	DeepSeek-V3.2 (Thinking)	DeepSeek	—	—	—	1	1
110	DeepSeek-V3.2-Exp	DeepSeek	—	—	—	1	1
111	DeepSeek-V3.2-Speciale	DeepSeek	—	—	—	1	1
112	DeepSeek-V4-Pro	DeepSeek-AI	—	—	—	1	1
113	Devstral Medium	Mistral AI	—	—	—	1	1
114	Devstral Small 1.1	Mistral AI	—	—	—	1	1
115	Droid / Claude Opus 4.6	Droid	—	—	—	1	1
116	Droid / GPT-5.3-Codex	Droid	—	—	—	1	1
117	ForgeCode (DeepSeek-V4)	DeepSeek	—	—	—	1	1
118	ForgeCode / Claude Opus 4.6	ForgeCode	—	—	—	1	1
119	ForgeCode / GPT-5.4	ForgeCode	—	—	—	1	1
120	ForgeCode / Gemini 3.1 Pro	ForgeCode	—	—	—	1	1
121	GLM-4.6	Zhipu AI	—	—	—	1	1
122	GLM-4.7	Zhipu AI	—	—	—	1	1
123	GLM-4.7-Flash	Zhipu AI	—	—	—	1	1
124	GLM-5V-Turbo	—	—	—	—	1	1
125	GPT-5.1 Codex	OpenAI	—	—	—	1	1
126	GTA1 (7B)	Salesforce	—	—	—	1	1
127	Gemini 2.5 Flash-Lite	Google	—	—	—	1	1
128	Gemini 2.5 Pro Preview	Google	—	—	—	1	1
129	Gemini Diffusion	Google	—	—	—	1	1
130	Grok Code Fast 1	xAI	—	—	—	1	1
131	IndusAGI Coding Agent / GPT-5.3-Codex	IndusAGI	—	—	—	1	1
132	JEDI-7B with o3 planner	—	—	—	—	1	1
133	Junie CLI / Multiple	JetBrains	—	—	—	1	1
134	Kimi K2-Instruct-0905	Moonshot AI	—	—	—	1	1
135	LongCat-Flash-Chat	Meituan	—	—	—	1	1
136	LongCat-Flash-Lite	Meituan	—	—	—	1	1
137	LongCat-Flash-Thinking	Meituan	—	—	—	1	1
138	LongCat-Flash-Thinking-2601	Meituan	—	—	—	1	1
139	MAYA-V2 / Claude 4.6 Opus	MAYA	—	—	—	1	1
140	MemOS	Unknown	—	—	—	1	1
141	MiMo-V2-Flash	Xiaomi	—	—	—	1	1
142	MiMo-V2-Omni	Xiaomi	—	—	—	1	1
143	MiniMax M1 40K	MiniMax	—	—	—	1	1
144	MiniMax M1 80K	MiniMax	—	—	—	1	1
145	Mux / GPT-5.3-Codex	Mux	—	—	—	1	1
146	NVIDIA-Nemotron-3-Super-120B-A12B-BF16	NVIDIA	—	—	—	1	1
147	Nemotron 3 Nano (30B)	NVIDIA	—	—	—	1	1
148	OpenAI CUA (o1)	OpenAI	—	—	—	1	1
149	OpenAI Operator (CUA)	OpenAI	—	—	—	1	1
150	Qwen3.5-35B-A3B	Alibaba Cloud	—	—	—	1	1