Codesota · Models · 1,357 models indexed · 164 match filter

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor: speakleash · 253, OpenAI · 85, Google · 71, Qwen · 52, Alibaba · 47, Anthropic · 44, Microsoft · 35, Meta · 30, Mistral · 30, DeepSeek · 28, google · 19, meta-llama · 19, mistralai · 19, Meta AI · 15, CYFRAGOVPL · 14, Zhipu AI · 13, NVIDIA · 10, SpeakLeash · 10, internlm · 10, xAI · 10, ByteDance · 9, Baidu · 8, PLLuM · 8, ibm-granite · 8, microsoft · 8, Amazon · 7, Google DeepMind · 7, MiniMax · 7, Mistral AI · 7, Remek · 7, Shanghai AI Lab · 7, allenai · 7, utter-project · 7, CohereForAI · 6, Microsoft Research · 6, Salesforce · 6, 01-ai · 5, Alibaba Cloud · 5, Cohere · 5, Moonshot AI · 5, NousResearch · 5, THUML · 5, deepseek-ai · 5, DeepMind · 4, Facebook AI · 4, IBM · 4, Meituan · 4, Stanford · 4, THUDM · 4, UC San Diego · 4, VikParuchuri · 4, gguf-iq · 4, nvidia · 4, openchat · 4, tiiuae · 4, Allen AI · 3, BAAI · 3, Du et al. · 3, ForgeCode · 3, Fudan University · 3, IDEA Research · 3, Liao et al. · 3, Moonshot.AI · 3, Nam Tuan Ly / NII · 3, OPI-PG · 3, OpenDataLab · 3, ViCoS Lab Ljubljana · 3, Xiaomi · 3, Zhao et al. · 3, gguf · 3, gguf11bv30 · 3, gguf7bv30 · 3, upstage · 3, plus 247 smaller vendors (291 models)
§ 01 · Agentic AI models

164 models in Agentic AI · page 3 of 4.

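| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 101 | Capy / Claude Opus 4.6 | Capy | | | | 1 | 1 |
| 102 | Claude Code (Haiku 4.5) | Anthropic | | | | 1 | 1 |
| 103 | Claude Code (Opus 4.7) | Anthropic | | | | 1 | 1 |
| 104 | Claude Code (Sonnet 4.6) | Anthropic | | | | 1 | 1 |
| 105 | CoAct-1 | Salesforce | | | | 1 | 1 |
| 106 | CodeBrain-1 / GPT-5.3-Codex | CodeBrain | | | | 1 | 1 |
| 107 | Codex CLI (GPT-5.4-mini) | OpenAI | | | | 1 | 1 |
| 108 | Crux / Claude Opus 4.6 | Crux | | | | 1 | 1 |
| 109 | DeepSeek-V3.2 (Thinking) | DeepSeek | | | | 1 | 1 |
| 110 | DeepSeek-V3.2-Exp | DeepSeek | | | | 1 | 1 |
| 111 | DeepSeek-V3.2-Speciale | DeepSeek | | | | 1 | 1 |
| 112 | DeepSeek-V4-Pro | DeepSeek-AI | | | | 1 | 1 |
| 113 | Devstral Medium | Mistral AI | | | | 1 | 1 |
| 114 | Devstral Small 1.1 | Mistral AI | | | | 1 | 1 |
| 115 | Droid / Claude Opus 4.6 | Droid | | | | 1 | 1 |
| 116 | Droid / GPT-5.3-Codex | Droid | | | | 1 | 1 |
| 117 | ForgeCode (DeepSeek-V4) | DeepSeek | | | | 1 | 1 |
| 118 | ForgeCode / Claude Opus 4.6 | ForgeCode | | | | 1 | 1 |
| 119 | ForgeCode / GPT-5.4 | ForgeCode | | | | 1 | 1 |
| 120 | ForgeCode / Gemini 3.1 Pro | ForgeCode | | | | 1 | 1 |
| 121 | GLM-4.6 | Zhipu AI | | | | 1 | 1 |
| 122 | GLM-4.7 | Zhipu AI | | | | 1 | 1 |
| 123 | GLM-4.7-Flash | Zhipu AI | | | | 1 | 1 |
| 124 | GLM-5V-Turbo | | | | | 1 | 1 |
| 125 | GPT-5.1 Codex | OpenAI | | | | 1 | 1 |
| 126 | GTA1 (7B) | Salesforce | | | | 1 | 1 |
| 127 | Gemini 2.5 Flash-Lite | Google | | | | 1 | 1 |
| 128 | Gemini 2.5 Pro Preview | Google | | | | 1 | 1 |
| 129 | Gemini Diffusion | Google | | | | 1 | 1 |
| 130 | Grok Code Fast 1 | xAI | | | | 1 | 1 |
| 131 | IndusAGI Coding Agent / GPT-5.3-Codex | IndusAGI | | | | 1 | 1 |
| 132 | JEDI-7B with o3 planner | | | | | 1 | 1 |
| 133 | Junie CLI / Multiple | JetBrains | | | | 1 | 1 |
| 134 | Kimi K2-Instruct-0905 | Moonshot AI | | | | 1 | 1 |
| 135 | LongCat-Flash-Chat | Meituan | | | | 1 | 1 |
| 136 | LongCat-Flash-Lite | Meituan | | | | 1 | 1 |
| 137 | LongCat-Flash-Thinking | Meituan | | | | 1 | 1 |
| 138 | LongCat-Flash-Thinking-2601 | Meituan | | | | 1 | 1 |
| 139 | MAYA-V2 / Claude 4.6 Opus | MAYA | | | | 1 | 1 |
| 140 | MemOS | Unknown | | | | 1 | 1 |
| 141 | MiMo-V2-Flash | Xiaomi | | | | 1 | 1 |
| 142 | MiMo-V2-Omni | Xiaomi | | | | 1 | 1 |
| 143 | MiniMax M1 40K | MiniMax | | | | 1 | 1 |
| 144 | MiniMax M1 80K | MiniMax | | | | 1 | 1 |
| 145 | Mux / GPT-5.3-Codex | Mux | | | | 1 | 1 |
| 146 | NVIDIA-Nemotron-3-Super-120B-A12B-BF16 | NVIDIA | | | | 1 | 1 |
| 147 | Nemotron 3 Nano (30B) | NVIDIA | | | | 1 | 1 |
| 148 | OpenAI CUA (o1) | OpenAI | | | | 1 | 1 |
| 149 | OpenAI Operator (CUA) | OpenAI | | | | 1 | 1 |
| 150 | Qwen3.5-35B-A3B | Alibaba Cloud | | | | 1 | 1 |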