Codesota · Models1,357 models indexed · 164 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Agentic AI models

164 models in Agentic AI · page 1 of 4.

#ModelVendorParametersArchitectureSOTABenchmarksResults
001GPT-4oOpenAIUndisclosedMultimodal LLM154557
002o3OpenAI51819
003Gemini-3.1-ProGoogle4311
004Claude Sonnet 4AnthropicMultimodal LLM31521
005Gemini 2.5 ProGoogleMultimodal LLM31516
006Claude 3.5 SonnetAnthropicUndisclosedMultimodal LLM22732
007Claude Opus 4AnthropicUndisclosed21623
008Qwen3.5-397B-A17BAlibaba21420
009o4-miniOpenAI21616
010Claude Opus 4.6Anthropic2615
011Claude Sonnet 4.6Anthropic2513
012SenseNova-U1-A3B-MoTSenseTime266
013Claude Opus 4.5Anthropic244
014Qwen3-235B-A22BAlibaba235B (22B active)moe11321
015GLM-5Zhipu AI130B1919
016Gemini 3 ProGoogleUndisclosed11113
017Claude Opus 4.5AnthropicUndisclosed1812
018Qwen3-VL-235B-A22B-InstructQwen11212
019Intern-S1-ProShanghai AI Lab156
020Gemini 3.1 ProGoogle144
021Qwen3.6 PlusAlibaba133
022Agent S3 w/ bBoN111
023Agent-E (GPT-4o)Emergence AIUnknownHierarchical agent with DOM distillation on GPT-4o111
024Claude Mythos PreviewAnthropic111
025Codex / GPT-5.5OpenAI111
026Codex CLI (GPT-5.5)OpenAI111
027GPT-5.5OpenAI111
028Holo3-35B-A3B111
029LettaUnknown111
030Qwen3.5-122B-A10BAlibaba1016
031Qwen3.5-27BAlibaba1016
032Qwen3.5-35B-A3BAlibaba1016
033DeepSeek-V3DeepSeekLLM915
034DeepSeek-V3.2DeepSeek814
035Qwen2.5-VL-72B1414
036Qwen3-VL-235B-A22B-ThinkingQwen1212
037Qwen3-VL-8B-InstructQwen1212
038o1OpenAI1012
039GPT-5.4OpenAI311
040Claude 3.7 SonnetAnthropic1010
041DeepSeek-R1-0528DeepSeek410
042GPT-5OpenAI910
043GPT-4.1OpenAI99
044Gemini 3 FlashGoogleUndisclosed78
045Kimi K2-Thinking-0905Moonshot AI28
046MiMo-V2-ProXiaomi28
047Qwen3 MaxAlibaba Cloud28
048o1-previewOpenAIUndisclosedReasoning LLM88
049o3-miniOpenAI88
050MiniMax M2.7Anthropic/OpenAI16