Codesota · Models1,368 models indexed · 174 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 60Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4Xiaomi · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3StepFun · 3ViCoS Lab Ljubljana · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 246 smaller vendors (290 models)
§ 01 · Reasoning models

174 models in Reasoning · page 1 of 4.

#ModelVendorParametersArchitectureSOTABenchmarksResults
001GPT-4oOpenAIUndisclosedMultimodal LLM154557
002Qwen2.5-PlusQwen91010
003Gemma 3 (27B, IT)Google6210
004o3OpenAI51819
005Claude Sonnet 4AnthropicMultimodal LLM31521
006Gemini 1.5 ProGoogleMultimodal LLM31721
007Gemini 2.5 ProGoogleMultimodal LLM31516
008GPT-4OpenAITransformer (LLM)3613
009Qwen3.5-Omni-Plus31010
010DeepSeek-V4-Pro MaxDeepSeek367
011Claude 3.5 SonnetAnthropicUndisclosedMultimodal LLM22732
012Claude Opus 4AnthropicUndisclosed21623
013Qwen3.5-397B-A17BAlibaba21420
014o4-miniOpenAI21616
015Claude Opus 4.6Anthropic2615
016Claude Sonnet 4.6Anthropic2513
017Llama 3 (405B, Instruct)Meta21011
018SenseNova-U1-A3B-MoTSenseTime278
019Claude Opus 4.5Anthropic244
020Qwen3.6 PlusAlibaba244
021Qwen3-235B-A22BAlibaba235B (22B active)moe11422
022GLM-5Zhipu AI130B1919
023Qwen3-VL-235B-A22B-InstructQwen11314
024Gemini 3 ProGoogleUndisclosed11113
025Claude Opus 4.5AnthropicUndisclosed1812
026Qwen3.6-35B-A3B11111
027Qwen3.6-27B11010
028Gemini 2.5 Pro199
029LLaMA-65B199
030Intern-S1-ProShanghai AI Lab168
031MiniMax-Text-01MiniMax178
032Kimi K2.6166
033Llama 2 70B (5-shot)166
034Gemini 3.1 ProGoogle144
035Step-3.5-Flash PaCoRe144
036Claude Opus 4.7Anthropic122
037Gemini 3 Pro PreviewGoogle122
038o3 (high)OpenAI122
039ERNIE 5.0Baidu111
040o4-mini (high)OpenAI111
041DeepSeek R1DeepSeek671B MoE1319
042nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16nvidia32B418
043Mixtral-8x22bMistral317
044Qwen3.5-27BAlibaba1117
045Qwen3.5-35B-A3BAlibaba1117
046Kimi-K2.5Moonshot.AI1016
047Qwen3.5-122B-A10BAlibaba1016
048DeepSeek-V3DeepSeekLLM915
049DeepSeek-V3.2DeepSeek915
050GLM-4.5Zhipu AI915