Codesota · Models · 1,357 models indexed · 151 match filter
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
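The inclusion rule above can be sketched as a simple filter. This is a minimal illustration only: the `Model` record, its `scores` field, and the `rankable` helper are hypothetical names, and the index's real data model is not shown on this page.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Model:
    """Hypothetical record: a model name plus its recorded benchmark scores."""
    name: str
    scores: Dict[str, float] = field(default_factory=dict)  # benchmark -> score


def rankable(models: List[Model]) -> List[Model]:
    """Keep only models that have at least one recorded benchmark score."""
    return [m for m in models if len(m.scores) > 0]


if __name__ == "__main__":
    # Placeholder entries, not real index data.
    sample = [
        Model("model-a", {"bench-x": 0.5}),
        Model("model-b"),  # no recorded score, so it is dropped
    ]
    print([m.name for m in rankable(sample)])
```

A model with an empty score map has nothing to compare against, so it is filtered out before any ranking step.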
Vendor: Areas overview; speakleash · 253; OpenAI · 85; Google · 71; Qwen · 52; Alibaba · 47; Anthropic · 44; Microsoft · 35; Meta · 30; Mistral · 30; DeepSeek · 28; google · 19; meta-llama · 19; mistralai · 19; Meta AI · 15; CYFRAGOVPL · 14; Zhipu AI · 13; NVIDIA · 10; SpeakLeash · 10; internlm · 10; xAI · 10; ByteDance · 9; Baidu · 8; PLLuM · 8; ibm-granite · 8; microsoft · 8; Amazon · 7; Google DeepMind · 7; MiniMax · 7; Mistral AI · 7; Remek · 7; Shanghai AI Lab · 7; allenai · 7; utter-project · 7; CohereForAI · 6; Microsoft Research · 6; Salesforce · 6; 01-ai · 5; Alibaba Cloud · 5; Cohere · 5; Moonshot AI · 5; NousResearch · 5; THUML · 5; deepseek-ai · 5; DeepMind · 4; Facebook AI · 4; IBM · 4; Meituan · 4; Stanford · 4; THUDM · 4; UC San Diego · 4; VikParuchuri · 4; gguf-iq · 4; nvidia · 4; openchat · 4; tiiuae · 4; Allen AI · 3; BAAI · 3; Du et al. · 3; ForgeCode · 3; Fudan University · 3; IDEA Research · 3; Liao et al. · 3; Moonshot.AI · 3; Nam Tuan Ly / NII · 3; OPI-PG · 3; OpenDataLab · 3; ViCoS Lab Ljubljana · 3; Xiaomi · 3; Zhao et al. · 3; gguf · 3; gguf11bv30 · 3; gguf7bv30 · 3; upstage · 3; + 247 smaller vendors (291 models)
§ 01 · Reasoning models
151 models in Reasoning · page 2 of 4.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 051 | GPT-4 Turbo | OpenAI | Undisclosed | — | 6 | 13 | |
| 052 | Llama 3.1 405B | Meta | — | — | 12 | 13 | |
| 053 | Qwen3-VL-235B-A22B-Thinking | Qwen | — | — | 12 | 12 | |
| 054 | Qwen3-VL-8B-Instruct | Qwen | — | — | 12 | 12 | |
| 055 | o1 | OpenAI | — | — | 10 | 12 | |
| 056 | DeepSeek-V3.2-Speciale | DeepSeek | — | — | 5 | 11 | |
| 057 | GPT-5.4 | OpenAI | — | — | 3 | 11 | |
| 058 | Gemma-3-27b | — | 27B | transformer | 5 | 11 | |
| 059 | Llama 3 70B | Meta | — | LLM | 11 | 11 | |
| 060 | MiniCPM-o 4.5-Instruct | — | — | — | 11 | 11 | |
| 061 | MiniMax-M2.5 | MiniMaxAI | — | — | 5 | 11 | |
| 062 | Claude 3.7 Sonnet | Anthropic | — | — | 10 | 10 | |
| 063 | GPT-5 | OpenAI | — | — | 9 | 10 | |
| 064 | SmoLM2 (1.7B) | — | — | — | 10 | 10 | |
| 065 | Step-3.5-Flash Base | — | — | — | 10 | 10 | |
| 066 | DeepSeek-v3-0324 | DeepSeek | — | — | 3 | 9 | |
| 067 | GLM-4.7 | Zhipu AI | — | — | 3 | 9 | |
| 068 | GPT-4.1 | OpenAI | — | — | 9 | 9 | |
| 069 | Gemini 2.5 Flash | — | — | — | 9 | 9 | |
| 070 | GPT-4o mini | OpenAI | — | Multimodal LLM | 7 | 8 | |
| 071 | Gemini 3 Flash | — | Undisclosed | — | 7 | 8 | |
| 072 | Mistral-Medium-3 | Mistral | — | — | 2 | 8 | |
| 073 | o1-preview | OpenAI | Undisclosed | Reasoning LLM | 8 | 8 | |
| 074 | o3-mini | OpenAI | — | — | 8 | 8 | |
| 075 | Apertus-70B-Instruct | — | — | — | 7 | 7 | |
| 076 | Aria | — | — | — | 7 | 7 | |
| 077 | LongCat-Flash-Omni | — | — | — | 7 | 7 | |
| 078 | BitNet b1.58 2B4T | — | — | — | 6 | 6 | |
| 079 | HRM-Text-1B | — | — | — | 6 | 6 | |
| 080 | NVIDIA-Nemotron-3-Super-120B-A12B-BF16 | — | — | — | 6 | 6 | |
| 081 | Qwen3-Coder-Next | Qwen | — | — | 6 | 6 | |
| 082 | Step-3.5-Flash | — | — | — | 6 | 6 | |
| 083 | Chameleon 34B | — | — | — | 5 | 5 | |
| 084 | DeepSeek-V4-Flash Max | DeepSeek | — | — | 5 | 5 | |
| 085 | GPT-4.1 mini | OpenAI | — | transformer | 5 | 5 | |
| 086 | GPT-4.5 Preview | OpenAI | — | — | 5 | 5 | |
| 087 | Gemini 2.5 Flash | — | — | — | 4 | 5 | |
| 088 | Gemini 2.5 Pro | — | — | — | 4 | 5 | |
| 089 | Gemma 3 (27B, IT) | — | — | — | 5 | 5 | |
| 090 | Kimi K2.5 | Moonshot AI | — | — | 2 | 5 | |
| 091 | Kimi K2.5 | Moonshot AI | Undisclosed | — | 4 | 5 | |
| 092 | OLMo-2-7B-1124 (olmOCR-peS2o) | — | — | — | 5 | 5 | |
| 093 | Qwen2.5-Plus | — | — | — | 5 | 5 | |
| 094 | BLT-Entropy 8B | — | — | — | 4 | 4 | |
| 095 | Claude Sonnet 4.5 | Anthropic | — | — | 4 | 4 | |
| 096 | GPT-4.5 | OpenAI | Undisclosed | — | 3 | 4 | |
| 097 | GPT-5.1 | OpenAI | — | — | 4 | 4 | |
| 098 | GPT-5.2 | OpenAI | — | — | 4 | 4 | |
| 099 | Gemma 4 31B | — | — | — | 4 | 4 | |
| 100 | Grok 2 | xAI | — | — | 4 | 4 | |
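
The section header reports 151 models on page 2 of 4, and the rows above are numbered 051 through 100. Both figures are consistent with 50 rows per page. The short sketch below checks that arithmetic. The page size of 50 is inferred from the row numbers rather than stated by the site, and the function names are hypothetical.

```python
import math
from typing import Tuple


def page_count(total: int, page_size: int = 50) -> int:
    """Number of pages needed to list `total` rows (page_size is an assumption)."""
    return math.ceil(total / page_size)


def page_bounds(page: int, total: int, page_size: int = 50) -> Tuple[int, int]:
    """Return the 1-based (first, last) row numbers shown on `page`."""
    if page < 1 or page > page_count(total, page_size):
        raise ValueError("page out of range")
    first = (page - 1) * page_size + 1
    last = min(page * page_size, total)  # the final page may be short
    return first, last


if __name__ == "__main__":
    print(page_count(151))      # 4
    print(page_bounds(2, 151))  # (51, 100)
    print(page_bounds(4, 151))  # (151, 151)
```

Under this assumption the fourth page would hold a single row, number 151.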