Codesota · Models1,357 models indexed · 842 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Natural Language Processing models

842 models in Natural Language Processing · page 14 of 17.

#ModelVendorParametersArchitectureSOTABenchmarksResults
651GPT-5.2-2025-12-11 (medium reasoning)OpenAI17
652GPT-5.2-2025-12-11 (no reasoning)OpenAI17
653GPT-5.2-2025-12-11 (xhigh reasoning)OpenAI17
654GPT-5.4-2026-03-05 (high reasoning)OpenAI17
655GPT-5.4-2026-03-05 (low reasoning)OpenAI17
656GPT-5.4-2026-03-05 (no reasoning)OpenAI17
657GPT-5.4-mini-2026-03-17 (high reasoning)OpenAI17
658GPT-5.4-mini-2026-03-17 (no reasoning)OpenAI17
659GPT-5.4-nano-2026-03-17 (high reasoning)OpenAI17
660GPT-5.4-nano-2026-03-17 (no reasoning)OpenAI17
661GPT-OSS-120bOpenAI17
662GPT-OSS-20bOpenAI17
663Gemini-2.0-Flash-ExperimentalGoogle17
664Gemini-2.0-Flash-Thinking-Exp-01-21Google17
665Gemini-2.5-Flash-Preview-04-17Google17
666Gemini-2.5-Pro-Exp-03-25Google17
667Gemini-2.5-Pro-Preview-06-05Google17
668Gemini-3-Flash-PreviewGoogle17
669Gemini-Exp-1206Google17
670Gemini-Flash-1.5Google17
671Gemini-Pro-1.5Google17
672Gemma-2-27bGoogle17
673Gemma-2-9bGoogle17
674Grok-2-1212xAI17
675Grok-3-BetaxAI17
676Grok-3-Mini-BetaxAI17
677Grok-4-FastxAI17
678Grok-4.1-FastxAI17
679Grok-4.20xAI17
680Kimi-K2-0905Moonshot.AI17
681Llama-3.0-70BMeta17
682Llama-3.1-405bMeta17
683Llama-3.1-70BMeta17
684Llama-3.1-8BMeta17
685Llama-3.1-Tulu-3-405BMeta17
686Llama-3.3-70BMeta17
687Llama-PLLuM-70B-chat-250801PLLuM17
688Magistral-Small-2506Mistral17
689MiniMax-M2.7MiniMaxAI17
690Ministral-14b-2512Mistral17
691Ministral-3b-2512Mistral17
692Ministral-8bMistral17
693Ministral-8b-2512Mistral17
694Mistral-7b-v0.3Mistral17
695Mistral-Large-2407Mistral17
696Mistral-Large-2411Mistral17
697Mistral-Large-2512Mistral17
698Mistral-NemoMistral17
699Mistral-Small-24B-2501Mistral17
700Mistral-Small-3.1-24B-2503Mistral17