Codesota · Models1,357 models indexed · 842 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Natural Language Processing models

842 models in Natural Language Processing · page 1 of 17.

#ModelVendorParametersArchitectureSOTABenchmarksResults
001GPT-4oOpenAIUndisclosedMultimodal LLM154557
002Gemini-3.1-Pro-PreviewGoogle717
003Gemma 3 (27B, IT)Google619
004mistralai/Mistral-Large-Instruct-2411mistralai123B4317
005Claude Sonnet 4AnthropicMultimodal LLM31521
006Gemini 1.5 ProGoogleMultimodal LLM31721
007GPT-4OpenAITransformer (LLM)3613
008BRIOYale NLPUnknownBART-large with contrastive learning objective326
009DeBERTa-v3-largeMicrosoft304MDeBERTa-v3-large356
010DeepSeek-V4-Pro MaxDeepSeek355
011Claude 3.5 SonnetAnthropicUndisclosedMultimodal LLM22732
012Claude Opus 4AnthropicUndisclosed21623
013Qwen3.5-397B-A17BAlibaba21420
014Phi-4Microsoft14Btransformer2317
015Mistral-Small-3.1-24B-Instruct-2503Mistral219
016gemma-3-12b-itGoogle219
017Gemini-3.0-Pro-PreviewGoogle217
018gemini-2.0-flash-001Google215
019NV-Embed-v2NVIDIA7BMistral-7B (LLM-based embedding)223
020ALBERT ensemble222
021Qwen3-235B-A22BAlibaba235B (22B active)moe11321
022GLM-5Zhipu AI130B1919
023Qwen/Qwen2.5-14B-InstructQwen14.8B1317
024Qwen/Qwen2.5-72B-InstructQwen72.7B1317
025mistralai/Mistral-Large-Instruct-2407mistralai123B1317
026meta-llama/Llama-4-Scout-17B-16E-Instruct (API)meta-llama109B1216
027Meta-Llama-3.1-405B-Instruct-FP8meta-llama1212
028internlm2-1_8binternlm1112
029Bielik-11B-v3.0-Instruct.Q4_K_M.ggufgguf11bv301111
030Qwen2.5-32BQwen1111
031b11t2347yth03847tyhy03847yt1111
032Gemini 2.5 Pro199
033Gemma-2-27b-itGoogle119
034LLaMA-65B199
035Mistral-Large-Instruct-2407Mistral119
036Mistral-Small-24B-Instruct-2501Mistral119
037Mistral-Small-Instruct-2409Mistral119
038Qwen2.5-32B-InstructAlibaba119
039aya-expanse-32bUnknown119
040Kimi K2.6166
041Llama 2 70B (5-shot)166
042MiniMax-Text-01MiniMax166
043Qwen/Qwen3.5-27B thinking (API)Qwen27B115
044Qwen/Qwen3.5-35B-A3B thinking (API)Qwen35B115
045deepseek-ai/DeepSeek-V3.2 (API)deepseek-ai685B115
046GTE-Qwen2-7B-instructAlibaba7BQwen2-7B (LLM-based embedding)134
047ByT5 XXL122
048GLiNER-multitaskKnowledgatorUnknownDeBERTa-based generalist IE model111
049ModernBERT (large)111
050QZhou-Embedding111