Codesota · Models
1,357 models indexed · 842 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor (models per vendor): speakleash (253), OpenAI (85), Google (71), Qwen (52), Alibaba (47), Anthropic (44), Microsoft (35), Meta (30), Mistral (30), DeepSeek (28), google (19), meta-llama (19), mistralai (19), Meta AI (15), CYFRAGOVPL (14), Zhipu AI (13), NVIDIA (10), SpeakLeash (10), internlm (10), xAI (10), ByteDance (9), Baidu (8), PLLuM (8), ibm-granite (8), microsoft (8), Amazon (7), Google DeepMind (7), MiniMax (7), Mistral AI (7), Remek (7), Shanghai AI Lab (7), allenai (7), utter-project (7), CohereForAI (6), Microsoft Research (6), Salesforce (6), 01-ai (5), Alibaba Cloud (5), Cohere (5), Moonshot AI (5), NousResearch (5), THUML (5), deepseek-ai (5), DeepMind (4), Facebook AI (4), IBM (4), Meituan (4), Stanford (4), THUDM (4), UC San Diego (4), VikParuchuri (4), gguf-iq (4), nvidia (4), openchat (4), tiiuae (4), Allen AI (3), BAAI (3), Du et al. (3), ForgeCode (3), Fudan University (3), IDEA Research (3), Liao et al. (3), Moonshot.AI (3), Nam Tuan Ly / NII (3), OPI-PG (3), OpenDataLab (3), ViCoS Lab Ljubljana (3), Xiaomi (3), Zhao et al. (3), gguf (3), gguf11bv30 (3), gguf7bv30 (3), upstage (3), + 247 smaller vendors (291 models)
§ 01 · Natural Language Processing models

842 models in Natural Language Processing · page 2 of 17.

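| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 051 | RankLLaMA-7B | Castorini (Waterloo) | 7B | LLaMA-2-7B (pointwise reranker) | 1 | 1 | 1 |
| 052 | DeepSeek R1 | DeepSeek | 671B MoE | | | 13 | 19 |
| 053 | Voicelab/trurl-2-13b-academic | Voicelab | 13B | | | 3 | 18 |
| 054 | berkeley-nest/Starling-LM-7B-alpha | berkeley-nest | 7.24B | | | 3 | 18 |
| 055 | meta-llama/Llama-3.3-70B-Instruct | meta-llama | 70.6B | | | 4 | 18 |
| 056 | openchat/openchat-3.5-0106 | openchat | 7.24B | | | 3 | 18 |
| 057 | speakleash/Bielik-11B-v2.0-Instruct | speakleash | 11.2B | | | 3 | 18 |
| 058 | speakleash/Bielik-7B-Instruct-v0.1 | speakleash | 7.24B | | | 3 | 18 |
| 059 | upstage/SOLAR-10.7B-Instruct-v1.0 | upstage | 10.7B | | | 3 | 18 |
| 060 | 01-ai/Yi-1.5-34B-Chat | 01-ai | 34.4B | | | 3 | 17 |
| 061 | GPT-3.5-turbo | OpenAI | | | | 3 | 17 |
| 062 | Mixtral-8x22b | Mistral | | | | 3 | 17 |
| 063 | Qwen/Qwen1.5-72B-Chat | Qwen | 72.3B | | | 3 | 17 |
| 064 | Qwen/Qwen2-72B-Instruct | Qwen | 72.7B | | | 3 | 17 |
| 065 | Qwen/Qwen2.5-1.5B-Instruct | Qwen | 1.54B | | | 3 | 17 |
| 066 | Qwen/Qwen2.5-32B-Instruct | Qwen | 32.8B | | | 3 | 17 |
| 067 | Qwen/Qwen2.5-3B-Instruct | Qwen | 3.09B | | | 3 | 17 |
| 068 | Qwen/Qwen2.5-7B-Instruct | Qwen | 7.62B | | | 3 | 17 |
| 069 | THUDM/glm-4-9b-chat | THUDM | 9.4B | | | 3 | 17 |
| 070 | alpindale/WizardLM-2-8x22B (API) | alpindale | 141B | | | 3 | 17 |
| 071 | internlm/internlm2-chat-20b | internlm | 19.9B | | | 3 | 17 |
| 072 | meta-llama/Llama-3.2-1B-Instruct | meta-llama | 1.24B | | | 3 | 17 |
| 073 | meta-llama/Llama-3.2-3B-Instruct | meta-llama | 3.21B | | | 3 | 17 |
| 074 | meta-llama/Meta-Llama-3-70B-Instruct | meta-llama | 70.6B | | | 3 | 17 |
| 075 | meta-llama/Meta-Llama-3-8B-Instruct | meta-llama | 8.03B | | | 3 | 17 |
| 076 | meta-llama/Meta-Llama-3.1-70B-Instruct | meta-llama | 70.6B | | | 3 | 17 |
| 077 | microsoft/Phi-4-mini-instruct | microsoft | 3.84B | | | 3 | 17 |
| 078 | mistralai/Mistral-7B-Instruct-v0.3 | mistralai | 7.25B | | | 3 | 17 |
| 079 | mistralai/Mistral-Nemo-Instruct-2407 | mistralai | 12.2B | | | 3 | 17 |
| 080 | mistralai/Mistral-Small-24B-Instruct-2501 | mistralai | 23.6B | | | 3 | 17 |
| 081 | mistralai/Mixtral-8x22B-Instruct-v0.1 (API) | mistralai | 141B | | | 3 | 17 |
| 082 | mistralai/Mixtral-8x7B-Instruct-v0.1 | mistralai | 46.7B | | | 3 | 17 |
| 083 | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | nvidia | 32B | | | 3 | 17 |
| 084 | openchat/openchat-3.5-0106-gemma | openchat | 8.54B | | | 3 | 17 |
| 085 | speakleash/Bielik-11B-v2.1-Instruct | speakleash | 11.2B | | | 3 | 17 |
| 086 | speakleash/Bielik-11B-v2.2-Instruct | speakleash | 11.2B | | | 3 | 17 |
| 087 | speakleash/Bielik-11B-v2.3-Instruct | speakleash | 11.2B | | | 3 | 17 |
| 088 | speakleash/Bielik-11B-v2.6-Instruct | speakleash | 11.2B | | | 3 | 17 |
| 089 | speakleash/Bielik-11B-v3.0-Instruct | speakleash | 11.2B | | | 3 | 17 |
| 090 | utter-project/EuroLLM-9B-Instruct | utter-project | 9B | | | 3 | 17 |
| 091 | Kimi-K2.5 | Moonshot.AI | | | | 10 | 16 |
| 092 | Llama-PLLuM-70B-chat | PLLuM | | | | 2 | 16 |
| 093 | Llama-PLLuM-8B-chat | PLLuM | | | | 2 | 16 |
| 094 | Mixtral-8x7b | Mistral | | | | 2 | 16 |
| 095 | PLLuM-12B-chat | PLLuM | | | | 2 | 16 |
| 096 | PLLuM-12B-nc-chat | PLLuM | | | | 2 | 16 |
| 097 | PLLuM-8x7B-chat | PLLuM | | | | 2 | 16 |
| 098 | PLLuM-8x7B-nc-chat | PLLuM | | | | 2 | 16 |
| 099 | Qwen3.5-122B-A10B | Alibaba | | | | 10 | 16 |
| 100 | Qwen3.5-27B | Alibaba | | | | 10 | 16 |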