Codesota · Models1,357 models indexed
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Research areas

20 areas, each with a complete model index.

computer-vision
Computer Vision
896 models · 2,328 results
led by Unknown
nlp
Natural Language Processing
842 models · 7,436 results
led by speakleash
agentic
Agentic AI
164 models · 225 results
led by OpenAI
computer-code
Computer Code
152 models · 297 results
led by Anthropic
reasoning
Reasoning
151 models · 415 results
led by OpenAI
speech
Speech
104 models · 532 results
led by NVIDIA
multimodal
Multimodal
88 models · 267 results
led by Alibaba
medical
Medical
50 models · 83 results
led by Research
industrial-inspection
Industrial Inspection
22 models · 27 results
led by Research
reinforcement-learning
Reinforcement Learning
20 models · 21 results
led by UC San Diego
2
Natural Language Processing
19 models · 32 results
led by Google
7
Time-series
16 models · 75 results
led by THUML
6
Audio
14 models · 19 results
led by Microsoft
graphs
Graphs
12 models · 12 results
led by Unknown
audio
Audio
11 models · 13 results
led by Google
mobile-development
Mobile Development
10 models · 40 results
led by Anthropic
knowledge-base
Knowledge Base
9 models · 9 results
led by Meta AI
robots
Robots
5 models · 5 results
led by Stanford / Google DeepMind / TRI
time-series
Time Series
5 models · 7 results
led by Amazon
3
General
3 models · 8 results
led by Unknown

Click an area to see the full paginated list of models scored on its benchmarks. Stats (SOTA, result count) are computed for each page, not the whole table — the index stays responsive even with 1,357 models indexed.