Codesota · Models1,357 models indexed

Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overview speakleash · 253 OpenAI · 85 Google · 71 Qwen · 52 Alibaba · 47 Anthropic · 44 Microsoft · 35 Meta · 30 Mistral · 30 DeepSeek · 28 google · 19 meta-llama · 19 mistralai · 19 Meta AI · 15 CYFRAGOVPL · 14 Zhipu AI · 13 NVIDIA · 10 SpeakLeash · 10 internlm · 10 xAI · 10 ByteDance · 9 Baidu · 8 PLLuM · 8 ibm-granite · 8 microsoft · 8 Amazon · 7 Google DeepMind · 7 MiniMax · 7 Mistral AI · 7 Remek · 7 Shanghai AI Lab · 7 allenai · 7 utter-project · 7 CohereForAI · 6 Microsoft Research · 6 Salesforce · 6 01-ai · 5 Alibaba Cloud · 5 Cohere · 5 Moonshot AI · 5 NousResearch · 5 THUML · 5 deepseek-ai · 5 DeepMind · 4 Facebook AI · 4 IBM · 4 Meituan · 4 Stanford · 4 THUDM · 4 UC San Diego · 4 VikParuchuri · 4 gguf-iq · 4 nvidia · 4 openchat · 4 tiiuae · 4 Allen AI · 3 BAAI · 3 Du et al. · 3 ForgeCode · 3 Fudan University · 3 IDEA Research · 3 Liao et al. · 3 Moonshot.AI · 3 Nam Tuan Ly / NII · 3 OPI-PG · 3 OpenDataLab · 3 ViCoS Lab Ljubljana · 3 Xiaomi · 3 Zhao et al. · 3 gguf · 3 gguf11bv30 · 3 gguf7bv30 · 3 upstage · 3+ 247 smaller vendors (291 models)

§ 01 · Research areas

20 areas, each with a complete model index.

computer-vision

Computer Vision

896 models · 2,328 results

led by Unknown

nlp

Natural Language Processing

842 models · 7,436 results

led by speakleash

agentic

Agentic AI

164 models · 225 results

led by OpenAI

computer-code

Computer Code

152 models · 297 results

led by Anthropic

reasoning

Reasoning

151 models · 415 results

led by OpenAI

speech

Speech

104 models · 532 results

led by NVIDIA

multimodal

Multimodal

88 models · 267 results

led by Alibaba

medical

Medical

50 models · 83 results

led by Research

industrial-inspection

Industrial Inspection

22 models · 27 results

led by Research

reinforcement-learning

Reinforcement Learning

20 models · 21 results

led by UC San Diego

Natural Language Processing

19 models · 32 results

led by Google

Time-series

16 models · 75 results

led by THUML

Audio

14 models · 19 results

led by Microsoft

graphs

Graphs

12 models · 12 results

led by Unknown

audio

Audio

11 models · 13 results

led by Google

mobile-development

Mobile Development

10 models · 40 results

led by Stanford / Google DeepMind / TRI

Click an area to see the full paginated list of models scored on its benchmarks. Stats (SOTA, result count) are computed for each page, not the whole table — the index stays responsive even with 1,357 models indexed.