Codesota · Models
1,357 models indexed · 35 match filter
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
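The inclusion rule above can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual implementation: the `Model` class, the `rankable` and `by_vendor` helpers, and the sample records are all hypothetical. Vendor matching is assumed to be exact and case-sensitive, which fits the vendor filter listing "Microsoft" and "microsoft" as separate entries.

```python
from dataclasses import dataclass, field


@dataclass
class Model:
    """Hypothetical index record; field names are assumptions."""
    name: str
    vendor: str
    # benchmark name -> recorded score; empty means nothing was recorded
    scores: dict[str, float] = field(default_factory=dict)


def rankable(models):
    """Keep only models with at least one recorded benchmark score."""
    return [m for m in models if m.scores]


def by_vendor(models, vendor):
    """Apply the inclusion rule, then filter on an exact vendor string."""
    return [m for m in rankable(models) if m.vendor == vendor]


# Placeholder data, not taken from the index.
models = [
    Model("model-a", "VendorX", {"bench-1": 0.5}),
    Model("model-b", "VendorX"),  # no scores, so it is never listed
    Model("model-c", "VendorY", {"bench-1": 0.7, "bench-2": 0.1}),
]

listed = [m.name for m in rankable(models)]
vendor_x = [m.name for m in by_vendor(models, "VendorX")]
```

Here `model-b` drops out of both results because it has no recorded score, regardless of which vendor filter is applied.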
Vendor: Areas overview
speakleash · 253, OpenAI · 85, Google · 71, Qwen · 52, Alibaba · 47, Anthropic · 44, Microsoft · 35, Meta · 30, Mistral · 30, DeepSeek · 28, google · 19, meta-llama · 19, mistralai · 19, Meta AI · 15, CYFRAGOVPL · 14, Zhipu AI · 13, NVIDIA · 10, SpeakLeash · 10, internlm · 10, xAI · 10, ByteDance · 9, Baidu · 8, PLLuM · 8, ibm-granite · 8, microsoft · 8, Amazon · 7, Google DeepMind · 7, MiniMax · 7, Mistral AI · 7, Remek · 7, Shanghai AI Lab · 7, allenai · 7, utter-project · 7, CohereForAI · 6, Microsoft Research · 6, Salesforce · 6, 01-ai · 5, Alibaba Cloud · 5, Cohere · 5, Moonshot AI · 5, NousResearch · 5, THUML · 5, deepseek-ai · 5, DeepMind · 4, Facebook AI · 4, IBM · 4, Meituan · 4, Stanford · 4, THUDM · 4, UC San Diego · 4, VikParuchuri · 4, gguf-iq · 4, nvidia · 4, openchat · 4, tiiuae · 4, Allen AI · 3, BAAI · 3, Du et al. · 3, ForgeCode · 3, Fudan University · 3, IDEA Research · 3, Liao et al. · 3, Moonshot.AI · 3, Nam Tuan Ly / NII · 3, OPI-PG · 3, OpenDataLab · 3, ViCoS Lab Ljubljana · 3, Xiaomi · 3, Zhao et al. · 3, gguf · 3, gguf11bv30 · 3, gguf7bv30 · 3, upstage · 3
+ 247 smaller vendors (291 models)
§ 01 · Microsoft models
35 models from Microsoft · page 1 of 1.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 001 | DeBERTa-v3-large | Microsoft | 304M | DeBERTa-v3-large | 3 | 5 | 6 |
| 002 | Phi-4 | Microsoft | 14B | transformer | 2 | 3 | 17 |
| 003 | RAD-DINO | Microsoft | — | Self-supervised ViT | 1 | 2 | 2 |
| 004 | VALL-E 2 | Microsoft | Unknown | Neural codec language model (EnCodec tokens) | 1 | 2 | 2 |
| 005 | NaturalSpeech 3 | Microsoft | ~500M | Factorized codec + non-AR diffusion | 1 | 1 | 1 |
| 006 | Swin Transformer V2 Large | Microsoft | 197M | Hierarchical Vision Transformer | 1 | 1 | 1 |
| 007 | WavLM Large (SV) | Microsoft | 316M | WavLM Large + ECAPA-TDNN head | 1 | 1 | 1 |
| 008 | Phi-4 Multimodal Instruct | Microsoft | 6B | Phi-4 multimodal | — | 8 | 9 |
| 009 | WizardLM-2-8x22b | Microsoft | — | — | — | 1 | 7 |
| 010 | E5-Mistral-7B-instruct | Microsoft | 7B | Mistral-7B (LLM-based embedding) | — | 3 | 4 |
| 011 | ResNet-152 | Microsoft | 60M | CNN | — | 2 | 3 |
| 012 | ResNet-50 | Microsoft | 25M | CNN | — | 3 | 3 |
| 013 | UniXcoder | Microsoft | Unknown | Transformer encoder-decoder | — | 3 | 3 |
| 014 | CodeBERT | Microsoft | Unknown | BERT | — | 2 | 2 |
| 015 | Florence-2-Large | Microsoft | — | — | — | 1 | 2 |
| 016 | KOSMOS-2.5 | Microsoft | — | — | — | 1 | 2 |
| 017 | LightGBM | Microsoft | — | Gradient Boosted Trees (leaf-wise) | — | 2 | 2 |
| 018 | Azure Document Intelligence | Microsoft | Unknown | Managed layout + OCR extraction service | — | 1 | 1 |
| 019 | Azure OCR | Microsoft | — | Cloud OCR Service | — | 1 | 1 |
| 020 | BEiT-3 (ViT-L) | Microsoft | Unknown | Multiway Transformer (ViT-L/14) | — | 1 | 1 |
| 021 | BioViL | Microsoft | — | Vision-Language Transformer | — | 1 | 1 |
| 022 | CodeBERT | Microsoft | — | BERT pretrained on code + NL | — | 1 | 1 |
| 023 | DeBERTa (ensemble) | Microsoft | — | — | — | 1 | 1 |
| 024 | DiT-Base | Microsoft | — | Vision Transformer (self-supervised) | — | 1 | 1 |
| 025 | DiT-Large | Microsoft | Unknown | Document Image Transformer Large | — | 1 | 1 |
| 026 | GraphCodeBERT | Microsoft | 125M | transformer | — | 1 | 1 |
| 027 | LayoutLMv3 | Microsoft | Unknown | Multimodal Transformer (text + layout + image) | — | 1 | 1 |
| 028 | Pengi | Microsoft | ~300M | CLAP audio encoder + GPT-2 decoder | — | 1 | 1 |
| 029 | Phi-4 14B | Microsoft | 14B | — | — | 1 | 1 |
| 030 | Swin Transformer Large | Microsoft | 197M | Hierarchical Vision Transformer | — | 1 | 1 |
| 031 | Swin-L + UperNet | Microsoft | Unknown | Swin Transformer Large backbone + UperNet head | — | 1 | 1 |
| 032 | UFO (GPT-4V) | Microsoft | Unknown | UI-Focused agent with dual-agent architecture on GPT-4V | — | 1 | 1 |
| 033 | VALL-E | Microsoft | ~400M | Neural codec LM (EnCodec tokens) | — | 1 | 1 |
| 034 | mDeBERTa-v3-base | Microsoft | 86M | DeBERTa-v3 (multilingual) | — | 1 | 1 |
| 035 | swin_large.ms_in22k_ft_in1k | Microsoft | — | Swin-L, IN22K pre-train, IN1K fine-tune | — | 1 | 1 |