Codesota · Models
1,357 models indexed · 35 match filter

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear, because a model without a recorded score can’t be ranked.
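The inclusion rule above can be illustrated with a minimal sketch. The record shape, field names, and sample entries below are hypothetical placeholders and are not taken from the site's data or schema; the only thing carried over from the text is the rule that a model needs at least one recorded score to be listed.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Model:
    # Hypothetical record shape; field names are assumptions, not the site's schema.
    name: str
    vendor: str
    scores: dict[str, float] = field(default_factory=dict)  # benchmark name -> score


def indexed(models):
    """Keep only models that have at least one recorded benchmark score."""
    return [m for m in models if m.scores]


def vendor_counts(models):
    """Count indexed models per vendor, largest count first."""
    return Counter(m.vendor for m in indexed(models)).most_common()


# Made-up placeholder entries, for illustration only.
models = [
    Model("model-a", "VendorX", {"bench-1": 0.91}),
    Model("model-b", "VendorX"),  # no recorded score, so it is left out
    Model("model-c", "VendorY", {"bench-1": 0.88, "bench-2": 0.73}),
]

print([m.name for m in indexed(models)])
print(vendor_counts(models))
```

Under this assumption, the per-vendor counts shown in a filter would be computed after the score filter, so a vendor whose models have no scores would not appear at all.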

Vendor: Areas overview; speakleash · 253; OpenAI · 85; Google · 71; Qwen · 52; Alibaba · 47; Anthropic · 44; Microsoft · 35; Meta · 30; Mistral · 30; DeepSeek · 28; google · 19; meta-llama · 19; mistralai · 19; Meta AI · 15; CYFRAGOVPL · 14; Zhipu AI · 13; NVIDIA · 10; SpeakLeash · 10; internlm · 10; xAI · 10; ByteDance · 9; Baidu · 8; PLLuM · 8; ibm-granite · 8; microsoft · 8; Amazon · 7; Google DeepMind · 7; MiniMax · 7; Mistral AI · 7; Remek · 7; Shanghai AI Lab · 7; allenai · 7; utter-project · 7; CohereForAI · 6; Microsoft Research · 6; Salesforce · 6; 01-ai · 5; Alibaba Cloud · 5; Cohere · 5; Moonshot AI · 5; NousResearch · 5; THUML · 5; deepseek-ai · 5; DeepMind · 4; Facebook AI · 4; IBM · 4; Meituan · 4; Stanford · 4; THUDM · 4; UC San Diego · 4; VikParuchuri · 4; gguf-iq · 4; nvidia · 4; openchat · 4; tiiuae · 4; Allen AI · 3; BAAI · 3; Du et al. · 3; ForgeCode · 3; Fudan University · 3; IDEA Research · 3; Liao et al. · 3; Moonshot.AI · 3; Nam Tuan Ly / NII · 3; OPI-PG · 3; OpenDataLab · 3; ViCoS Lab Ljubljana · 3; Xiaomi · 3; Zhao et al. · 3; gguf · 3; gguf11bv30 · 3; gguf7bv30 · 3; upstage · 3; + 247 smaller vendors (291 models)

§ 01 · Microsoft models

35 models from Microsoft · page 1 of 1.

| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 001 | DeBERTa-v3-large | Microsoft | 304M | DeBERTa-v3-large | 3 | 5 | 6 |
| 002 | Phi-4 | Microsoft | 14B | transformer | 2 | 3 | 17 |
| 003 | RAD-DINO | Microsoft | | Self-supervised ViT | 1 | 2 | 2 |
| 004 | VALL-E 2 | Microsoft | Unknown | Neural codec language model (EnCodec tokens) | 1 | 2 | 2 |
| 005 | NaturalSpeech 3 | Microsoft | ~500M | Factorized codec + non-AR diffusion | 1 | 1 | 1 |
| 006 | Swin Transformer V2 Large | Microsoft | 197M | Hierarchical Vision Transformer | 1 | 1 | 1 |
| 007 | WavLM Large (SV) | Microsoft | 316M | WavLM Large + ECAPA-TDNN head | 1 | 1 | 1 |
| 008 | Phi-4 Multimodal Instruct | Microsoft | 6B | Phi-4 multimodal | | 8 | 9 |
| 009 | WizardLM-2-8x22b | Microsoft | | | | 1 | 7 |
| 010 | E5-Mistral-7B-instruct | Microsoft | 7B | Mistral-7B (LLM-based embedding) | | 3 | 4 |
| 011 | ResNet-152 | Microsoft | 60M | CNN | | 2 | 3 |
| 012 | ResNet-50 | Microsoft | 25M | CNN | | 3 | 3 |
| 013 | UniXcoder | Microsoft | Unknown | Transformer encoder-decoder | | 3 | 3 |
| 014 | CodeBERT | Microsoft | Unknown | BERT | | 2 | 2 |
| 015 | Florence-2-Large | Microsoft | | | | 1 | 2 |
| 016 | KOSMOS-2.5 | Microsoft | | | | 1 | 2 |
| 017 | LightGBM | Microsoft | | Gradient Boosted Trees (leaf-wise) | | 2 | 2 |
| 018 | Azure Document Intelligence | Microsoft | Unknown | Managed layout + OCR extraction service | | 1 | 1 |
| 019 | Azure OCR | Microsoft | | Cloud OCR Service | | 1 | 1 |
| 020 | BEiT-3 (ViT-L) | Microsoft | Unknown | Multiway Transformer (ViT-L/14) | | 1 | 1 |
| 021 | BioViL | Microsoft | | Vision-Language Transformer | | 1 | 1 |
| 022 | CodeBERT | Microsoft | | BERT pretrained on code + NL | | 1 | 1 |
| 023 | DeBERTa (ensemble) | Microsoft | | | | 1 | 1 |
| 024 | DiT-Base | Microsoft | | Vision Transformer (self-supervised) | | 1 | 1 |
| 025 | DiT-Large | Microsoft | Unknown | Document Image Transformer Large | | 1 | 1 |
| 026 | GraphCodeBERT | Microsoft | 125M | transformer | | 1 | 1 |
| 027 | LayoutLMv3 | Microsoft | Unknown | Multimodal Transformer (text + layout + image) | | 1 | 1 |
| 028 | Pengi | Microsoft | ~300M | CLAP audio encoder + GPT-2 decoder | | 1 | 1 |
| 029 | Phi-4 14B | Microsoft | 14B | | | 1 | 1 |
| 030 | Swin Transformer Large | Microsoft | 197M | Hierarchical Vision Transformer | | 1 | 1 |
| 031 | Swin-L + UperNet | Microsoft | Unknown | Swin Transformer Large backbone + UperNet head | | 1 | 1 |
| 032 | UFO (GPT-4V) | Microsoft | Unknown | UI-Focused agent with dual-agent architecture on GPT-4V | | 1 | 1 |
| 033 | VALL-E | Microsoft | ~400M | Neural codec LM (EnCodec tokens) | | 1 | 1 |
| 034 | mDeBERTa-v3-base | Microsoft | 86M | DeBERTa-v3 (multilingual) | | 1 | 1 |
| 035 | swin_large.ms_in22k_ft_in1k | Microsoft | | Swin-L, IN22K pre-train, IN1K fine-tune | | 1 | 1 |