Codesota · Models · 1,357 models indexed · 88 match filter

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor: Areas overview | speakleash · 253 | OpenAI · 85 | Google · 71 | Qwen · 52 | Alibaba · 47 | Anthropic · 44 | Microsoft · 35 | Meta · 30 | Mistral · 30 | DeepSeek · 28 | google · 19 | meta-llama · 19 | mistralai · 19 | Meta AI · 15 | CYFRAGOVPL · 14 | Zhipu AI · 13 | NVIDIA · 10 | SpeakLeash · 10 | internlm · 10 | xAI · 10 | ByteDance · 9 | Baidu · 8 | PLLuM · 8 | ibm-granite · 8 | microsoft · 8 | Amazon · 7 | Google DeepMind · 7 | MiniMax · 7 | Mistral AI · 7 | Remek · 7 | Shanghai AI Lab · 7 | allenai · 7 | utter-project · 7 | CohereForAI · 6 | Microsoft Research · 6 | Salesforce · 6 | 01-ai · 5 | Alibaba Cloud · 5 | Cohere · 5 | Moonshot AI · 5 | NousResearch · 5 | THUML · 5 | deepseek-ai · 5 | DeepMind · 4 | Facebook AI · 4 | IBM · 4 | Meituan · 4 | Stanford · 4 | THUDM · 4 | UC San Diego · 4 | VikParuchuri · 4 | gguf-iq · 4 | nvidia · 4 | openchat · 4 | tiiuae · 4 | Allen AI · 3 | BAAI · 3 | Du et al. · 3 | ForgeCode · 3 | Fudan University · 3 | IDEA Research · 3 | Liao et al. · 3 | Moonshot.AI · 3 | Nam Tuan Ly / NII · 3 | OPI-PG · 3 | OpenDataLab · 3 | ViCoS Lab Ljubljana · 3 | Xiaomi · 3 | Zhao et al. · 3 | gguf · 3 | gguf11bv30 · 3 | gguf7bv30 · 3 | upstage · 3 | + 247 smaller vendors (291 models)
§ 01 · Multimodal models

88 models in Multimodal · page 2 of 2.

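| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 051 | MiniCPM-V 4.6-Thinking (16x) | | | | | 5 | 5 |
| 052 | Qwen2.5-VL 72B | Alibaba | 72B | Vision-Language Model | | 5 | 5 |
| 053 | ZAYA1-VL-8B | | | | | 5 | 5 |
| 054 | GPT-4V | Unknown | Unknown | Transformer | | 4 | 4 |
| 055 | GPT-5.1 | OpenAI | | | | 4 | 4 |
| 056 | GPT-5.2 | OpenAI | | | | 4 | 4 |
| 057 | Gemma 4 31B | Google | | | | 4 | 4 |
| 058 | Llama 3-V (405B) | | | | | 4 | 4 |
| 059 | ALIGN | | | | | 3 | 3 |
| 060 | AltCLIP | | | | | 3 | 3 |
| 061 | BAGEL (7B MoT) | | | | | 3 | 3 |
| 062 | LLaVA-1.5 | UW-Madison / Microsoft | Unknown | CLIP ViT-L + MLP projector + Vicuna-13B | | 3 | 3 |
| 063 | MiniMax-VL-01 | | | | | 3 | 3 |
| 064 | Qwen3-Omni-30B-A3B-Base-202507 | | | | | 3 | 3 |
| 065 | Qwen3-Omni-Flash-Thinking | | | | | 3 | 3 |
| 066 | qwen2.5-vl-7b | Unknown | Unknown | Unknown | | 3 | 3 |
| 067 | Flamingo (32-shot) | | | | | 2 | 2 |
| 068 | GLIPv2-H (fine-tuned) | | | | | 2 | 2 |
| 069 | GPT-5.1 Instant | OpenAI | | | | 2 | 2 |
| 070 | GPT-5.1 Thinking | OpenAI | | | | 2 | 2 |
| 071 | Llama 3.2 Vision 90B | Meta | Unknown | Llama 3.1 + cross-attention vision adapter | | 2 | 2 |
| 072 | LongVU | | | | | 2 | 2 |
| 073 | Qwen3.5-122B-A10B | Alibaba Cloud | | | | 2 | 2 |
| 074 | Qwen3.5-27B | Alibaba Cloud | | | | 2 | 2 |
| 075 | Qwen3.5-397B-A17B | Alibaba | | | | 2 | 2 |
| 076 | AsymFLUX.2 klein | | | | | 1 | 1 |
| 077 | BAGEL (7B MoT) with LLM rewriter | | | | | 1 | 1 |
| 078 | BLIP CapFilt-L | | | | | 1 | 1 |
| 079 | BLIP-2 ViT-g FlanT5 XXL | | | | | 1 | 1 |
| 080 | BLIP-2 ViT-g OPT 6.7B | | | | | 1 | 1 |
| 081 | Chameleon-MultiTask | | | | | 1 | 1 |
| 082 | CoCa | Google | Unknown | Image encoder + cross-attention + causal decoder | | 1 | 1 |
| 083 | Emu3.5 (34B, AR) | | | | | 1 | 1 |
| 084 | Grok-1.5V | | | | | 1 | 1 |
| 085 | Lumina-DiMOO | | | | | 1 | 1 |
| 086 | Qwen2.5-VL-3B | | | | | 1 | 1 |
| 087 | SiLVR | | | | | 1 | 1 |
| 088 | Spectral Progressive Diffusion (PixelGen, TF) | | | | | 1 | 1 |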