Codesota · Models1,357 models indexed · 896 match filter
Editorial · Models
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Computer Vision models
896 models in Computer Vision · page 5 of 18.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 201 | ViTPose-G | — | — | — | 1 | 1 | 1 |
| 202 | VideoMAE ViT-B | — | — | — | 1 | 1 | 1 |
| 203 | cascadetabnet | Unknown | Unknown | Unknown | 1 | 1 | 1 |
| 204 | dots.mocr | RedNote | 3B | Multimodal OCR (3B params) | 1 | 1 | 1 |
| 205 | pMF-H + FD-loss | N/A | — | — | 1 | 1 | 1 |
| 206 | pil_maskrcnn | ICT, Chinese Academy of Sciences | Unknown | Mask R-CNN based scene text detector | 1 | 1 | 1 |
| 207 | Kimi-K2.5 | Moonshot.AI | — | — | 10 | 16 | |
| 208 | Qwen2.5-VL-72B | — | — | — | 14 | 14 | |
| 209 | PAN | Unknown | Unknown | Unknown | 4 | 12 | |
| 210 | Qwen3-VL-235B-A22B-Thinking | Qwen | — | — | 12 | 12 | |
| 211 | Qwen3-VL-8B-Instruct | Qwen | — | — | 12 | 12 | |
| 212 | SPCNET | Unknown | Unknown | Unknown | 4 | 12 | |
| 213 | TESTR | Unknown | Unknown | Unknown | 4 | 12 | |
| 214 | TextSnake | Unknown | Unknown | Unknown | 4 | 12 | |
| 215 | MiniCPM-o 4.5-Instruct | — | — | — | 11 | 11 | |
| 216 | Qwen2-VL 7B | Alibaba | 7B | — | 11 | 11 | |
| 217 | Qwen2-VL-2B | — | — | — | 10 | 10 | |
| 218 | ABCNet v2 | Unknown | Unknown | Unknown | 4 | 9 | |
| 219 | Corner Localization (multi-scale) | Unknown | Unknown | Unknown | 3 | 9 | |
| 220 | DeepSeek-Coder-V2-Instruct | DeepSeek | Unknown | MoE Transformer | 7 | 9 | |
| 221 | DeepSolo (ResNet-50) | Unknown | Unknown | Unknown | 4 | 9 | |
| 222 | FOTS | Unknown | Unknown | Unknown | 2 | 9 | |
| 223 | MGP-STR | Unknown | Unknown | Unknown | 9 | 9 | |
| 224 | Mask TextSpotter | Unknown | Unknown | Unknown | 3 | 9 | |
| 225 | PSENet-1s | Unknown | Unknown | Unknown | 3 | 9 | |
| 226 | Qwen2.5-Coder 32B | Alibaba | 32B | Dense Transformer | 8 | 9 | |
| 227 | RoBERTa | Unknown | Unknown | Unknown | 9 | 9 | |
| 228 | SSTD | Unknown | Unknown | Unknown | 3 | 9 | |
| 229 | SegLink | Unknown | Unknown | Unknown | 3 | 9 | |
| 230 | SwinTextSpotter | Unknown | Unknown | Unknown | 4 | 9 | |
| 231 | DBNet++ (ResNet-18) (736) | Unknown | Unknown | Unknown | 2 | 8 | |
| 232 | DeiT-B | Meta | 86M | Vision Transformer | 3 | 8 | |
| 233 | FAST-B-512 | Unknown | Unknown | Unknown | 2 | 8 | |
| 234 | FAST-B-640 | Unknown | Unknown | Unknown | 2 | 8 | |
| 235 | FAST-B-736 | Unknown | Unknown | Unknown | 2 | 8 | |
| 236 | FAST-S-512 | Unknown | Unknown | Unknown | 2 | 8 | |
| 237 | FAST-S-736 | Unknown | Unknown | Unknown | 2 | 8 | |
| 238 | GPT-4o mini | OpenAI | — | Multimodal LLM | 7 | 8 | |
| 239 | Gemini 3 Flash | Undisclosed | — | 7 | 8 | ||
| 240 | InternVL2-76B | Shanghai AI Lab | 76B | Vision-Language Model | 5 | 8 | |
| 241 | Aria | — | — | — | 7 | 7 | |
| 242 | BEiT-B | Unknown | Unknown | Unknown | 2 | 7 | |
| 243 | DBNet++ (ResNet-18) (800) | Unknown | Unknown | Unknown | 2 | 7 | |
| 244 | DBNet++ (ResNet-50) (800) | Unknown | Unknown | Unknown | 2 | 7 | |
| 245 | DINOv2 (ViT-g/14) | — | — | — | 7 | 7 | |
| 246 | DeepSolo (ResNet-50, TextOCR) | Unknown | Unknown | Unknown | 3 | 7 | |
| 247 | DiT-L | Unknown | Unknown | Unknown | 2 | 7 | |
| 248 | MANGO | Unknown | Unknown | Unknown | 3 | 7 | |
| 249 | MATRN | Research | Unknown | Unknown | 7 | 7 | |
| 250 | Mask R-CNN | Meta AI / FAIR | Unknown | Unknown | 2 | 7 |