Codesota · Models1,357 models indexed · 896 match filter
Editorial · Models
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Computer Vision models
896 models in Computer Vision · page 7 of 18.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 301 | SigLIP 2 (g/16) | — | — | — | 5 | 5 | |
| 302 | T5-base (STSM) | 220M | Transformer | 1 | 5 | ||
| 303 | TextDragon | Unknown | Unknown | Unknown | 2 | 5 | |
| 304 | ZAYA1-VL-8B | — | — | — | 5 | 5 | |
| 305 | Baek et al. | Unknown | Unknown | Unknown | 4 | 4 | |
| 306 | BiLSTM (UN) | Unknown | Unknown | Unknown | 4 | 4 | |
| 307 | C2F + ALTERNATE | Unknown | Unknown | Unknown | 1 | 4 | |
| 308 | CCD-ViT-Base(ARD_2.8M) | Unknown | Unknown | Unknown | 4 | 4 | |
| 309 | CCD-ViT-Small(ARD_2.8M) | Unknown | Unknown | Unknown | 4 | 4 | |
| 310 | CCD-ViT-Tiny(ARD_2.8M) | Unknown | Unknown | Unknown | 4 | 4 | |
| 311 | CSTR | Unknown | Unknown | Unknown | 4 | 4 | |
| 312 | DBNet++ (ResNet-18) (512) | Unknown | Unknown | Unknown | 1 | 4 | |
| 313 | DBNet++ (ResNet-50) (1152) | Unknown | Unknown | Unknown | 1 | 4 | |
| 314 | DBNet++ (ResNet-50) (736) | Unknown | Unknown | Unknown | 1 | 4 | |
| 315 | DeepSeek-OCR | DeepSeek | — | Vision-Language OCR Model | 2 | 4 | |
| 316 | EAST | Unknown | Unknown | Unknown | 2 | 4 | |
| 317 | FAST-B-1280 | Unknown | Unknown | Unknown | 1 | 4 | |
| 318 | FAST-B-800 | Unknown | Unknown | Unknown | 1 | 4 | |
| 319 | FAST-B-896 | Unknown | Unknown | Unknown | 1 | 4 | |
| 320 | LRANet | AAAI 2024 | — | — | 2 | 4 | |
| 321 | Lightweight Text CNN | Unknown | Unknown | Unknown | 2 | 4 | |
| 322 | Lightweight TextCNN with Dual Optimizer | Unknown | Unknown | Unknown | 2 | 4 | |
| 323 | Llama 3-V (405B) | — | — | — | 4 | 4 | |
| 324 | Mask2Former (Swin-L) | Meta AI / UIUC | — | Transformer | 2 | 4 | |
| 325 | MuTabNet | Unknown | Unknown | Unknown | 2 | 4 | |
| 326 | OrigamiNet-12 | Unknown | Unknown | Unknown | 2 | 4 | |
| 327 | Orthogonalized Soft VSM | Unknown | Unknown | Unknown | 4 | 4 | |
| 328 | SAFL | Unknown | Unknown | Unknown | 4 | 4 | |
| 329 | SATRN | Unknown | Unknown | Unknown | 4 | 4 | |
| 330 | SEED | Unknown | Unknown | Unknown | 4 | 4 | |
| 331 | T5-11B | Unknown | Unknown | 2 | 4 | ||
| 332 | TextPerceptron | Unknown | Unknown | Unknown | 2 | 4 | |
| 333 | ViTSTR | Unknown | Unknown | Unknown | 4 | 4 | |
| 334 | ALIGN | — | — | — | 3 | 3 | |
| 335 | ASTER | Unknown | Unknown | Unknown | 3 | 3 | |
| 336 | AltCLIP | — | — | — | 3 | 3 | |
| 337 | BERTSUM+Transformer | Unknown | Unknown | Unknown | 1 | 3 | |
| 338 | BRIDO | BRIDO Authors | — | BART-based with democratic contrastive learning for factual consistency | 1 | 3 | |
| 339 | BertSumExt | Unknown | Unknown | Unknown | 1 | 3 | |
| 340 | BiT-L | — | — | — | 3 | 3 | |
| 341 | CCD-ViT-Base | Unknown | Unknown | Unknown | 3 | 3 | |
| 342 | CN-CLIP | — | — | — | 3 | 3 | |
| 343 | Ch,ng et al. | Unknown | Unknown | Unknown | 1 | 3 | |
| 344 | CharNet H-50 (multi-scale) | Unknown | Unknown | Unknown | 1 | 3 | |
| 345 | CharNet H-50 (single-scale) | Unknown | Unknown | Unknown | 1 | 3 | |
| 346 | CharNet H-57 (multi-scale) | Unknown | Unknown | Unknown | 1 | 3 | |
| 347 | CharNet H-57 (single-scale) | Unknown | Unknown | Unknown | 1 | 3 | |
| 348 | CharNet R-50 | Unknown | Unknown | Unknown | 1 | 3 | |
| 349 | Claude-3 Sonnet | Unknown | Unknown | Unknown | 1 | 3 | |
| 350 | CodeT5+ | Salesforce | Unknown | T5-based encoder-decoder | 3 | 3 |