Codesota · Models1,357 models indexed · 896 match filter
Editorial · Models
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Computer Vision models
896 models in Computer Vision · page 2 of 18.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 051 | MultiFiT, pseudo | Unknown | Unknown | Unknown | 2 | 7 | 7 |
| 052 | PMTD* | Unknown | Unknown | Unknown | 2 | 2 | 6 |
| 053 | FactT5B | Unknown | Unknown | Unknown | 2 | 1 | 5 |
| 054 | GPT-2-Large (prefix-tuning) | OpenAI | 774M | Transformer | 2 | 1 | 5 |
| 055 | clearOCR | TeamQuest | — | Traditional OCR | 2 | 1 | 5 |
| 056 | HTR-VT(line-level) | Unknown | Unknown | Unknown | 2 | 2 | 4 |
| 057 | AIMv2 ViT-3B/14 448px | — | — | — | 2 | 3 | 3 |
| 058 | DnC-SC | Unknown | Unknown | Unknown | 2 | 1 | 3 |
| 059 | GLM-OCR | Zhipu AI | — | — | 2 | 2 | 3 |
| 060 | HDLTex | Unknown | Unknown | Unknown | 2 | 3 | 3 |
| 061 | Hierarchical Table Recognizer | Takaya Kawakatsu | — | — | 2 | 1 | 3 |
| 062 | Holistic | Unknown | Unknown | Unknown | 2 | 1 | 3 |
| 063 | KD-LSTMreg | Unknown | Unknown | Unknown | 2 | 3 | 3 |
| 064 | RetinaNet | Unknown | Unknown | Unknown | 2 | 2 | 3 |
| 065 | Scrambled code + broken (alter) | Unknown | Unknown | Unknown | 2 | 1 | 3 |
| 066 | Vision Transformer (ViT-H/14) | — | — | — | 2 | 3 | 3 |
| 067 | ABINet-LV+TPS++ | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 068 | Accurate Content Copying | — | — | — | 2 | 1 | 2 |
| 069 | Bert | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 070 | Biinclusion (Euro500kReuters) | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 071 | BilBOWA | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 072 | CLIP4STR-H (DFN-5B) | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 073 | CV-Group | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 074 | ChuLo | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 075 | DINO-X | IDEA Research | Unknown | Unified vision model with DINO-based detection head + large language model | 2 | 1 | 2 |
| 076 | DeepPyramidion | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 077 | HunyuanOCR (1B) | Unknown | — | — | 2 | 2 | 2 |
| 078 | JSTR | Fujitake | Unknown | DTrOCR + judgment module for image-text matching to reduce misrecognition | 2 | 2 | 2 |
| 079 | MPAD-path | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 080 | OmniParser | Alibaba | — | Unified framework: text spotting, KIE, table recognition | 2 | 1 | 2 |
| 081 | PyLaia (human transcriptions + random split) | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 082 | SSD512 (VGG-16) | Google / UNC | ~27M | Single-shot multibox detector with VGG-16 backbone, 512x512 input | 2 | 1 | 2 |
| 083 | VTM | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 084 | VaeDiff-DocRE | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 085 | VisualWordGrid | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 086 | XLNet | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 087 | Gemini 3 Pro | Undisclosed | — | 1 | 11 | 13 | |
| 088 | MixNet | Unknown | Unknown | Unknown | 1 | 4 | 13 |
| 089 | CLIP4STR-B | Research | Unknown | Unknown | 1 | 12 | 12 |
| 090 | Qwen3-VL-235B-A22B-Instruct | Qwen | — | — | 1 | 12 | 12 |
| 091 | Corner-based Region Proposals | Unknown | Unknown | Unknown | 1 | 3 | 9 |
| 092 | olmOCR v0.4.0 | Allen AI | — | OCR Pipeline | 1 | 1 | 9 |
| 093 | DoPTA | Unknown | Unknown | Unknown | 1 | 3 | 8 |
| 094 | FAST-T-736 | Unknown | Unknown | Unknown | 1 | 2 | 8 |
| 095 | A3S | Unknown | Unknown | Unknown | 1 | 3 | 7 |
| 096 | CodeBERT (MLM) | Unknown | Unknown | Unknown | 1 | 7 | 7 |
| 097 | CodeBERT (MLM+RTD) | Unknown | Unknown | Unknown | 1 | 7 | 7 |
| 098 | SPTS | Unknown | Unknown | Unknown | 1 | 3 | 7 |
| 099 | CPPD | Unknown | Unknown | Unknown | 1 | 6 | 6 |
| 100 | Intern-S1-Pro | Shanghai AI Lab | — | — | 1 | 5 | 6 |