Codesota · Models · 1,357 models indexed · 896 match filter

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
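The inclusion rule above (a model is listed only if it has at least one recorded benchmark score) can be sketched as a simple filter. This is a minimal illustration, not the site's actual code: the `ModelEntry` record, its `scores` field, and the example names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ModelEntry:
    # Hypothetical record shape; the real index schema is not shown in the source.
    name: str
    # Maps a benchmark name to a recorded score; empty means "no score yet".
    scores: dict = field(default_factory=dict)


def rankable(models):
    """Keep only models that have at least one recorded benchmark score."""
    return [m for m in models if len(m.scores) > 0]


# Placeholder entries for illustration only (not real index data).
example = [
    ModelEntry("model-a", {"benchmark-x": 0.5}),
    ModelEntry("model-b"),  # no scores recorded, so it is dropped
    ModelEntry("model-c", {"benchmark-x": 0.7, "benchmark-y": 0.2}),
]
visible = [m.name for m in rankable(example)]
```

Under these assumptions, `visible` holds only the entries that carry a score, which mirrors why an unscored model never reaches the ranked index.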

Vendor: Areas overview. speakleash (253), OpenAI (85), Google (71), Qwen (52), Alibaba (47), Anthropic (44), Microsoft (35), Meta (30), Mistral (30), DeepSeek (28), google (19), meta-llama (19), mistralai (19), Meta AI (15), CYFRAGOVPL (14), Zhipu AI (13), NVIDIA (10), SpeakLeash (10), internlm (10), xAI (10), ByteDance (9), Baidu (8), PLLuM (8), ibm-granite (8), microsoft (8), Amazon (7), Google DeepMind (7), MiniMax (7), Mistral AI (7), Remek (7), Shanghai AI Lab (7), allenai (7), utter-project (7), CohereForAI (6), Microsoft Research (6), Salesforce (6), 01-ai (5), Alibaba Cloud (5), Cohere (5), Moonshot AI (5), NousResearch (5), THUML (5), deepseek-ai (5), DeepMind (4), Facebook AI (4), IBM (4), Meituan (4), Stanford (4), THUDM (4), UC San Diego (4), VikParuchuri (4), gguf-iq (4), nvidia (4), openchat (4), tiiuae (4), Allen AI (3), BAAI (3), Du et al. (3), ForgeCode (3), Fudan University (3), IDEA Research (3), Liao et al. (3), Moonshot.AI (3), Nam Tuan Ly / NII (3), OPI-PG (3), OpenDataLab (3), ViCoS Lab Ljubljana (3), Xiaomi (3), Zhao et al. (3), gguf (3), gguf11bv30 (3), gguf7bv30 (3), upstage (3), plus 247 smaller vendors (291 models).
§ 01 · Computer Vision models

896 models in Computer Vision · page 5 of 18.

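| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 201 | ViTPose-G | | | | 1 | 1 | 1 |
| 202 | VideoMAE ViT-B | | | | 1 | 1 | 1 |
| 203 | cascadetabnet | Unknown | Unknown | Unknown | 1 | 1 | 1 |
| 204 | dots.mocr | RedNote | 3B | Multimodal OCR (3B params) | 1 | 1 | 1 |
| 205 | pMF-H + FD-loss | | N/A | | 1 | 1 | 1 |
| 206 | pil_maskrcnn | ICT, Chinese Academy of Sciences | Unknown | Mask R-CNN based scene text detector | 1 | 1 | 1 |
| 207 | Kimi-K2.5 | Moonshot.AI | | | | 10 | 16 |
| 208 | Qwen2.5-VL-72B | | | | | 14 | 14 |
| 209 | PAN | Unknown | Unknown | Unknown | | 4 | 12 |
| 210 | Qwen3-VL-235B-A22B-Thinking | Qwen | | | | 12 | 12 |
| 211 | Qwen3-VL-8B-Instruct | Qwen | | | | 12 | 12 |
| 212 | SPCNET | Unknown | Unknown | Unknown | | 4 | 12 |
| 213 | TESTR | Unknown | Unknown | Unknown | | 4 | 12 |
| 214 | TextSnake | Unknown | Unknown | Unknown | | 4 | 12 |
| 215 | MiniCPM-o 4.5-Instruct | | | | | 11 | 11 |
| 216 | Qwen2-VL 7B | Alibaba | 7B | | | 11 | 11 |
| 217 | Qwen2-VL-2B | | | | | 10 | 10 |
| 218 | ABCNet v2 | Unknown | Unknown | Unknown | | 4 | 9 |
| 219 | Corner Localization (multi-scale) | Unknown | Unknown | Unknown | | 3 | 9 |
| 220 | DeepSeek-Coder-V2-Instruct | DeepSeek | Unknown | MoE Transformer | | 7 | 9 |
| 221 | DeepSolo (ResNet-50) | Unknown | Unknown | Unknown | | 4 | 9 |
| 222 | FOTS | Unknown | Unknown | Unknown | | 2 | 9 |
| 223 | MGP-STR | Unknown | Unknown | Unknown | | 9 | 9 |
| 224 | Mask TextSpotter | Unknown | Unknown | Unknown | | 3 | 9 |
| 225 | PSENet-1s | Unknown | Unknown | Unknown | | 3 | 9 |
| 226 | Qwen2.5-Coder 32B | Alibaba | 32B | Dense Transformer | | 8 | 9 |
| 227 | RoBERTa | Unknown | Unknown | Unknown | | 9 | 9 |
| 228 | SSTD | Unknown | Unknown | Unknown | | 3 | 9 |
| 229 | SegLink | Unknown | Unknown | Unknown | | 3 | 9 |
| 230 | SwinTextSpotter | Unknown | Unknown | Unknown | | 4 | 9 |
| 231 | DBNet++ (ResNet-18) (736) | Unknown | Unknown | Unknown | | 2 | 8 |
| 232 | DeiT-B | Meta | 86M | Vision Transformer | | 3 | 8 |
| 233 | FAST-B-512 | Unknown | Unknown | Unknown | | 2 | 8 |
| 234 | FAST-B-640 | Unknown | Unknown | Unknown | | 2 | 8 |
| 235 | FAST-B-736 | Unknown | Unknown | Unknown | | 2 | 8 |
| 236 | FAST-S-512 | Unknown | Unknown | Unknown | | 2 | 8 |
| 237 | FAST-S-736 | Unknown | Unknown | Unknown | | 2 | 8 |
| 238 | GPT-4o mini | OpenAI | | Multimodal LLM | | 7 | 8 |
| 239 | Gemini 3 Flash | Google | Undisclosed | | | 7 | 8 |
| 240 | InternVL2-76B | Shanghai AI Lab | 76B | Vision-Language Model | | 5 | 8 |
| 241 | Aria | | | | | 7 | 7 |
| 242 | BEiT-B | Unknown | Unknown | Unknown | | 2 | 7 |
| 243 | DBNet++ (ResNet-18) (800) | Unknown | Unknown | Unknown | | 2 | 7 |
| 244 | DBNet++ (ResNet-50) (800) | Unknown | Unknown | Unknown | | 2 | 7 |
| 245 | DINOv2 (ViT-g/14) | | | | | 7 | 7 |
| 246 | DeepSolo (ResNet-50, TextOCR) | Unknown | Unknown | Unknown | | 3 | 7 |
| 247 | DiT-L | Unknown | Unknown | Unknown | | 2 | 7 |
| 248 | MANGO | Unknown | Unknown | Unknown | | 3 | 7 |
| 249 | MATRN | Research | Unknown | Unknown | | 7 | 7 |
| 250 | Mask R-CNN | Meta AI / FAIR | Unknown | Unknown | | 2 | 7 |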
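
The header reports 896 models on page 5 of 18, and the rows on this page run from 201 to 250. The sketch below shows the pagination arithmetic that is consistent with those figures. The page size of 50 is an assumption inferred from the row numbers, and the function names are illustrative rather than part of any Codesota API.

```python
def page_count(total_items: int, page_size: int) -> int:
    """Number of pages needed to show every item (ceiling division)."""
    return -(-total_items // page_size)


def rank_range(page: int, page_size: int, total_items: int) -> tuple:
    """First and last row number shown on a 1-indexed page."""
    first = (page - 1) * page_size + 1
    # The final page may be only partly filled, so cap at the total.
    last = min(page * page_size, total_items)
    return first, last


TOTAL_MODELS = 896  # from the page header
PAGE_SIZE = 50      # assumption: inferred from rows 201 to 250 on page 5

pages = page_count(TOTAL_MODELS, PAGE_SIZE)
page_five = rank_range(5, PAGE_SIZE, TOTAL_MODELS)
last_page = rank_range(pages, PAGE_SIZE, TOTAL_MODELS)
```

With these inputs the page count comes out to 18 and page 5 spans rows 201 to 250, matching the listing; the last page would hold the remaining 46 rows.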