Codesota · Models1,357 models indexed · 896 match filter
Editorial · Models
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Computer Vision models
896 models in Computer Vision · page 9 of 18.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 401 | MTL-TabNet (WS) | Nam Tuan Ly / NII | — | — | 1 | 3 | |
| 402 | MatchSum (BERT-base) | Unknown | Unknown | Unknown | 1 | 3 | |
| 403 | MatchSum (RoBERTa-base) | Unknown | Unknown | Unknown | 1 | 3 | |
| 404 | Mistral-7B-Instruct-v0.1 | Mistral AI | — | Mistral 7B with instruction tuning | 1 | 3 | |
| 405 | MonkeyOCR-pro-3B | Unknown | 3B | Vision-Language Model | 2 | 3 | |
| 406 | NeuSUM | Unknown | Unknown | Unknown | 1 | 3 | |
| 407 | Neumann et al. * | Unknown | Unknown | Unknown | 1 | 3 | |
| 408 | PEGASUS + SummaReranker | Unknown | Unknown | Unknown | 1 | 3 | |
| 409 | PGNet | Unknown | Unknown | Unknown | 1 | 3 | |
| 410 | PSENet (ResNet-152) | Unknown | Unknown | Unknown | 1 | 3 | |
| 411 | PSENet [67] | Unknown | Unknown | Unknown | 1 | 3 | |
| 412 | PSENet-4s | Unknown | Unknown | Unknown | 1 | 3 | |
| 413 | PixelLink+VGG16 2s MS | Unknown | Unknown | Unknown | 1 | 3 | |
| 414 | Quad_MS | Unknown | Unknown | Unknown | 1 | 3 | |
| 415 | RARE | Unknown | Unknown | Unknown | 3 | 3 | |
| 416 | RCEED | Unknown | Unknown | Unknown | 3 | 3 | |
| 417 | RMIPN | Zheng et al. | Unknown | Region Multiple Information Perception plug-and-play module on DB baseline | 1 | 3 | |
| 418 | RRD∗ | Unknown | Unknown | Unknown | 1 | 3 | |
| 419 | ResNet-152 | Microsoft | 60M | CNN | 2 | 3 | |
| 420 | ResNet-50 | Microsoft | 25M | CNN | 3 | 3 | |
| 421 | SAR | Unknown | Unknown | Unknown | 3 | 3 | |
| 422 | SAST | Unknown | Unknown | Unknown | 1 | 3 | |
| 423 | SRN | Unknown | Unknown | Unknown | 3 | 3 | |
| 424 | SRTS | Unknown | Unknown | Unknown | 1 | 3 | |
| 425 | STAR-Net | Unknown | Unknown | Unknown | 3 | 3 | |
| 426 | Selector+Pointer Generator | Unknown | Unknown | Unknown | 1 | 3 | |
| 427 | Synthesizer (R+V) | Unknown | Unknown | Unknown | 1 | 3 | |
| 428 | TaLK Convolutions (Deep) | Unknown | Unknown | Unknown | 1 | 3 | |
| 429 | TaLK Convolutions (Standard) | Unknown | Unknown | Unknown | 1 | 3 | |
| 430 | TableMaster | Unknown | Unknown | Unknown | 2 | 3 | |
| 431 | TextBPN++ (ResNet-18) | Zhang et al. | Unknown | ResNet-18 + Boundary Transformer | 1 | 3 | |
| 432 | TextBPN++ (ResNet-50) | Zhang et al. (HCIILAB) | Unknown | ResNet-50 + Boundary Transformer (single-scale, no DCN) | 1 | 3 | |
| 433 | TextFiled | Unknown | Unknown | Unknown | 1 | 3 | |
| 434 | TextScanner | Unknown | Unknown | Unknown | 3 | 3 | |
| 435 | Total w/o. joint | Unknown | Unknown | Unknown | 1 | 3 | |
| 436 | Total3D joint | Unknown | Unknown | Unknown | 1 | 3 | |
| 437 | TrOCR | Unknown | Unknown | Unknown | 2 | 3 | |
| 438 | TriSum-J | TriSum Authors | — | BART-large distilled from GPT-3.5 with structured rationale | 1 | 3 | |
| 439 | U-SPEC | Unknown | Unknown | Unknown | 1 | 3 | |
| 440 | UniLM (Abstractive Summarization) | Unknown | Unknown | Unknown | 1 | 3 | |
| 441 | UniXcoder | Microsoft | Unknown | Transformer encoder-decoder | 3 | 3 | |
| 442 | VCGroup | Unknown | Unknown | Unknown | 1 | 3 | |
| 443 | WordSup (VGG16-synth-coco) | Unknown | Unknown | Unknown | 1 | 3 | |
| 444 | XM | Unknown (ICDAR 2021 participant) | — | — | 1 | 3 | |
| 445 | YOLOv10-X | Tsinghua University | — | CNN (Real-time) | 1 | 3 | |
| 446 | Yao et al. | Unknown | Unknown | Unknown | 1 | 3 | |
| 447 | dots.ocr 3B | RedNote HILab | 3B | Vision-Language Model | 2 | 3 | |
| 448 | olmOCR-7B | Allen AI | — | — | 2 | 3 | |
| 449 | qwen2.5-vl-7b | Unknown | Unknown | Unknown | 3 | 3 | |
| 450 | ABCNet | Unknown | Unknown | Unknown | 1 | 2 |