Codesota · Models · 1,357 models indexed · 896 match filter
Every model, measured.
Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.
Vendor: Areas overview; speakleash · 253; OpenAI · 85; Google · 71; Qwen · 52; Alibaba · 47; Anthropic · 44; Microsoft · 35; Meta · 30; Mistral · 30; DeepSeek · 28; google · 19; meta-llama · 19; mistralai · 19; Meta AI · 15; CYFRAGOVPL · 14; Zhipu AI · 13; NVIDIA · 10; SpeakLeash · 10; internlm · 10; xAI · 10; ByteDance · 9; Baidu · 8; PLLuM · 8; ibm-granite · 8; microsoft · 8; Amazon · 7; Google DeepMind · 7; MiniMax · 7; Mistral AI · 7; Remek · 7; Shanghai AI Lab · 7; allenai · 7; utter-project · 7; CohereForAI · 6; Microsoft Research · 6; Salesforce · 6; 01-ai · 5; Alibaba Cloud · 5; Cohere · 5; Moonshot AI · 5; NousResearch · 5; THUML · 5; deepseek-ai · 5; DeepMind · 4; Facebook AI · 4; IBM · 4; Meituan · 4; Stanford · 4; THUDM · 4; UC San Diego · 4; VikParuchuri · 4; gguf-iq · 4; nvidia · 4; openchat · 4; tiiuae · 4; Allen AI · 3; BAAI · 3; Du et al. · 3; ForgeCode · 3; Fudan University · 3; IDEA Research · 3; Liao et al. · 3; Moonshot.AI · 3; Nam Tuan Ly / NII · 3; OPI-PG · 3; OpenDataLab · 3; ViCoS Lab Ljubljana · 3; Xiaomi · 3; Zhao et al. · 3; gguf · 3; gguf11bv30 · 3; gguf7bv30 · 3; upstage · 3; + 247 smaller vendors (291 models)
§ 01 · Computer Vision models
896 models in Computer Vision · page 14 of 18.
| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 651 | FCENet | CVPR 2021 | — | — | 1 | 1 | |
| 652 | FPHR Paragraph Level (~145 dpi) | Unknown | Unknown | Unknown | 1 | 1 | |
| 653 | FPHR+Aug Line Level (~145 dpi) | Unknown | Unknown | Unknown | 1 | 1 | |
| 654 | FPHR+Aug Paragraph Level (~145 dpi) | Unknown | Unknown | Unknown | 1 | 1 | |
| 655 | FireRed-OCR | — | — | — | 1 | 1 | |
| 656 | FireRed-OCR-2B | — | — | — | 1 | 1 | |
| 657 | Flor | Unknown | Unknown | Unknown | 1 | 1 | |
| 658 | FreeReal+DBNet | SJTU | — | — | 1 | 1 | |
| 659 | GPT-4o (Anchored) | OpenAI | — | Multimodal LLM | 1 | 1 | |
| 660 | Gemini Flash 2 | | — | Multimodal LLM | 1 | 1 | |
| 661 | Gemma 3 | | — | — | 1 | 1 | |
| 662 | GoogLeNet | | — | — | 1 | 1 | |
| 663 | Google Cloud Document AI | Google Cloud | Unknown | Managed document understanding service (layout parser) | 1 | 1 | |
| 664 | GraphCodeBERT | Microsoft | 125M | transformer | 1 | 1 | |
| 665 | GraphCodeBERT+AdvFusion | University of Leicester | 125M | transformer | 1 | 1 | |
| 666 | GreedyRel (query: method + article + steps titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 667 | GreedyRel (query: method + article titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 668 | GreedyRel (query: method title) | Unknown | Unknown | Unknown | 1 | 1 | |
| 669 | GreedyRel (query: step + method + article titles) | Unknown | — | extractive | 1 | 1 | |
| 670 | GreedyRel (query: step + method titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 671 | GreedyRel (query: step title) | Unknown | Unknown | Unknown | 1 | 1 | |
| 672 | Grounding DINO | IDEA Research | Unknown | Open-Set Object Detection with Grounded Pre-Training | 1 | 1 | |
| 673 | Grounding DINO L (Swin-L) | — | — | — | 1 | 1 | |
| 674 | IGTR-AR | Yongkun Du et al. | Unknown | Instruction-Guided Transformer (Auto-Regressive variant) | 1 | 1 | |
| 675 | InternImage-H | Shanghai AI Lab | Unknown | Deformable Convolution v3 + Cascade Mask R-CNN | 1 | 1 | |
| 676 | InternImage-H (OneFormer) | PJLab & Tsinghua | — | — | 1 | 1 | |
| 677 | InternVL3 14B | OpenGVLab | — | Vision-Language Model | 1 | 1 | |
| 678 | InternVL3-76B | Shanghai AI Lab | — | — | 1 | 1 | |
| 679 | InternVL3.5-241B | Shanghai AI Lab | — | — | 1 | 1 | |
| 680 | JT-OCR | Unknown | — | — | 1 | 1 | |
| 681 | L3i++ | Unknown | Unknown | Unknown | 1 | 1 | |
| 682 | LISTER | Cheng et al. | Unknown | Length-Insensitive Scene TExt Recognizer with Neighbor Decoder | 1 | 1 | |
| 683 | LPV-S | Research | Unknown | Language-Guided Progressive Vision transformer (Small) | 1 | 1 | |
| 684 | LandingAI | LandingAI | Unknown | Agentic document extraction (ADE) service | 1 | 1 | |
| 685 | LayoutLMV3Large | Unknown | Unknown | Unknown | 1 | 1 | |
| 686 | LayoutLMv2 Large | — | — | — | 1 | 1 | |
| 687 | LayoutLMv2 Large + QG | — | — | — | 1 | 1 | |
| 688 | LayoutLMv2BASE | Unknown | Unknown | Unknown | 1 | 1 | |
| 689 | LayoutLMv2LARGE | Unknown | Unknown | Unknown | 1 | 1 | |
| 690 | LayoutLMv3 | Microsoft | Unknown | Multimodal Transformer (text + layout + image) | 1 | 1 | |
| 691 | LayoutLMv3-Large | Microsoft Research | Unknown | Multimodal Transformer (text + layout + image unified pre-training) | 1 | 1 | |
| 692 | LayoutLMv3BASE | Unknown | Unknown | Unknown | 1 | 1 | |
| 693 | LayoutXLM | Unknown | Unknown | Unknown | 1 | 1 | |
| 694 | LexRank (query: method + article + steps titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 695 | LexRank (query: method + article titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 696 | LexRank (query: method title) | Unknown | Unknown | Unknown | 1 | 1 | |
| 697 | LexRank (query: step + method + article titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 698 | LexRank (query: step + method titles) | Unknown | Unknown | Unknown | 1 | 1 | |
| 699 | LexRank (query: step title) | Unknown | Unknown | Unknown | 1 | 1 | |
| 700 | LiLT[EN-R]BASE | Unknown | Unknown | Unknown | 1 | 1 | |