Codesota · Models1,357 models indexed · 896 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Computer Vision models

896 models in Computer Vision · page 3 of 18.

#ModelVendorParametersArchitectureSOTABenchmarksResults
101Mistral OCR 3MistralVision-Language Model126
102REL-RWMD k-NNUnknownUnknownUnknown166
103SBDUnknownUnknownUnknown126
104TRDLUUnknownUnknownUnknown116
105TextBoxes++_MSUnknownUnknownUnknown126
106TextMambaZhao et al.UnknownMamba (SSM) + CNN126
107TransformerUnknownUnknownUnknown166
108VSRUnknownUnknownUnknown116
109BiLSTM (Europarl)UnknownUnknownUnknown155
110EasyOCRJaidedAIDeep Learning OCR135
111GPT-2-Medium (prefix-tuning)OpenAI355MTransformer115
112Infinity-Parser2-Pro155
113PaddleOCR-VLBaidu0.9B-7BVision-Language Model125
114AIMv2 ViT-3B/14 + Llama 3.0 8B144
115Bottom-Up SumUnknownUnknownUnknown114
116FAST-T-448UnknownUnknownUnknown114
117Yet Another Text RecognizerUnknownUnknownUnknown144
118APE-LargeTsinghua / MEGVIIUnknownAligned vision encoder + region-text alignment with EVA-02 ViT-L backbone113
119ASNMF-SRPZhong and Gao113
120CharNet H-88 (single-scale)UnknownUnknownUnknown113
121DBNet++ (ResNet-50) (1024)Liao et al.UnknownResNet-50 + Differentiable Binarization + Adaptive Scale Fusion113
122DeepSolo (with pre-training)ViTAE-TransformerUnknownDETR-like Transformer decoder with explicit points113
123EoMT (ViT-L)133
124InternImage-HShanghai AI LabDeformable Convolution133
125KB-to-Language Generation ModelUnknownUnknownUnknown113
126MinerU 2.5OpenDataLabDocument extraction pipeline123
127Pixel-level RCUnknownUnknownUnknown133
128RapidOCRUnknownUnknownUnknown113
129Re0UnknownUnknownUnknown113
130SumHiSSumHiS AuthorsExtractive summarization exploiting hidden document structure113
131V-JEPA 2 ViT-g (1B, 384px)133
132BEiT-L+122
133BioGPTUnknownUnknownUnknown112
134BioLinkBERT (large)UnknownUnknownUnknown112
135CodeT5-baseSalesforceT5 encoder-decoder pretrained on code122
136DTrOCRUnknownUnknownUnknown122
137Gemini 2.0 FlashGoogleMultimodal LLM122
138HEADoC-Large90.58MTransformer122
139I2L-STRIPSUnknownUnknownUnknown122
140LBDMUnknownUnknownUnknown112
141MAGNETUnknownUnknownUnknown122
142MBDUnknownUnknownUnknown112
143PASTAUnknownUnknownUnknown112
144SEMv3IFLYTEK / USTC (Zhang et al.)UnknownKeypoint Offset Regression (KOR) module; split-and-merge paradigm for table separation line detection112
145SPECTERUnknownUnknownUnknown122
146SciNCLUnknownUnknownUnknown122
147Selective SearchUnknownUnknownUnknown112
148SpanUnknownUnknownUnknown112
149Start, Follow, ReadUnknownUnknownUnknown112
150StrucTexTv2 (small)UnknownUnknownUnknown122