Codesota · Models · 1,357 models indexed · 896 match filter

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor: Areas overview · speakleash (253) · OpenAI (85) · Google (71) · Qwen (52) · Alibaba (47) · Anthropic (44) · Microsoft (35) · Meta (30) · Mistral (30) · DeepSeek (28) · google (19) · meta-llama (19) · mistralai (19) · Meta AI (15) · CYFRAGOVPL (14) · Zhipu AI (13) · NVIDIA (10) · SpeakLeash (10) · internlm (10) · xAI (10) · ByteDance (9) · Baidu (8) · PLLuM (8) · ibm-granite (8) · microsoft (8) · Amazon (7) · Google DeepMind (7) · MiniMax (7) · Mistral AI (7) · Remek (7) · Shanghai AI Lab (7) · allenai (7) · utter-project (7) · CohereForAI (6) · Microsoft Research (6) · Salesforce (6) · 01-ai (5) · Alibaba Cloud (5) · Cohere (5) · Moonshot AI (5) · NousResearch (5) · THUML (5) · deepseek-ai (5) · DeepMind (4) · Facebook AI (4) · IBM (4) · Meituan (4) · Stanford (4) · THUDM (4) · UC San Diego (4) · VikParuchuri (4) · gguf-iq (4) · nvidia (4) · openchat (4) · tiiuae (4) · Allen AI (3) · BAAI (3) · Du et al. (3) · ForgeCode (3) · Fudan University (3) · IDEA Research (3) · Liao et al. (3) · Moonshot.AI (3) · Nam Tuan Ly / NII (3) · OPI-PG (3) · OpenDataLab (3) · ViCoS Lab Ljubljana (3) · Xiaomi (3) · Zhao et al. (3) · gguf (3) · gguf11bv30 (3) · gguf7bv30 (3) · upstage (3) · + 247 smaller vendors (291 models)
§ 01 · Computer Vision models

896 models in Computer Vision · page 6 of 18.

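| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 251 | Massively Multilingual Sentence Embeddings | Unknown | Unknown | Unknown | | 7 | 7 |
| 252 | MultiCCA + CNN | Unknown | Unknown | Unknown | | 7 | 7 |
| 253 | PARSeq | Research | Unknown | Scene Text Recognition with Permuted Autoregressive Sequence Models | | 6 | 7 |
| 254 | SRFormer (ResNet-50) | Unknown | Unknown | Unknown | | 3 | 7 |
| 255 | VideoLLaMA3 7B | | | | | 7 | 7 |
| 256 | pre-train w/ code only | Unknown | Unknown | Unknown | | 7 | 7 |
| 257 | seq2seq | Unknown | Unknown | Unknown | | 7 | 7 |
| 258 | CDistNet (Ours) | Unknown | Unknown | Unknown | | 6 | 6 |
| 259 | CRNN | Unknown | Unknown | Unknown | | 5 | 6 |
| 260 | CharNet H-88 | Unknown | Unknown | Unknown | | 2 | 6 |
| 261 | CharNet H-88 (multi-scale) | Unknown | Unknown | Unknown | | 2 | 6 |
| 262 | DPAN | Unknown | Unknown | Unknown | | 6 | 6 |
| 263 | DiffusionSTR | Unknown | Unknown | Unknown | | 6 | 6 |
| 264 | EK-Net | Zhu et al. | Unknown | ResNet-18 + Expand Kernel Distance | | 2 | 6 |
| 265 | FOTS MS | Unknown | Unknown | Unknown | | 2 | 6 |
| 266 | FTSN + MNMS | Unknown | Unknown | Unknown | | 2 | 6 |
| 267 | GLAM | Unknown | Unknown | Unknown | | 1 | 6 |
| 268 | GNNets | Unknown | Unknown | Unknown | | 2 | 6 |
| 269 | HTR-ConvText | DAIR-Group | 65.9M | CNN+Transformer hybrid (ConvText block) | | 3 | 6 |
| 270 | HTR-VT | Unknown | Unknown | Unknown | | 3 | 6 |
| 271 | InternVL3-78B | Shanghai AI Lab | 78B | Vision-Language Model | | 5 | 6 |
| 272 | LayoutLMv3-B | Unknown | Unknown | Unknown | | 1 | 6 |
| 273 | PAN-640 | Unknown | Unknown | Unknown | | 2 | 6 |
| 274 | PixelLink+VGG16 2s | Unknown | Unknown | Unknown | | 2 | 6 |
| 275 | ResNext-101-32×8d | Unknown | Unknown | Unknown | | 1 | 6 |
| 276 | S-GTR | Unknown | Unknown | Unknown | | 6 | 6 |
| 277 | SLPR | Unknown | Unknown | Unknown | | 2 | 6 |
| 278 | TextBPN++ (ResNet-50+DCN) | Zhang et al. | Unknown | ResNet-50 with Deformable Convolution + Boundary Transformer | | 2 | 6 |
| 279 | TrOCR-base 334M | Unknown | Unknown | Unknown | | 6 | 6 |
| 280 | TrOCR-large 558M | Unknown | Unknown | Unknown | | 6 | 6 |
| 281 | UDoc | Unknown | Unknown | Unknown | | 1 | 6 |
| 282 | VAN | Unknown | Unknown | Unknown | | 3 | 6 |
| 283 | VideoLLaMA3 2B | | | | | 6 | 6 |
| 284 | WordSup (VGG16-synth-icdar) | Unknown | Unknown | Unknown | | 2 | 6 |
| 285 | ABINet-LV | Fang et al. | Unknown | ResNet + Bidirectional Language Model (LV) | | 5 | 5 |
| 286 | BART-base (STSM) | Meta | 139M | Transformer | | 1 | 5 |
| 287 | CodeBERT (RTD) | Unknown | Unknown | Unknown | | 5 | 5 |
| 288 | DPText-DETR (ResNet-50) | Unknown | Unknown | Unknown | | 2 | 5 |
| 289 | FLAN-T5-base (STSM) | Google | 250M | Transformer | | 1 | 5 |
| 290 | FactJointGT | Unknown | Unknown | Unknown | | 1 | 5 |
| 291 | GLASS | Unknown | Unknown | Unknown | | 2 | 5 |
| 292 | GPT-2-Medium (fine-tuning) | OpenAI | 355M | Transformer | | 1 | 5 |
| 293 | HTLM (prefix-tuning) | Unknown | Unknown | Transformer | | 1 | 5 |
| 294 | JointGT Baseline | Unknown | Unknown | Unknown | | 1 | 5 |
| 295 | MaskTextSpotter v3 | Unknown | Unknown | Unknown | | 2 | 5 |
| 296 | MiniCPM-Llama3-V 2.5 | | | | | 5 | 5 |
| 297 | MiniCPM-V 4.6-Thinking (16x) | | | | | 5 | 5 |
| 298 | Qwen2.5-VL 72B | Alibaba | 72B | Vision-Language Model | | 5 | 5 |
| 299 | SIGA_T | Unknown | Unknown | Unknown | | 5 | 5 |
| 300 | SPTS v2 | Unknown | Unknown | Unknown | | 2 | 5 |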