Codesota · Models1,357 models indexed · 896 match filter
Editorial · Models

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overviewspeakleash · 253OpenAI · 85Google · 71Qwen · 52Alibaba · 47Anthropic · 44Microsoft · 35Meta · 30Mistral · 30DeepSeek · 28google · 19meta-llama · 19mistralai · 19Meta AI · 15CYFRAGOVPL · 14Zhipu AI · 13NVIDIA · 10SpeakLeash · 10internlm · 10xAI · 10ByteDance · 9Baidu · 8PLLuM · 8ibm-granite · 8microsoft · 8Amazon · 7Google DeepMind · 7MiniMax · 7Mistral AI · 7Remek · 7Shanghai AI Lab · 7allenai · 7utter-project · 7CohereForAI · 6Microsoft Research · 6Salesforce · 601-ai · 5Alibaba Cloud · 5Cohere · 5Moonshot AI · 5NousResearch · 5THUML · 5deepseek-ai · 5DeepMind · 4Facebook AI · 4IBM · 4Meituan · 4Stanford · 4THUDM · 4UC San Diego · 4VikParuchuri · 4gguf-iq · 4nvidia · 4openchat · 4tiiuae · 4Allen AI · 3BAAI · 3Du et al. · 3ForgeCode · 3Fudan University · 3IDEA Research · 3Liao et al. · 3Moonshot.AI · 3Nam Tuan Ly / NII · 3OPI-PG · 3OpenDataLab · 3ViCoS Lab Ljubljana · 3Xiaomi · 3Zhao et al. · 3gguf · 3gguf11bv30 · 3gguf7bv30 · 3upstage · 3+ 247 smaller vendors (291 models)
§ 01 · Computer Vision models

896 models in Computer Vision · page 13 of 18.

#ModelVendorParametersArchitectureSOTABenchmarksResults
601DETR-DC511
602DETR-DC5-R10111
603DETR-R10111
604DINO (ResNet-50)Research (IDEA Research)UnknownDETR with Improved DeNoising Anchor Boxes + ResNet-50 backbone11
605DINO (Swin-L)ResearchTransformer Detector11
606DINO (Swin-L)IDEA ResearchUnknownDETR with Improved deNoising anchOr boxes11
607DINO-ViT-LIDEA-Research11
608DINOv2 (ViT-g) + LinearMeta AIUnknownSelf-supervised ViT-giant + linear head11
609DINOv3 + Plain-DETR11
610DINOv3 + linear probe11
611DPText-DETRAAAI 202311
612DRRGCVPR 202011
613DaterUnknownUnknownUnknown11
614DeepLabV3+UnknownUnknownUnknown11
615Deformable DETR11
616Deformable DETR + iterative bounding box refinement11
617Deformable DETR + iterative bounding box refinement + two-stage Deformable DETR11
618DiT-BUnknownUnknownUnknown11
619DiT-B (Cascade)UnknownUnknownUnknown11
620DiT-BaseMicrosoftVision Transformer (self-supervised)11
621DiT-L (Cascade R-CNN)Microsoft ResearchUnknownDocument Image Transformer (BEiT-based) + Cascade R-CNN detection head11
622DiT-LargeMicrosoftUnknownDocument Image Transformer Large11
623DistillCodeT5FSOFT AI LabTransformer encoder-decoder11
624DoPTA (224×224)Transformer11
625DoPTA-HR (512×512)Transformer11
626DocBert [DOCBERT]UnknownUnknownUnknown11
627DocFormer largeUnknownUnknownUnknown11
628DocFormerBASEUnknownUnknownUnknown11
629DocLayout-YOLOUnknownUnknownUnknown11
630DocXClassifier-BUnknownUnknownUnknown11
631DocXClassifier-FPNSaifullah et al.CNN with Feature Pyramid Network11
632DocXClassifier-LUnknownUnknownUnknown11
633DoclingIBM ResearchUnknownOpen-source document parsing toolkit (layout + OCR + table)11
634DolphinResearch11
635Dolphin-1.5ByteDance11
636Dolphin-v2ByteDance11
637DonutUnknownUnknownUnknown11
638Dots OCR 1.5RedNote HILabUnknownOCR-specialised open-weight VLM11
639EK-Net++Research11
640ESALEEast China Normal University125Mtransformer11
641EVA-02 (ViT-L/14+)BAAI304MEVA-02 ViT-L/14+, public data only11
642EVA-02-LBAAIUnknownEVA-02 Large + Cascade Mask R-CNN11
643EVA-02-L (LVIS)BAAIUnknownEVA-02 Large + ViTDet11
644Easter2.0UnknownUnknownUnknown11
645Eff-GNN + Word2Vec [word2vec]UnknownUnknownUnknown11
646Eff-GNN + Word2Vec [word2vec] + Image EmbeddingUnknownUnknownUnknown11
647EfficientDet-D7xGoogleEfficientNet+BiFPN11
648EfficientNet-B0Google5.3MCNN11
649EfficientNetV2-LGoogle120MCNN11
650ExtendExtendUnknownDocument parsing + extraction API11