Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor:Areas overview speakleash · 253 OpenAI · 85 Google · 71 Qwen · 52 Alibaba · 47 Anthropic · 44 Microsoft · 35 Meta · 30 Mistral · 30 DeepSeek · 28 google · 19 meta-llama · 19 mistralai · 19 Meta AI · 15 CYFRAGOVPL · 14 Zhipu AI · 13 NVIDIA · 10 SpeakLeash · 10 internlm · 10 xAI · 10 ByteDance · 9 Baidu · 8 PLLuM · 8 ibm-granite · 8 microsoft · 8 Amazon · 7 Google DeepMind · 7 MiniMax · 7 Mistral AI · 7 Remek · 7 Shanghai AI Lab · 7 allenai · 7 utter-project · 7 CohereForAI · 6 Microsoft Research · 6 Salesforce · 6 01-ai · 5 Alibaba Cloud · 5 Cohere · 5 Moonshot AI · 5 NousResearch · 5 THUML · 5 deepseek-ai · 5 DeepMind · 4 Facebook AI · 4 IBM · 4 Meituan · 4 Stanford · 4 THUDM · 4 UC San Diego · 4 VikParuchuri · 4 gguf-iq · 4 nvidia · 4 openchat · 4 tiiuae · 4 Allen AI · 3 BAAI · 3 Du et al. · 3 ForgeCode · 3 Fudan University · 3 IDEA Research · 3 Liao et al. · 3 Moonshot.AI · 3 Nam Tuan Ly / NII · 3 OPI-PG · 3 OpenDataLab · 3 ViCoS Lab Ljubljana · 3 Xiaomi · 3 Zhao et al. · 3 gguf · 3 gguf11bv30 · 3 gguf7bv30 · 3 upstage · 3+ 247 smaller vendors (291 models)

§ 01 · Computer Vision models

896 models in Computer Vision · page 13 of 18.

#	Model	Vendor	Parameters	Architecture	SOTA	Benchmarks	Results
601	DETR-DC5	—	—	—	—	1	1
602	DETR-DC5-R101	—	—	—	—	1	1
603	DETR-R101	—	—	—	—	1	1
604	DINO (ResNet-50)	Research (IDEA Research)	Unknown	DETR with Improved DeNoising Anchor Boxes + ResNet-50 backbone	—	1	1
605	DINO (Swin-L)	Research	—	Transformer Detector	—	1	1
606	DINO (Swin-L)	IDEA Research	Unknown	DETR with Improved deNoising anchOr boxes	—	1	1
607	DINO-ViT-L	IDEA-Research	—	—	—	1	1
608	DINOv2 (ViT-g) + Linear	Meta AI	Unknown	Self-supervised ViT-giant + linear head	—	1	1
609	DINOv3 + Plain-DETR	—	—	—	—	1	1
610	DINOv3 + linear probe	—	—	—	—	1	1
611	DPText-DETR	AAAI 2023	—	—	—	1	1
612	DRRG	CVPR 2020	—	—	—	1	1
613	Dater	Unknown	Unknown	Unknown	—	1	1
614	DeepLabV3+	Unknown	Unknown	Unknown	—	1	1
615	Deformable DETR	—	—	—	—	1	1
616	Deformable DETR + iterative bounding box refinement	—	—	—	—	1	1
617	Deformable DETR + iterative bounding box refinement + two-stage Deformable DETR	—	—	—	—	1	1
618	DiT-B	Unknown	Unknown	Unknown	—	1	1
619	DiT-B (Cascade)	Unknown	Unknown	Unknown	—	1	1
620	DiT-Base	Microsoft	—	Vision Transformer (self-supervised)	—	1	1
621	DiT-L (Cascade R-CNN)	Microsoft Research	Unknown	Document Image Transformer (BEiT-based) + Cascade R-CNN detection head	—	1	1
622	DiT-Large	Microsoft	Unknown	Document Image Transformer Large	—	1	1
623	DistillCodeT5	FSOFT AI Lab	—	Transformer encoder-decoder	—	1	1
624	DoPTA (224×224)	—	—	Transformer	—	1	1
625	DoPTA-HR (512×512)	—	—	Transformer	—	1	1
626	DocBert [DOCBERT]	Unknown	Unknown	Unknown	—	1	1
627	DocFormer large	Unknown	Unknown	Unknown	—	1	1
628	DocFormerBASE	Unknown	Unknown	Unknown	—	1	1
629	DocLayout-YOLO	Unknown	Unknown	Unknown	—	1	1
630	DocXClassifier-B	Unknown	Unknown	Unknown	—	1	1
631	DocXClassifier-FPN	Saifullah et al.	—	CNN with Feature Pyramid Network	—	1	1
632	DocXClassifier-L	Unknown	Unknown	Unknown	—	1	1
633	Docling	IBM Research	Unknown	Open-source document parsing toolkit (layout + OCR + table)	—	1	1
634	Dolphin	Research	—	—	—	1	1
635	Dolphin-1.5	ByteDance	—	—	—	1	1
636	Dolphin-v2	ByteDance	—	—	—	1	1
637	Donut	Unknown	Unknown	Unknown	—	1	1
638	Dots OCR 1.5	RedNote HILab	Unknown	OCR-specialised open-weight VLM	—	1	1
639	EK-Net++	Research	—	—	—	1	1
640	ESALE	East China Normal University	125M	transformer	—	1	1
641	EVA-02 (ViT-L/14+)	BAAI	304M	EVA-02 ViT-L/14+, public data only	—	1	1
642	EVA-02-L	BAAI	Unknown	EVA-02 Large + Cascade Mask R-CNN	—	1	1
643	EVA-02-L (LVIS)	BAAI	Unknown	EVA-02 Large + ViTDet	—	1	1
644	Easter2.0	Unknown	Unknown	Unknown	—	1	1
645	Eff-GNN + Word2Vec [word2vec]	Unknown	Unknown	Unknown	—	1	1
646	Eff-GNN + Word2Vec [word2vec] + Image Embedding	Unknown	Unknown	Unknown	—	1	1
647	EfficientDet-D7x	Google	—	EfficientNet+BiFPN	—	1	1
648	EfficientNet-B0	Google	5.3M	CNN	—	1	1
649	EfficientNetV2-L	Google	120M	CNN	—	1	1
650	Extend	Extend	Unknown	Document parsing + extraction API	—	1	1