Codesota · Models · 1,357 models indexed · 896 match filter

Every model, measured.

Start with a research area, drill into a vendor, or page through the full index. Only models with at least one benchmark score appear — a model without a recorded score can’t be ranked.

Vendor: Areas overview | speakleash · 253 | OpenAI · 85 | Google · 71 | Qwen · 52 | Alibaba · 47 | Anthropic · 44 | Microsoft · 35 | Meta · 30 | Mistral · 30 | DeepSeek · 28 | google · 19 | meta-llama · 19 | mistralai · 19 | Meta AI · 15 | CYFRAGOVPL · 14 | Zhipu AI · 13 | NVIDIA · 10 | SpeakLeash · 10 | internlm · 10 | xAI · 10 | ByteDance · 9 | Baidu · 8 | PLLuM · 8 | ibm-granite · 8 | microsoft · 8 | Amazon · 7 | Google DeepMind · 7 | MiniMax · 7 | Mistral AI · 7 | Remek · 7 | Shanghai AI Lab · 7 | allenai · 7 | utter-project · 7 | CohereForAI · 6 | Microsoft Research · 6 | Salesforce · 6 | 01-ai · 5 | Alibaba Cloud · 5 | Cohere · 5 | Moonshot AI · 5 | NousResearch · 5 | THUML · 5 | deepseek-ai · 5 | DeepMind · 4 | Facebook AI · 4 | IBM · 4 | Meituan · 4 | Stanford · 4 | THUDM · 4 | UC San Diego · 4 | VikParuchuri · 4 | gguf-iq · 4 | nvidia · 4 | openchat · 4 | tiiuae · 4 | Allen AI · 3 | BAAI · 3 | Du et al. · 3 | ForgeCode · 3 | Fudan University · 3 | IDEA Research · 3 | Liao et al. · 3 | Moonshot.AI · 3 | Nam Tuan Ly / NII · 3 | OPI-PG · 3 | OpenDataLab · 3 | ViCoS Lab Ljubljana · 3 | Xiaomi · 3 | Zhao et al. · 3 | gguf · 3 | gguf11bv30 · 3 | gguf7bv30 · 3 | upstage · 3 | + 247 smaller vendors (291 models)
§ 01 · Computer Vision models

896 models in Computer Vision · page 2 of 18.

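| # | Model | Vendor | Parameters | Architecture | SOTA | Benchmarks | Results |
|---|---|---|---|---|---|---|---|
| 051 | MultiFiT, pseudo | Unknown | Unknown | Unknown | 2 | 7 | 7 |
| 052 | PMTD* | Unknown | Unknown | Unknown | 2 | 2 | 6 |
| 053 | FactT5B | Unknown | Unknown | Unknown | 2 | 1 | 5 |
| 054 | GPT-2-Large (prefix-tuning) | OpenAI | 774M | Transformer | 2 | 1 | 5 |
| 055 | clearOCR | TeamQuest | | Traditional OCR | 2 | 1 | 5 |
| 056 | HTR-VT(line-level) | Unknown | Unknown | Unknown | 2 | 2 | 4 |
| 057 | AIMv2 ViT-3B/14 448px | | | | 2 | 3 | 3 |
| 058 | DnC-SC | Unknown | Unknown | Unknown | 2 | 1 | 3 |
| 059 | GLM-OCR | Zhipu AI | | | 2 | 2 | 3 |
| 060 | HDLTex | Unknown | Unknown | Unknown | 2 | 3 | 3 |
| 061 | Hierarchical Table Recognizer | Takaya Kawakatsu | | | 2 | 1 | 3 |
| 062 | Holistic | Unknown | Unknown | Unknown | 2 | 1 | 3 |
| 063 | KD-LSTMreg | Unknown | Unknown | Unknown | 2 | 3 | 3 |
| 064 | RetinaNet | Unknown | Unknown | Unknown | 2 | 2 | 3 |
| 065 | Scrambled code + broken (alter) | Unknown | Unknown | Unknown | 2 | 1 | 3 |
| 066 | Vision Transformer (ViT-H/14) | | | | 2 | 3 | 3 |
| 067 | ABINet-LV+TPS++ | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 068 | Accurate Content Copying | | | | 2 | 1 | 2 |
| 069 | Bert | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 070 | Biinclusion (Euro500kReuters) | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 071 | BilBOWA | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 072 | CLIP4STR-H (DFN-5B) | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 073 | CV-Group | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 074 | ChuLo | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 075 | DINO-X | IDEA Research | Unknown | Unified vision model with DINO-based detection head + large language model | 2 | 1 | 2 |
| 076 | DeepPyramidion | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 077 | HunyuanOCR (1B) | | Unknown | | 2 | 2 | 2 |
| 078 | JSTR | Fujitake | Unknown | DTrOCR + judgment module for image-text matching to reduce misrecognition | 2 | 2 | 2 |
| 079 | MPAD-path | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 080 | OmniParser | Alibaba | | Unified framework: text spotting, KIE, table recognition | 2 | 1 | 2 |
| 081 | PyLaia (human transcriptions + random split) | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 082 | SSD512 (VGG-16) | Google / UNC | ~27M | Single-shot multibox detector with VGG-16 backbone, 512x512 input | 2 | 1 | 2 |
| 083 | VTM | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 084 | VaeDiff-DocRE | Unknown | Unknown | Unknown | 2 | 2 | 2 |
| 085 | VisualWordGrid | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 086 | XLNet | Unknown | Unknown | Unknown | 2 | 1 | 2 |
| 087 | Gemini 3 Pro | Google | Undisclosed | | 1 | 11 | 13 |
| 088 | MixNet | Unknown | Unknown | Unknown | 1 | 4 | 13 |
| 089 | CLIP4STR-B | Research | Unknown | Unknown | 1 | 12 | 12 |
| 090 | Qwen3-VL-235B-A22B-Instruct | Qwen | | | 1 | 12 | 12 |
| 091 | Corner-based Region Proposals | Unknown | Unknown | Unknown | 1 | 3 | 9 |
| 092 | olmOCR v0.4.0 | Allen AI | | OCR Pipeline | 1 | 1 | 9 |
| 093 | DoPTA | Unknown | Unknown | Unknown | 1 | 3 | 8 |
| 094 | FAST-T-736 | Unknown | Unknown | Unknown | 1 | 2 | 8 |
| 095 | A3S | Unknown | Unknown | Unknown | 1 | 3 | 7 |
| 096 | CodeBERT (MLM) | Unknown | Unknown | Unknown | 1 | 7 | 7 |
| 097 | CodeBERT (MLM+RTD) | Unknown | Unknown | Unknown | 1 | 7 | 7 |
| 098 | SPTS | Unknown | Unknown | Unknown | 1 | 3 | 7 |
| 099 | CPPD | Unknown | Unknown | Unknown | 1 | 6 | 6 |
| 100 | Intern-S1-Pro | Shanghai AI Lab | | | 1 | 5 | 6 |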