Codesota · Models · InternVL2-76BShanghai AI Lab8 results · 5 benchmarks
Model card

InternVL2-76B.

Shanghai AI Labopen-source76B paramsVision-Language ModelMIT
§ 01 · Benchmarks

Every benchmark InternVL2-76B has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01VQA v2.0Multimodal · Visual Question Answeringaccuracy87.2%#2/72024-04-25source ↗
02TextVQAMultimodal · Visual Question Answeringaccuracy84.4%#3/92024-04-25source ↗
03CC-OCRComputer Vision · General OCR Capabilitiesmulti-scene-f176.9%#3/9source ↗
04MMBenchMultimodal · Visual Question Answeringaccuracy86.5%#4/82024-04-25source ↗
05CC-OCRComputer Vision · General OCR Capabilitieskie-f161.6%#5/5source ↗
06CC-OCRComputer Vision · General OCR Capabilitiesdocument-parsing35.3%#6/6source ↗
07CC-OCRComputer Vision · General OCR Capabilitiesmultilingual-f146.6%#6/8source ↗
08MMMUMultimodal · Visual Question Answeringaccuracy67.4%#13/182024-04-25source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where InternVL2-76B actually performs.

Computer Vision
1
benchmark
avg rank #5.0
Multimodal
4
benchmarks
avg rank #5.5
§ 03 · Papers

1 paper with results for InternVL2-76B.

  1. 2024-04-25· Multimodal· 4 results

    InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

§ 04 · Related models

Other Shanghai AI Lab models scored on Codesota.

InternImage-H
2 results · 1 SOTA
InternImage-H
Unknown params · 1 result
InternVL3-76B
1 result
InternVL3-78B
78B params · 1 result
InternVL3.5-241B
1 result
InternImage-XL
0 results
TCP
0 results
§ 05 · Sources & freshness

Where these numbers come from.

arxiv
4
results
cc-ocr-paper
3
results
alphaxiv-leaderboard
1
result
7 of 8 rows marked verified.