Codesota · Models · InternVL2-76BShanghai AI Lab8 results · 5 benchmarks
Model card

InternVL2-76B.

Shanghai AI Labopen-source76B paramsVision-Language ModelMIT
§ 02 · Benchmarks

Every benchmark InternVL2-76B has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01VQA v2.0Multimodal · Visual Question Answeringaccuracy87.2%#2/162024-04-25source ↗
02CC-OCRComputer Vision · General OCR Capabilitiesmulti-scene-f176.9%#3/9source ↗
03CC-OCRComputer Vision · General OCR Capabilitieskie-f161.6%#5/5source ↗
04TextVQAMultimodal · Visual Question Answeringaccuracy84.4%#6/232024-04-25source ↗
05CC-OCRComputer Vision · General OCR Capabilitiesdocument-parsing35.3%#6/6source ↗
06CC-OCRComputer Vision · General OCR Capabilitiesmultilingual-f146.6%#6/8source ↗
07MMBenchMultimodal · Visual Question Answeringaccuracy86.5%#10/202024-04-25source ↗
08MMMUMultimodal · Visual Question Answeringaccuracy67.4%#17/302024-04-25source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where InternVL2-76B actually performs.

Computer Vision
1
benchmark
avg rank #5.0
Multimodal
4
benchmarks
avg rank #8.8
§ 04 · Papers

1 paper with results for InternVL2-76B.

  1. 2024-04-25· Multimodal· 4 results

    InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

§ 05 · Related models

Other Shanghai AI Lab models scored on Codesota.

InternImage-H
2 results · 1 SOTA
Intern-S1-Pro
5 results
InternVL3-78B
78B params · 2 results
InternImage-H
Unknown params · 1 result
InternVL3-76B
1 result
InternVL3.5-241B
1 result
InternImage-XL
0 results
TCP
0 results
§ 06 · Sources & freshness

Where these numbers come from.

arxiv
4
results
cc-ocr-paper
3
results
alphaxiv-leaderboard
1
result
7 of 8 rows marked verified.