Codesota · Models · GPT-4VUnknown4 results · 4 benchmarks
Model card

GPT-4V.

UnknownmultimodalUnknown paramsTransformer

GPT-4 with Vision. First major multimodal GPT-4 release, Sept 2023. Evaluated on MMMU, VQA, TextVQA. Source: GPT-4 Technical Report.

§ 01 · Benchmarks

Every benchmark GPT-4V has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01MMBenchMultimodal · Visual Question Answeringaccuracy75.8%#6/82023-03-15source ↗
02TextVQAMultimodal · Visual Question Answeringaccuracy78.0%#6/92023-03-15source ↗
03VQA v2.0Multimodal · Visual Question Answeringaccuracy77.2%#7/72023-03-15source ↗
04MMMUMultimodal · Visual Question Answeringaccuracy56.8%#18/182023-03-15source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where GPT-4V actually performs.

Multimodal
4
benchmarks
avg rank #9.3
§ 03 · Papers

1 paper with results for GPT-4V.

  1. 2023-03-15· Natural Language Processing· 4 results

    GPT-4 Technical Report

§ 04 · Related models

Other Unknown models scored on Codesota.

fglihai
Unknown params · 6 results · 1 SOTA
CLIP4STR-L
Unknown params · 1 result · 1 SOTA
USYD NLP_CS29-2
Unknown params · 6 results
Corner-based Region Proposals
Unknown params · 3 results
EAST + VGG16
Unknown params · 3 results
SSTD
Unknown params · 3 results
TextBoxes++_MS
Unknown params · 3 results
WordSup (VGG16-synth-coco)
Unknown params · 3 results
§ 05 · Sources & freshness

Where these numbers come from.

arxiv
4
results
4 of 4 rows marked verified.