Home/OCR/All Results

All Verified Results

286 benchmark results across 51 datasets. Every data point links to its source.

286
Total Results
51
Benchmarks
141
Models

JSON API: Download raw data at /data/benchmarks.json

Complete Results Table

ModelDatasetMetricValueSource
mistral-ocr-2512codesota-verificationpages-per-second1.22codesota-verified
mistral-ocr-2512omnidocbenchcomposite79.75codesota-verified
coca-finetunedimagenet-1ktop-1-accuracy91google-research
vit-g-14imagenet-1ktop-1-accuracy90.45google-research
convnext-v2-hugeimagenet-1ktop-1-accuracy88.9meta-research
vit-h-14imagenet-1ktop-1-accuracy88.55google-research
swin-largeimagenet-1ktop-1-accuracy87.3microsoft-research
efficientnet-v2-limagenet-1ktop-1-accuracy85.7google-research
deit-b-distilledimagenet-1ktop-1-accuracy85.2meta-research
efficientnet-b7imagenet-1ktop-1-accuracy84.4google-research
deit-bimagenet-1ktop-1-accuracy83.1meta-research
convnext-v2-tinyimagenet-1ktop-1-accuracy83meta-research
vit-l-16imagenet-1ktop-1-accuracy82.7google-research
vit-b-16imagenet-1ktop-1-accuracy81.2google-research
resnet-50-a3imagenet-1ktop-1-accuracy80.4timm-research
resnet-152imagenet-1ktop-1-accuracy78.6microsoft-research
efficientnet-b0imagenet-1ktop-1-accuracy77.1google-research
resnet-50imagenet-1ktop-1-accuracy76.15pytorch-vision
swin-v2-largeimagenet-v2top-1-accuracy84microsoft-research
convnext-v2-hugeimagenet-v2top-1-accuracy80.5meta-research
vit-h-14cifar-100accuracy94.55google-research
vit-b-16cifar-100accuracy91.48huggingface
deit-b-distilledcifar-10accuracy99.1meta-research
convnext-v2-basecifar-10accuracy98.7meta-research
resnet-50cifar-10accuracy96.01cutout-paper
efficientnet-b7cifar-100accuracy91.7google-research
resnet-50cifar-100accuracy78.04cutout-paper
paddleocr-vlomnidocbenchcomposite92.86alphaxiv-leaderboard
paddleocr-vl-0.9bomnidocbenchcomposite92.56alphaxiv-leaderboard
mineru-2.5omnidocbenchcomposite90.67alphaxiv-leaderboard
qwen3-vl-235bomnidocbenchcomposite89.15alphaxiv-leaderboard
monkeyocr-pro-3bomnidocbenchcomposite88.85alphaxiv-leaderboard
gemini-25-proomnidocbenchcomposite88.03alphaxiv-leaderboard
qwen25-vlomnidocbenchcomposite87.02alphaxiv-leaderboard
ocrverse-4bomnidocbenchcomposite88.56github-leaderboard
dots-ocr-3bomnidocbenchcomposite88.41github-leaderboard
mistral-ocr-3omnidocbenchcomposite79.75codesota-verified
mistral-ocr-3omnidocbenchtext-edit-distance0.099codesota-verified
mistral-ocr-3omnidocbenchtable-teds70.88codesota-verified
mistral-ocr-3omnidocbenchformula-edit-distance0.218codesota-verified
mistral-ocr-3omnidocbenchreading-order91.63codesota-verified
clearocr-teamquestomnidocbenchcomposite31.7codesota-verified
clearocr-teamquestomnidocbenchtext-edit-distance0.154codesota-verified
clearocr-teamquestomnidocbenchtable-teds0.8codesota-verified
clearocr-teamquestomnidocbenchformula-edit-distance0.902codesota-verified
clearocr-teamquestomnidocbenchreading-order86.04codesota-verified
gpt-4oomnidocbenchocr-edit-distance0.02alphaxiv-leaderboard
paddleocr-vlomnidocbenchtable-teds93.52alphaxiv-leaderboard
mineru-2.5omnidocbenchlayout-map97.5alphaxiv-leaderboard
seed-1.6-visionocrbench-v2overall-en-private62.2alphaxiv-leaderboard
qwen3-omni-30bocrbench-v2overall-en-private61.3alphaxiv-leaderboard
nemotron-nano-v2-vlocrbench-v2overall-en-private61.2alphaxiv-leaderboard
gemini-25-proocrbench-v2overall-en-private59.3alphaxiv-leaderboard
gpt-4oocrbench-v2overall-en-private55.5alphaxiv-leaderboard
gemini-25-proocrbench-v2overall-zh-private62.2alphaxiv-leaderboard
mistral-ocr-2512ocrbench-v2overall-en-private25.2codesota-verified
llama-3.1-nemotron-nano-vl-8bocrbench-v2overall-en-private56.4ocrbench-v2-leaderboard
ovis2.5-8bocrbench-v2overall-en-private54.1ocrbench-v2-leaderboard
gemini-1.5-proocrbench-v2overall-en-private51.6ocrbench-v2-leaderboard
sail-vl2-8bocrbench-v2overall-en-private49.3ocrbench-v2-leaderboard
minicpm-v-4.5-8bocrbench-v2overall-en-private48.4ocrbench-v2-leaderboard
gpt-4o-2024ocrbench-v2overall-en-private47.6ocrbench-v2-leaderboard
claude-3.5-sonnetocrbench-v2overall-en-private47.5ocrbench-v2-leaderboard
internvl3.5-14bocrbench-v2overall-en-private47.1ocrbench-v2-leaderboard
step-1vocrbench-v2overall-en-private46.8ocrbench-v2-leaderboard
grok4ocrbench-v2overall-en-private45ocrbench-v2-leaderboard
gpt-4o-miniocrbench-v2overall-en-private44.1ocrbench-v2-leaderboard
claude-sonnet-4ocrbench-v2overall-en-private42.4ocrbench-v2-leaderboard
qwen2.5-vl-7bocrbench-v2overall-en-private41.8ocrbench-v2-leaderboard
deepseek-vl2-smallocrbench-v2overall-en-private41ocrbench-v2-leaderboard
pixtral-12bocrbench-v2overall-en-private38.4ocrbench-v2-leaderboard
phi-4-multimodalocrbench-v2overall-en-private38.1ocrbench-v2-leaderboard
glm-4v-9bocrbench-v2overall-en-private37.1ocrbench-v2-leaderboard
molmo-7bocrbench-v2overall-en-private33.9ocrbench-v2-leaderboard
llava-ov-7bocrbench-v2overall-en-private33.7ocrbench-v2-leaderboard
idefics3-8bocrbench-v2overall-en-private26ocrbench-v2-leaderboard
docowl2ocrbench-v2overall-en-private23.4ocrbench-v2-leaderboard
minicpm-v-4.5-8bocrbench-v2overall-zh-private58.8ocrbench-v2-leaderboard
sail-vl2-8bocrbench-v2overall-zh-private57.6ocrbench-v2-leaderboard
claude-3.5-sonnetocrbench-v2overall-zh-private48.4ocrbench-v2-leaderboard
gpt-4o-2024ocrbench-v2overall-zh-private45.7ocrbench-v2-leaderboard
chandra-ocr-0.1.0olmocr-benchpass-rate83.1alphaxiv-leaderboard
chandra-ocr-0.1.0olmocr-benchtables88github-readme
chandra-ocr-0.1.0olmocr-benchold-scans-math80.3github-readme
chandra-ocr-0.1.0olmocr-benchlong-tiny-text92.3github-readme
chandra-ocr-0.1.0olmocr-benchbase99.9github-readme
chandra-ocr-0.1.0olmocr-benchheaders-footers90.8github-readme
chandra-ocr-0.1.0olmocr-benchmulti-column81.2github-readme
chandra-ocr-0.1.0olmocr-bencharxiv82.2github-readme
chandra-ocr-0.1.0olmocr-benchold-scans50.4github-readme
deepseek-ocrolmocr-benchpass-rate75.4github-readme
dots-ocr-3bolmocr-benchpass-rate79.1github-readme
marker-1.10.0olmocr-benchpass-rate76.5github-readme
gpt-4o-anchoredolmocr-benchpass-rate69.9github-readme
gemini-flash-2olmocr-benchpass-rate63.8github-readme
dots-ocr-3bolmocr-benchtables88.3github-readme
olmocr-v0.3.0olmocr-benchold-scans-math79.9github-readme
olmocr-v0.3.0olmocr-benchheaders-footers95.1github-readme
marker-1.10.0olmocr-bencharxiv83.8github-readme
gpt-4oolmocr-benchold-scans40.7github-readme
infinity-parser-7bolmocr-benchpass-rate82.5alphaxiv-leaderboard
olmocr-v0.4.0olmocr-benchpass-rate82.4alphaxiv-leaderboard
paddleocr-vlolmocr-benchpass-rate80alphaxiv-leaderboard
marker-1.10.1olmocr-benchpass-rate76.1alphaxiv-leaderboard
deepseek-ocrolmocr-benchpass-rate75.7alphaxiv-leaderboard
mineru-2.5olmocr-benchpass-rate75.2alphaxiv-leaderboard
mistral-ocr-3olmocr-benchpass-rate78mistral-announcement
mistral-ocr-3internal-mistraloverall-accuracy94.9mistral-announcement
mistral-ocr-3ocr-cer-benchmarkcer3.7sparkco-benchmark
mistral-ocr-3ocr-wer-benchmarkwer7.1sparkco-benchmark
mistral-ocr-apiolmocr-benchpass-rate72alphaxiv-leaderboard
nanonets-ocr2-3bolmocr-benchpass-rate69.5alphaxiv-leaderboard
churro-3bchurro-dshandwritten-levenshtein70.1alphaxiv-leaderboard
churro-3bchurro-dsprinted-levenshtein82.3alphaxiv-leaderboard
gemini-25-prochurro-dshandwritten-levenshtein63.6alphaxiv-leaderboard
gemini-25-prochurro-dsprinted-levenshtein80.9alphaxiv-leaderboard
gemini-25-flashchurro-dshandwritten-levenshtein58.7alphaxiv-leaderboard
qwen25-vl-72bchurro-dshandwritten-levenshtein54.5alphaxiv-leaderboard
claude-sonnet-4churro-dshandwritten-levenshtein37.1alphaxiv-leaderboard
gpt-4ochurro-dshandwritten-levenshtein34.2alphaxiv-leaderboard
gemini-15-procc-ocrmulti-scene-f183.25alphaxiv-leaderboard
qwen2-vl-72bcc-ocrmulti-scene-f177.95alphaxiv-leaderboard
internvl2-76bcc-ocrmulti-scene-f176.92alphaxiv-leaderboard
gpt-4occ-ocrmulti-scene-f176.4alphaxiv-leaderboard
claude-35-sonnetcc-ocrmulti-scene-f172.87alphaxiv-leaderboard
qwen2-vl-72bcc-ocrkie-f171.76alphaxiv-leaderboard
gemini-15-procc-ocrkie-f167.28alphaxiv-leaderboard
claude-35-sonnetcc-ocrkie-f164.58alphaxiv-leaderboard
gpt-4occ-ocrkie-f163.45alphaxiv-leaderboard
gemini-15-procc-ocrmultilingual-f178.97alphaxiv-leaderboard
gpt-4occ-ocrmultilingual-f173.44alphaxiv-leaderboard
gemini-15-procc-ocrdocument-parsing62.37alphaxiv-leaderboard
gemini-25-promme-videoocrtotal-accuracy73.7alphaxiv-leaderboard
qwen25-vl-72bmme-videoocrtotal-accuracy69alphaxiv-leaderboard
internvl3-78bmme-videoocrtotal-accuracy67.2alphaxiv-leaderboard
gpt-4omme-videoocrtotal-accuracy66.4alphaxiv-leaderboard
gemini-15-promme-videoocrtotal-accuracy64.9alphaxiv-leaderboard
qwen25-vl-32bmme-videoocrtotal-accuracy61alphaxiv-leaderboard
gemini-20-flashkitab-benchcer0.13alphaxiv-leaderboard
ain-7bkitab-benchcer0.2alphaxiv-leaderboard
gpt-4okitab-benchcer0.31alphaxiv-leaderboard
gpt-4o-minikitab-benchcer0.43alphaxiv-leaderboard
azure-ocrkitab-benchcer0.52alphaxiv-leaderboard
tesseractkitab-benchcer0.54alphaxiv-leaderboard
easyocrkitab-benchcer0.58alphaxiv-leaderboard
paddleocrkitab-benchcer0.79alphaxiv-leaderboard
claude-sonnet-4thaiocrbenchted-score0.84alphaxiv-leaderboard
gemini-25-prothaiocrbenchted-score0.77alphaxiv-leaderboard
qwen25-vl-32bthaiocrbenchted-score0.765alphaxiv-leaderboard
internvl3-14bthaiocrbenchted-score0.76alphaxiv-leaderboard
qwen25-vl-72bthaiocrbenchted-score0.72alphaxiv-leaderboard
o1-previewgsm8kaccuracy97.8openai-blog
gpt-4ogsm8kaccuracy92openai-blog
claude-35-sonnetgsm8kaccuracy96.4anthropic-blog
gemini-15-progsm8kaccuracy91.7google-blog
llama-3-70bgsm8kaccuracy93meta-blog
o1-previewmathaccuracy94.8openai-blog
gpt-4omathaccuracy76.6openai-blog
claude-35-sonnetmathaccuracy71.1anthropic-blog
gemini-15-promathaccuracy67.7google-blog
deepseek-v3mathaccuracy90.2deepseek-blog
o1-previewaime-2024accuracy83.3openai-blog
gpt-4oaime-2024accuracy13.4openai-blog
claude-35-opusaime-2024accuracy16anthropic-blog
gpt-4ohellaswagaccuracy95.3openai-blog
claude-35-sonnethellaswagaccuracy89anthropic-blog
llama-3-70bhellaswagaccuracy88meta-blog
gemini-15-prohellaswagaccuracy92.5google-blog
gpt-4owinograndeaccuracy87.5openai-blog
claude-35-sonnetwinograndeaccuracy85.4anthropic-blog
llama-3-70bwinograndeaccuracy85.3meta-blog
gpt-4oarc-challengeaccuracy96.4openai-blog
claude-35-sonnetarc-challengeaccuracy96.7anthropic-blog
llama-3-70barc-challengeaccuracy93meta-blog
gemini-15-proarc-challengeaccuracy94.8google-blog
gpt-4ommluaccuracy88.7openai-blog
o1-previewmmluaccuracy92.3openai-blog
claude-35-sonnetmmluaccuracy88.7anthropic-blog
gemini-15-prommluaccuracy85.9google-blog
llama-3-70bmmluaccuracy82meta-blog
deepseek-v3mmluaccuracy88.5deepseek-blog
o1-previewgpqaaccuracy78openai-blog
gpt-4ogpqaaccuracy53.6openai-blog
claude-35-sonnetgpqaaccuracy59.4anthropic-blog
gemini-15-progpqaaccuracy46.2google-blog
gpt-4ocommonsenseqaaccuracy85.4openai-blog
claude-35-sonnetcommonsenseqaaccuracy83.2anthropic-blog
llama-3-70bcommonsenseqaaccuracy80.9meta-blog
gpt-4ohotpotqaf171.3arxiv-paper
claude-35-sonnethotpotqaf168.5arxiv-paper
gpt-4ostrategyqaaccuracy82.1arxiv-paper
claude-35-sonnetstrategyqaaccuracy79.8arxiv-paper
gpt-4ologiqaaccuracy56.3arxiv-paper
claude-35-sonnetlogiqaaccuracy53.8arxiv-paper
gpt-4orecloraccuracy72.4arxiv-paper
claude-35-sonnetrecloraccuracy68.9arxiv-paper
gpt-4osvampaccuracy93.7arxiv-paper
claude-35-sonnetsvampaccuracy91.2arxiv-paper
llama-3-70bsvampaccuracy89.5meta-blog
gpt-4omawpsaccuracy97.2arxiv-paper
claude-35-sonnetmawpsaccuracy95.8arxiv-paper
llama-3-70bmawpsaccuracy94.1meta-blog
plymouth-dl-modelabide-iaccuracy98research-paper
deepasdabide-iiauc93research-paper
mcbertabide-iaccuracy93.4research-paper
ae-fcnabide-iaccuracy85research-paper
braingтabide-iauc78.7research-paper
asd-swnetabide-iaccuracy76.52research-paper
asd-swnetabide-iauc81research-paper
al-negatabide-iaccuracy74.7research-paper
braingnnabide-iaccuracy73.3research-paper
gcnabide-iaccuracy72.2research-paper
gcnabide-iauc78research-paper
multi-task-transformerabide-iaccuracy72research-paper
svm-connectivityabide-iaccuracy70.1research-paper
svm-connectivityabide-iauc77research-paper
deep-learning-heinsfeldabide-iaccuracy70research-paper
mvs-gcnabide-iaccuracy69.38research-paper
mvs-gcnabide-iauc69.01research-paper
phgcl-ddgformerabide-iaccuracy70.9research-paper
random-forestabide-iaccuracy63research-paper
maacnnabide-iaccuracy75.12research-paper
maacnnabide-iiaccuracy72.88research-paper
multi-atlas-dnnabide-iaccuracy78.07research-paper
abraham-connectomesabide-iaccuracy67research-paper
o1-previewhumanevalpass@192.4openai-blog
claude-35-sonnethumanevalpass@192anthropic-blog
gpt-4ohumanevalpass@190.2openai-blog
deepseek-v3humanevalpass@182.6deepseek-blog
llama-3-70bhumanevalpass@181.7meta-blog
claude-35-sonnetswe-bench-verifiedresolve-rate49anthropic-blog
gpt-4oswe-bench-verifiedresolve-rate41.2swe-bench-leaderboard
deepseek-v25swe-bench-verifiedresolve-rate37deepseek-blog
gpt-4ombpppass@187.8openai-blog
claude-35-sonnetmbpppass@189.2anthropic-blog
internimage-hcocomAP65.4arxiv-paper
co-detr-swin-lcocomAP66arxiv-paper
dino-swin-lcocomAP63.3arxiv-paper
yolov10-xcocomAP57.4github-readme
efficientdet-d7-xcocomAP55.1google-research
internimage-hade20kmIoU62.9arxiv-paper
mask2former-swin-lade20kmIoU57.3arxiv-paper
agent57atari-2600human-normalized-score4731.3deepmind-research
go-exploreatari-2600human-normalized-score40000nature-paper
muzeroatari-2600human-normalized-score731nature-paper
dreamerv3atari-2600human-normalized-score840arxiv-paper
rainbow-dqnatari-2600human-normalized-score231aaai-paper
dqnatari-2600human-normalized-score79nature-paper
human-gameratari-2600human-normalized-score100baseline
bbos-1atari-2600human-normalized-score1100research
gdi-h3atari-2600human-normalized-score950research
chexpert-auc-maximizerchexpertauroc93stanford-leaderboard
chexzerochexpertauroc88.6research-paper
torchxrayvisionchexpertauroc87.4github-readme
densenet-121-cxrchexpertauroc86.5research-paper
gloriachexpertauroc88.2research-paper
medclipchexpertauroc87.8research-paper
biovilchexpertauroc89.1microsoft-research
chexnetnih-chestxray14auroc84.1research-paper
torchxrayvisionnih-chestxray14auroc85.8github-readme
densenet-121-cxrnih-chestxray14auroc82.6research-paper
resnet-50-cxrnih-chestxray14auroc80.4research-paper
chexzeromimic-cxrauroc89.2research-paper
torchxrayvisionmimic-cxrauroc86.3github-readme
convirtmimic-cxrauroc85.7research-paper
rad-dinovindr-cxrauroc91.2microsoft-research
torchxrayvisionvindr-cxrauroc87.9research-paper
densenet-121-cxrrsna-pneumoniaauroc88.5kaggle-competition
chexnetrsna-pneumoniaauroc87.2research-paper
torchxrayvisionpadchestauroc84.6github-readme
densenet-121-cxrcovid-chestxrayauroc94.7research-paper
torchxrayvisioncovid-chestxrayauroc93.2github-readme
patchcoremvtec-adauroc99.1research-paper
efficientadmvtec-adauroc99.1research-paper
simplenetmvtec-adauroc99.6research-paper
padimmvtec-adauroc97.9research-paper
fastflowmvtec-adauroc99.4research-paper
draemmvtec-adauroc98research-paper
cflow-admvtec-adauroc98.3research-paper
reverse-distillationmvtec-adauroc98.5research-paper
patchcorevisaauroc92.1research-paper
simplenetvisaauroc95.5research-paper
efficientadvisaauroc94.8research-paper
yolov8-weldweld-defect-xraymap87.3research
defectdet-resnetneu-detmap78.4research
yolov8-weldseverstal-steeldice91.2kaggle

Pending Verification

These results are claimed in papers but need manual verification from the source PDF.

ModelDatasetClaimed ValueStatus
trocr-largesroie96.58needs-pdf-verification
trocr-largeiam2.89needs-pdf-verification
paddleocr-v4icdar-2015Unknownneeds-documentation-verification
polish-roberta-ocrpoleval-2021-ocrUnknown
polish-t5-ocrpoleval-2021-ocrUnknown
herbertpoleval-2021-ocrUnknown
abbyy-finereaderimpact-psncUnknown
tesseract-polishimpact-psncUnknown
abbyy-finereaderimpact-psncUnknown
tesseract-polishimpact-psncUnknown
tesseract-polishcodesota-polishUnknown
tesseract-polishcodesota-polishUnknown
tesseract-polishcodesota-polishUnknown
tesseract-polishcodesota-polish-wikipediaUnknown
tesseract-polishcodesota-polish-realUnknown
tesseract-polishcodesota-polish-synth-randomUnknown
tesseract-polishcodesota-polish-synth-wordsUnknown
claude-sonnet-4swe-bench-verifiedUnknown
claude-sonnet-4-high-computeswe-bench-verifiedUnknown
claude-opus-4.5swe-bench-verifiedUnknown
o3swe-bench-verifiedUnknown
claude-3.7-sonnetswe-bench-verifiedUnknown
claude-3.5-sonnetswe-bench-verifiedUnknown
o1swe-bench-verifiedUnknown
gpt-4oswe-bench-verifiedUnknown
o3aime-2024Unknown
o1aime-2024Unknown
deepseek-r1aime-2024Unknown
o1aime-2024Unknown
gpt-4oaime-2024Unknown
o3gpqa-diamondUnknown
gemini-2.5-progpqa-diamondUnknown
o1gpqa-diamondUnknown
o3-minigpqa-diamondUnknown
claude-3.5-sonnetgpqa-diamondUnknown
gpt-4ogpqa-diamondUnknown

Data Quality

All benchmark results are sourced from AlphaXiv benchmark leaderboards. Each data point includes the source URL for verification.

Results marked as "pending verification" are claimed in papers but have not been independently confirmed. We do not include estimated or interpolated values.