Computer Vision

Optical Character Recognition

Extracting text from document images

110 datasets680 results

Optical Character Recognition is a key task in computer vision. Below you will find the standard benchmarks used to evaluate models, along with current state-of-the-art results.

Benchmarks & SOTA

cnn-/-daily-mail

202080 results

Dataset from Papers With Code

State of the Art

Scrambled code + broken (alter)

48.18

rouge-1

scut-ctw1500

202073 results

Dataset from Papers With Code

State of the Art

FAST-T-512

129.1

fps

icdar2013

202039 results

Dataset from Papers With Code

State of the Art

DTrOCR 105M

99.4

accuracy

dart

202032 results

Dataset from Papers With Code

State of the Art

FactT5B

97.6

factspotter

icdar2015

202026 results

Dataset from Papers With Code

State of the Art

DTrOCR 105M

93.5

accuracy

tabfact

202023 results

Dataset from Papers With Code

State of the Art

ARTEMIS-DA

93.1

test

sun-rgb-d

202019 results

Dataset from Papers With Code

State of the Art

IM3D

64.4

iou

inverse-text

202018 results

Dataset from Papers With Code

State of the Art

DeepSolo (ViTAEv2-S, TextOCR)

75.8

f-measure-full-lexicon

pendigits

202015 results

Dataset from Papers With Code

State of the Art

DnC-SC

82.86

nmi

videodb's-ocr-benchmark-public-collection

202015 results

Dataset from Papers With Code

State of the Art

GPT-4o

OpenAI

76.22

accuracy

lam(line-level)

202012 results

Dataset from Papers With Code

State of the Art

GFCN

18.5

test-wer

howsumm-step

202011 results

Dataset from Papers With Code

State of the Art

LexRank (query: step title)

39.6

rouge-1

e2e

202010 results

Dataset from Papers With Code

State of the Art

HTLM (fine-tuning)

70.8

rouge-l

urdudoc

20209 results

Dataset from Papers With Code

State of the Art

ContourNet [69]

88.68

recall

howsumm-method

20209 results

Dataset from Papers With Code

State of the Art

LexRank (query: method + article + steps titles)

53.5

rouge-1

iam(line-level)

20209 results

Dataset from Papers With Code

State of the Art

GFCN

28.6

test-wer

read2016(line-level)

20209 results

Dataset from Papers With Code

State of the Art

Span

21.1

test-wer

KITAB-Bench

KITAB Arabic OCR Benchmark

20248 results

8,809 Arabic text samples across 9 domains. Tests Arabic script recognition.

State of the Art

PaddleOCR

Baidu

0.790

cer

belfort

20208 results

Dataset from Papers With Code

State of the Art

PyLaia (human transcriptions + random split)

28.11

wer

wikibio

20208 results

Dataset from Papers With Code

State of the Art

MBD

56.16

parent

codesearchnet---java

20208 results

Dataset from Papers With Code

State of the Art

CodeTrans-MT-Large

21.87

smoothed-bleu-4

codesearchnet---javascript

20208 results

Dataset from Papers With Code

State of the Art

Transformer

25.61

smoothed-bleu-4

codesearchnet---php

20208 results

Dataset from Papers With Code

State of the Art

CodeTrans-MT-Base

26.23

smoothed-bleu-4

reuters-21578

20208 results

Dataset from Papers With Code

State of the Art

ApproxRepSet

97.17

accuracy

codesearchnet---ruby

20207 results

Dataset from Papers With Code

State of the Art

CodeTrans-MT-Base

15.26

smoothed-bleu-4

codesearchnet---go

20207 results

Dataset from Papers With Code

State of the Art

CodeBERT (MLM)

26.79

smoothed-bleu-4

codesearchnet

20207 results

Dataset from Papers With Code

State of the Art

CodeBERT (MLM+RTD)

15.99

smoothed-bleu-4

benchmarking-chinese-text-recognition:-datasets,-b

20207 results

Dataset from Papers With Code

State of the Art

DTrOCR

89.6

accuracy

codesearchnet---python

20207 results

Dataset from Papers With Code

State of the Art

CodeTrans-MT-Base

20.39

smoothed-bleu-4

mldoc-zero-shot-english-to-french

20206 results

Dataset from Papers With Code

State of the Art

XLMft UDA

96.05

accuracy

webnlg-(unseen)

20206 results

Dataset from Papers With Code

State of the Art

HTLM (fine-tuning)

48.4

bleu

hoc

20206 results

Dataset from Papers With Code

State of the Art

BioLinkBERT (large)

88.1

f1

webnlg-(seen)

20206 results

Dataset from Papers With Code

State of the Art

HTLM (fine-tuning)

65.4

bleu

webnlg-(all)

20206 results

Dataset from Papers With Code

State of the Art

HTLM (fine-tuning)

55.6

bleu

mldoc-zero-shot-english-to-spanish

20206 results

Dataset from Papers With Code

State of the Art

XLMft UDA

96.8

accuracy

tobacco-small-3482

20206 results

Dataset from Papers With Code

State of the Art

Optimized Text CNN

84

accuracy

mldoc-zero-shot-english-to-russian

20205 results

Dataset from Papers With Code

State of the Art

XLMft UDA

89.7

accuracy

wikipedia-person-and-animal-dataset

20205 results

Dataset from Papers With Code

State of the Art

VTM

45.36

rouge

mldoc-zero-shot-english-to-german

20205 results

Dataset from Papers With Code

State of the Art

XLMft UDA

96.95

accuracy

ThaiOCRBench

Thai OCR Benchmark

20245 results

2,808 Thai text samples across 13 tasks. Tests Thai script structural understanding.

State of the Art

Claude Sonnet 4

Anthropic

0.840

ted-score

mldoc-zero-shot-english-to-chinese

20205 results

Dataset from Papers With Code

State of the Art

XLMft UDA

93.32

accuracy

stdw

20204 results

Dataset from Papers With Code

State of the Art

RetinaNet

0.780

ap

mldoc-zero-shot-english-to-italian

20204 results

Dataset from Papers With Code

State of the Art

MultiFiT, pseudo

76.02

accuracy

bbcsport

20204 results

Dataset from Papers With Code

State of the Art

MPAD-path

99.59

accuracy

read-2016

20204 results

Dataset from Papers With Code

State of the Art

HTR-VT(line-level)

16.5

wer

sut

20203 results

Dataset from Papers With Code

State of the Art

CNN

86

accuracy

twitter

20203 results

Dataset from Papers With Code

State of the Art

ApproxRepSet

72.6

accuracy

cub-200-2011

20203 results

Dataset from Papers With Code

State of the Art

Q-SENN

85.9

top-1-accuracy

amazon

20203 results

Dataset from Papers With Code

State of the Art

ApproxRepSet

94.31

accuracy

rotowire

20203 results

Dataset from Papers With Code

State of the Art

HierarchicalEncoder + NR + IR

55.88

content-selection-f1

reuters-rcv1/rcv2-german-to-english

20203 results

Dataset from Papers With Code

State of the Art

Biinclusion (Euro500kReuters)

84.4

accuracy

reuters-rcv1/rcv2-english-to-german

20203 results

Dataset from Papers With Code

State of the Art

Biinclusion (Euro500kReuters)

92.7

accuracy

fsns---test

20203 results

Dataset from Papers With Code

State of the Art

STREET

27.54

sequence-error

mldoc-zero-shot-english-to-japanese

20203 results

Dataset from Papers With Code

State of the Art

MultiFiT, pseudo

69.57

accuracy

dareczech

20203 results

Dataset from Papers With Code

State of the Art

Query-doc RobeCzech (Roberta-base)

46.73

p-10

bbc-xsum

20203 results

Dataset from Papers With Code

State of the Art

BigBird-Pegasus

47.12

rouge-1

scidocs-(mesh)

20202 results

Dataset from Papers With Code

State of the Art

SciNCL

88.7

f1-micro

cedar-signature

20202 results

Dataset from Papers With Code

State of the Art

Siamese_MultiHeadCrossAttention_SoftAttention (Siamese_MHCA_SA)

5.7

far

classic

20202 results

Dataset from Papers With Code

State of the Art

REL-RWMD k-NN

96.85

accuracy

clueweb09-b

20202 results

Dataset from Papers With Code

State of the Art

XLNet

31.1

ndcg-20

dise-2021-dataset

20202 results

Dataset from Papers With Code

State of the Art

JDeskew

0.860

percentage-correct

i2l-140k

20202 results

Dataset from Papers With Code

State of the Art

I2L-NOPOOL

89.09

bleu

icdar-2019

20202 results

Dataset from Papers With Code

State of the Art

DiT-L (Cascade)

96.55

weighted-average-f1-score

imdb-m

20202 results

Dataset from Papers With Code

State of the Art

Document Classification Using Importance of Sentences

54.8

accuracy

recipe

20202 results

Dataset from Papers With Code

State of the Art

ApproxRepSet

59.06

accuracy

scidocs-(mag)

20202 results

Dataset from Papers With Code

State of the Art

SPECTER

82

f1-micro

aapd

20202 results

Dataset from Papers With Code

State of the Art

KD-LSTMreg

72.9

f1

simara

20202 results

Dataset from Papers With Code

State of the Art

DAN

14.79

wer

textzoom

20202 results

Dataset from Papers With Code

State of the Art

CCD-ViT-Small

21.84

average-psnr-db

wos-5736

20202 results

Dataset from Papers With Code

State of the Art

ConvTextTM

91.28

accuracy

re-docred

20201 results

Dataset from Papers With Code

State of the Art

VaeDiff-DocRE

0.790

f1

iris

20201 results

Dataset from Papers With Code

State of the Art

ELSC

97.7

accuracy

mldoc-zero-shot-german-to-french

20201 results

Dataset from Papers With Code

State of the Art

BiLSTM (Europarl)

75.45

accuracy

mpqa

20201 results

Dataset from Papers With Code

State of the Art

MPAD-path

89.81

accuracy

jaffe

20201 results

Dataset from Papers With Code

State of the Art

ELSC

98.6

accuracy

pixraw10p

20201 results

Dataset from Papers With Code

State of the Art

ELSC

96

accuracy

and-dataset

20201 results

Dataset from Papers With Code

State of the Art

Siamese_MHCA_SA

0.810

average-f1

im2latex-100k

20201 results

Dataset from Papers With Code

State of the Art

I2L-STRIPS

88.86

bleu

reuters-de-en

20201 results

Dataset from Papers With Code

State of the Art

BilBOWA

75

accuracy

reuters-en-de

20201 results

Dataset from Papers With Code

State of the Art

BilBOWA

86.5

accuracy

iam-d

20201 results

Dataset from Papers With Code

State of the Art

StackMix+Blots

3.01

cer

iam-b

20201 results

Dataset from Papers With Code

State of the Art

StackMix+Blots

3.77

cer

hyperpartisan-news-detection

20201 results

Dataset from Papers With Code

State of the Art

ChuLo

95.38

accuracy

saint-gall

20201 results

Dataset from Papers With Code

State of the Art

StackMix+Blots

3.65

cer

scene-text-recognition-benchmarks

20201 results

Dataset from Papers With Code

State of the Art

CCD-ViT-Small

84.9

accuracy

wine

20201 results

Dataset from Papers With Code

State of the Art

ELSC

75.8

accuracy

wos-11967

20201 results

Dataset from Papers With Code

State of the Art

HDLTex

86.07

accuracy

hkr

20201 results

Dataset from Papers With Code

State of the Art

StackMix+Blots

3.49

cer

wos-46985

20201 results

Dataset from Papers With Code

State of the Art

HDLTex

76.58

accuracy

food-101

20201 results

Dataset from Papers With Code

State of the Art

Bert

84.41

accuracy

ephoie

20201 results

Dataset from Papers With Code

State of the Art

LayoutLMv3

99.21

average-f1

dwie

20201 results

Dataset from Papers With Code

State of the Art

VaeDiff-DocRE

0.731

f1

docred-ie

20201 results

Dataset from Papers With Code

State of the Art

REXEL

60.1

relation-f1

textseg

20201 results

Dataset from Papers With Code

State of the Art

CCD-ViT-Small

84.8

iou

yelp-14

20201 results

Dataset from Papers With Code

State of the Art

KD-LSTMreg

69.4

accuracy

digital-peter

20201 results

Dataset from Papers With Code

State of the Art

StackMix+Blots

2.5

cer

cl-scisumm

20201 results

Dataset from Papers With Code

State of the Art

GCN Hybrid

33.88

rouge-2

bentham

20201 results

Dataset from Papers With Code

State of the Art

StackMix+Blots

1.73

cer

bc8

20201 results

Dataset from Papers With Code

State of the Art

BioRex+Directionality

56.06

evaluation-macro-f1

warppie10p

20201 results

Dataset from Papers With Code

State of the Art

ELSC

53.4

accuracy

ba

20201 results

Dataset from Papers With Code

State of the Art

ELSC

51.8

accuracy

australian

20201 results

Dataset from Papers With Code

State of the Art

ELSC

70.9

accuracy

arxiv-summarization-dataset

20201 results

Dataset from Papers With Code

State of the Art

DeepPyramidion

19.99

rouge-2

arxiv-hep-th-citation-graph

20201 results

Dataset from Papers With Code

State of the Art

DeepPyramidion

47.15

rouge-1

wikilingua-(tr->en)

20201 results

Dataset from Papers With Code

State of the Art

DOCmT5

31.37

rouge-l

lun

20201 results

Dataset from Papers With Code

State of the Art

ChuLo

64.4

accuracy

IMPACT-PSNC

IMPACT Polish Digital Libraries Ground Truth

20120 results

478 pages of ground truth from four Polish digital libraries at 99.95% accuracy. Includes annotations at region, line, word, and glyph levels. Gothic and antiqua fonts.

No results tracked yet

CodeSOTA Polish

CodeSOTA Polish OCR Benchmark

20250 results

1,000 synthetic and real Polish text images with 5 degradation levels (clean to severe). Tests character-level OCR on diacritics with contamination-resistant synthetic categories. Categories: synth_random (pure character recognition), synth_words (Markov-generated words), real_corpus (Pan Tadeusz, official documents), wikipedia (potential contamination baseline).

No results tracked yet

SROIE

Scanned Receipts OCR and Information Extraction

20190 results

626 receipt images. Key task: extract company, date, address, total from receipts.

No results tracked yet

PolEval 2021 OCR

PolEval 2021 OCR Post-Correction Task

20210 results

979 Polish books (69,000 pages) from 1791-1998. Focus on OCR post-correction using NLP methods. Major benchmark for Polish historical document processing.

No results tracked yet

Related Tasks

Optical Character Recognition Benchmarks - Computer Vision - CodeSOTA | CodeSOTA