Scene Text Recognition2020en

cute80

Dataset from Papers With Code

Metrics:accuracy, cer, wer, f1
Current State of the Art

CPPD

Unknown

99.7

accuracy

accuracy Progress Over Time

Showing 7 breakthroughs from Mar 2021 to May 2023

88.291.394.597.6100.8Mar 2021Aug 2021Jan 2022Jun 2022Nov 2022May 2023accuracyDate

Key Milestones

Mar 2021
ABINet-LV

ABINet Language-Vision variant. CVPR 2021.

89.2
Aug 2021
DPAN

From paper: Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition

91.9
+3.0%
Nov 2021
MATRN

From paper: Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features

93.5
+1.7%
Dec 2021
S-GTR

From paper: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition

94.7
+1.3%
Jul 2022
PARSeq

Lowercase alphanum eval. ECCV 2022.

98.6
+4.1%
Sep 2022
MGP-STR

From paper: Multi-Granularity Prediction for Scene Text Recognition

99.3
+0.7%
May 2023
CLIP4STR-L (DataComp-1B)Current SOTA

From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model

99.7
+0.4%
Total Improvement
11.8%
Time Span
2y 2m
Breakthroughs
7
Current SOTA
99.7

Top Models Performance Comparison

Top 10 models ranked by accuracy

accuracy1CPPD99.7100.0%2CLIP4STR-L (DataComp-1B)99.7100.0%3MGP-STR99.399.6%4CLIP4STR-B99.399.6%5DTrOCR 105M99.199.4%6CLIP4STR-L99.099.3%7PARSeq98.698.9%8CCD-ViT-Small(ARD_2.8M)98.398.6%9CCD-ViT-Base(ARD_2.8M)98.398.6%10CCD-ViT-Tiny(ARD_2.8M)95.896.1%0%25%50%75%100%% of best
Best Score
99.7
Top Model
CPPD
Models Compared
10
Score Range
3.9

accuracyPrimary

#ModelScorePaper / CodeDate
1
CPPD
99.7Jul 2023
2
CLIP4STR-L (DataComp-1B)
99.7May 2023
3
MGP-STR
99.31Sep 2022
4
CLIP4STR-B
Research
99.3May 2023
5
DTrOCR 105M
99.1Aug 2023
6
CLIP4STR-L
99May 2023
7
PARSeqOpen Source
Research
98.61Jul 2022
8
CCD-ViT-Small(ARD_2.8M)
98.3Nov 2022
9
CCD-ViT-Base(ARD_2.8M)
98.3Nov 2022
10
CCD-ViT-Tiny(ARD_2.8M)
95.8Nov 2022
11
S-GTR
94.7Dec 2021
12
MATRN
Research
93.5Nov 2021
13
SIGA_T
93.1Mar 2022
14
DiffusionSTR
92.5Jun 2023
15
NRTR+TPS++
92.4May 2023
16
DPAN
91.9
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text RecognitionCode
Aug 2021
17
CDistNet (Ours)
89.58Nov 2021
18
ABINet-LVOpen Source
Fang et al.
89.2Mar 2021
19
TrOCR-large 558M
84.1Sep 2021
20
TrOCR-base 334M
81.2Sep 2021

Related Papers14

Self-supervised Character-to-Character Distillation for Text Recognition
Nov 2022Models: CCD-ViT-Small(ARD_2.8M), CCD-ViT-Base(ARD_2.8M), CCD-ViT-Tiny(ARD_2.8M)

Other Scene Text Recognition Datasets