Scene Text Recognition

SVTP (Street View Text-Perspective)

Dataset from Papers With Code

Metrics: accuracy, CER, WER, F1
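The page lists these metrics without defining them. As a rough sketch, assuming the conventional definitions (word accuracy as exact-match percentage, CER and WER as edit distance divided by reference length), they could be computed as below. The function names and sample strings are illustrative and not taken from any benchmark toolkit; F1 is omitted because its definition varies by task.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def word_accuracy(preds, refs):
    """Percentage of predictions that match their reference exactly."""
    return 100.0 * sum(p == r for p, r in zip(preds, refs)) / len(refs)


def cer(preds, refs):
    """Character error rate: total character edits / total reference characters."""
    edits = sum(edit_distance(p, r) for p, r in zip(preds, refs))
    return edits / sum(len(r) for r in refs)


def wer(preds, refs):
    """Word error rate: total word edits / total reference words."""
    edits = sum(edit_distance(p.split(), r.split()) for p, r in zip(preds, refs))
    return edits / sum(len(r.split()) for r in refs)


# Hypothetical predictions and ground-truth labels for three cropped word images.
preds = ["hotel", "cafe", "open"]
refs = ["hotel", "cage", "open"]
print(word_accuracy(preds, refs), cer(preds, refs), wer(preds, refs))
```

Scene text benchmarks such as this one usually report word accuracy, so a single wrong character counts the whole word as an error.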
Current State of the Art

DTrOCR 105M

Unknown

98.6

accuracy

accuracy Progress Over Time

Showing 5 breakthroughs from Mar 2021 to Aug 2023

[Line chart: accuracy (y-axis, 88.6 to 99.5) by date (x-axis, Mar 2021 to Aug 2023)]

Key Milestones

Mar 2021: ABINet-LV
ABINet Language-Vision variant. CVPR 2021.
Accuracy: 89.5

Nov 2021: MATRN
From paper: Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Accuracy: 90.6 (+1.2% relative to the previous milestone)

Jul 2022: PARSeq
Evaluated on lowercase alphanumeric characters. ECCV 2022.
Accuracy: 96.9 (+7.0% relative to the previous milestone)

Sep 2022: MGP-STR
From paper: Multi-Granularity Prediction for Scene Text Recognition
Accuracy: 98.3 (+1.4% relative to the previous milestone)

Aug 2023: DTrOCR 105M (Current SOTA)
From paper: DTrOCR: Decoder-only Transformer for Optical Character Recognition
Accuracy: 98.6 (+0.3% relative to the previous milestone)

Total Improvement: 10.2%
Time Span: 2y 5m
Breakthroughs: 5
Current SOTA: 98.6
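
The percentage gains listed for the milestones appear to be relative improvements over the previous milestone's score, not differences in accuracy points. A minimal sketch that reproduces them from the scores on this page, assuming that interpretation (the variable and function names are illustrative):

```python
# (date, model, accuracy) for the five milestones listed on this page.
milestones = [
    ("Mar 2021", "ABINet-LV", 89.5),
    ("Nov 2021", "MATRN", 90.6),
    ("Jul 2022", "PARSeq", 96.9),
    ("Sep 2022", "MGP-STR", 98.3),
    ("Aug 2023", "DTrOCR 105M", 98.6),
]


def relative_gain(prev, curr):
    """Relative improvement of curr over prev, in percent."""
    return 100.0 * (curr - prev) / prev


# Gain of each milestone over the one before it.
gains = [
    round(relative_gain(a[2], b[2]), 1)
    for a, b in zip(milestones, milestones[1:])
]

# Gain of the latest milestone over the first one.
total = round(relative_gain(milestones[0][2], milestones[-1][2]), 1)

print(gains)  # [1.2, 7.0, 1.4, 0.3]
print(total)  # 10.2
```

Under this reading, PARSeq's +7.0% corresponds to a gain of 6.3 accuracy points over MATRN (90.6 to 96.9).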

Top Models Performance Comparison

Top 10 models ranked by accuracy

[Bar chart: accuracy per model, scaled as % of best (0% to 100%)]

Rank  Model                     Accuracy  % of best
1     DTrOCR 105M               98.6      100.0%
2     MGP-STR                   98.3      99.7%
3     CLIP4STR-L (DataComp-1B)  98.1      99.5%
4     CLIP4STR-L                97.4      98.8%
5     CLIP4STR-B                97.2      98.6%
6     PARSeq                    96.9      98.3%
7     CPPD                      96.7      98.1%
8     CCD-ViT-Base              96.1      97.5%
9     CCD-ViT-Small             92.7      94.0%
10    CCD-ViT-Tiny              91.6      92.9%

Best Score: 98.6
Top Model: DTrOCR 105M
Models Compared: 10
Score Range: 7.0
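
The "% of best" figures and the score range in this comparison can be re-derived from the ten accuracy values. A short sketch, assuming "% of best" means each score divided by the top score and "score range" means best minus worst among the ten models (names are illustrative):

```python
# Accuracy of the ten models in the comparison, as listed on this page.
scores = {
    "DTrOCR 105M": 98.6,
    "MGP-STR": 98.3,
    "CLIP4STR-L (DataComp-1B)": 98.1,
    "CLIP4STR-L": 97.4,
    "CLIP4STR-B": 97.2,
    "PARSeq": 96.9,
    "CPPD": 96.7,
    "CCD-ViT-Base": 96.1,
    "CCD-ViT-Small": 92.7,
    "CCD-ViT-Tiny": 91.6,
}

best = max(scores.values())

# Each score expressed as a percentage of the best score.
pct_of_best = {model: round(100.0 * s / best, 1) for model, s in scores.items()}

# Spread between the best and worst of the ten models, in accuracy points.
score_range = round(best - min(scores.values()), 1)

for model, pct in pct_of_best.items():
    print(f"{model}: {pct}%")
print("Score range:", score_range)
```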

accuracy (Primary)

#   Model                     Score  Date      Paper / Code
1   DTrOCR 105M               98.6   Aug 2023  -
2   MGP-STR                   98.3   Sep 2022  -
3   CLIP4STR-L (DataComp-1B)  98.1   May 2023  -
4   CLIP4STR-L                97.4   May 2023  -
5   CLIP4STR-B                97.2   May 2023  Research
6   PARSeq                    96.9   Jul 2022  Open Source; Research
7   CPPD                      96.7   Jul 2023  -
8   CCD-ViT-Base              96.1   Nov 2022  -
9   CCD-ViT-Small             92.7   Nov 2022  -
10  CCD-ViT-Tiny              91.6   Nov 2022  -
11  S-GTR                     90.6   Dec 2021  -
12  MATRN                     90.6   Nov 2021  Research
13  SIGA_T                    90.5   Mar 2022  -
14  CDistNet (Ours)           89.77  Nov 2021  -
15  ABINet-LV                 89.5   Mar 2021  Open Source; Fang et al.
16  DiffusionSTR              89.2   Jun 2023  -
17  DPAN                      89     Aug 2021  Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition (Code)
18  TrOCR-large 558M          88.1   Sep 2021  -
19  TrOCR-base 334M           86.9   Sep 2021  -
