svt
Dataset from Papers With Code
CLIP4STR-H (DFN-5B)
Unknown
99.1
accuracy
accuracy Progress Over Time
Showing 13 breakthroughs from Jun 2014 to May 2023
Key Milestones
From paper: Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
From paper: An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
From paper: Star-net: A spatial attention residue network for scene text recognition.
From paper: ASTER: An Attentional Scene Text Recognizer with Flexible Rectification
From paper: On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention
From paper: Towards Accurate Scene Text Recognition with Semantic Reasoning Networks
From paper: Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition
From paper: Why You Should Try the Real Data for the Scene Text Recognition
From paper: Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
From paper: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Top Models Performance Comparison
Top 10 models ranked by accuracy
accuracyPrimary
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | CLIP4STR-H (DFN-5B) | 99.1 | May 2023 | |
| 2 | DTrOCR 105M | 98.9 | Aug 2023 | |
| 3 | MGP-STR | 98.6 | Sep 2022 | |
| 4 | CLIP4STR-L (DataComp-1B) | 98.6 | May 2023 | |
| 5 | CPPD | 98.5 | Jul 2023 | |
| 6 | CLIP4STR-L | 98.5 | May 2023 | |
| 7 | CLIP4STR-B* | 98.3 | May 2023 | |
| 8 | CCD-ViT-Base(ARD_2.8M) | 97.8 | Nov 2022 | |
| 9 | CCD-ViT-Small(ARD_2.8M) | 96.4 | Nov 2022 | |
| 10 | CCD-ViT-Tiny(ARD_2.8M) | 96 | Nov 2022 | |
| 11 | S-GTR | 95.8 | Dec 2021 | |
| 12 | SIGA_T | 95.1 | Mar 2022 | |
| 13 | MATRN | 95 | Nov 2021 | |
| 14 | Yet Another Text Recognizer | 94.7 | Jul 2021 | |
| 15 | NRTR+TPS++ | 94.6 | May 2023 | |
| 16 | DPAN | 93.9 | Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text RecognitionCode | Aug 2021 |
| 17 | CDistNet (Ours) | 93.82 | Nov 2021 | |
| 18 | DiffusionSTR | 93.6 | Jun 2023 | |
| 19 | RCEED | 91.8 | Jun 2021 | |
| 20 | SRN | 91.5 | Mar 2020 | |
| 21 | SATRN | 91.3 | Oct 2019 | |
| 22 | CSTR | 90.6 | Feb 2021 | |
| 23 | TextScanner | 90.1 | Dec 2019 | |
| 24 | SEED | 89.6 | May 2020 | |
| 25 | ASTER | 89.5 | ASTER: An Attentional Scene Text Recognizer with Flexible RectificationCode | Jun 2018 |
| 26 | DAN | 89.2 | Dec 2019 | |
| 27 | SAFL | 88.6 | Jan 2022 | |
| 28 | ViTSTR | 87.7 | May 2021 | |
| 29 | Baek et al. | 87.5 | Apr 2019 | |
| 30 | CA-FCN | 86.4 | Sep 2018 | |
| 31 | SAR | 84.5 | Nov 2018 | |
| 32 | STAR-Net | 83.6 | Star-net: A spatial attention residue network for scene text recognition.Code | Sep 2016 |
| 33 | RARE | 81.9 | Mar 2016 | |
| 34 | CRNN | 80.8 | Jul 2015 | |
| 35 | CHAR | 68 | Jun 2014 |