| 01 | DTrOCR 105M From paper: DTrOCR: Decoder-only Transformer for Optical Character Recognition | verified | 99.4 | 2023 | Paper ↗Code ↗ | Looks wrong? |
| 02 | CLIP4STR-L (DataComp-1B) From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | verified | 99 | 2023 | Paper ↗Code ↗ | Looks wrong? |
| 03 | CLIP4STR-L From paper: An Empirical Study of Scaling Law for OCR | verified | 98.5 | 2023 | Paper ↗Code ↗Source ↗ | Looks wrong? |
| 04 | MGP-STR From paper: Multi-Granularity Prediction for Scene Text Recognition | verified | 98.5 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 05 | CLIP4STR-B From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | verified | 98.3 | 2023 | Paper ↗Code ↗ | Looks wrong? |
| 06 | CCD-ViT-Small(ARD_2.8M) From paper: Self-supervised Character-to-Character Distillation for Text Recognition | verified | 98.3 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 07 | CCD-ViT-Base(ARD_2.8M) From paper: Self-supervised Character-to-Character Distillation for Text Recognition | verified | 98.3 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 08 | MATRN From paper: Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features | verified | 97.9 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 09 | SIGA_T From paper: Self-supervised Implicit Glyph Attention for Text Recognition | verified | 97.8 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 10 | S-GTR From paper: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition | verified | 97.8 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 11 | DPAN From paper: Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition | verified | 97.7 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 12 | CDistNet (Ours) From paper: CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition | verified | 97.67 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 13 | CCD-ViT-Tiny(ARD_2.8M) From paper: Self-supervised Character-to-Character Distillation for Text Recognition | verified | 97.5 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 14 | SVTR-L (Large) From paper: SVTR: Scene Text Recognition with a Single Visual Model | verified | 97.2 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 15 | SVTR-B (Base) From paper: SVTR: Scene Text Recognition with a Single Visual Model | verified | 97.1 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 16 | DiffusionSTR From paper: DiffusionSTR: Diffusion Model for Scene Text Recognition | verified | 97.1 | 2023 | Paper ↗ | Looks wrong? |
| 17 | Yet Another Text Recognizer From paper: Why You Should Try the Real Data for the Scene Text Recognition | verified | 96.8 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 18 | SVTR-T (Tiny) From paper: SVTR: Scene Text Recognition with a Single Visual Model | verified | 96.3 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 19 | SVTR-S (Small) From paper: SVTR: Scene Text Recognition with a Single Visual Model | verified | 95.7 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 20 | SRN From paper: Towards Accurate Scene Text Recognition with Semantic Reasoning Networks | verified | 95.5 | 2020 | Paper ↗Code ↗ | Looks wrong? |
| 21 | RCEED From paper: Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition | verified | 94.7 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 22 | SATRN From paper: On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention | verified | 94.1 | 2019 | Paper ↗Code ↗ | Looks wrong? |
| 23 | DAN From paper: Decoupled Attention Network for Text Recognition | verified | 93.9 | 2019 | Paper ↗Code ↗ | Looks wrong? |
| 24 | CSTR From paper: Revisiting Classification Perspective on Scene Text Recognition | verified | 93.2 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 25 | TextScanner From paper: TextScanner: Reading Characters in Order for Robust Scene Text Recognition | verified | 92.9 | 2019 | Paper ↗ | Looks wrong? |
| 26 | SAFL From paper: SAFL: A Self-Attention Scene Text Recognizer with Focal Loss | verified | 92.8 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 27 | SEED From paper: SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition | verified | 92.8 | 2020 | Paper ↗Code ↗ | Looks wrong? |
| 28 | ViTSTR From paper: Vision Transformer for Fast and Efficient Scene Text Recognition | verified | 92.4 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 29 | Baek et al. From paper: What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis | verified | 92.3 | 2019 | Paper ↗Code ↗ | Looks wrong? |
| 30 | ASTER From paper: ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | verified | 91.8 | 2018 | Paper ↗Code ↗ | Looks wrong? |
| 31 | CA-FCN From paper: Scene Text Recognition from Two-Dimensional Perspective | verified | 91.5 | 2018 | Paper ↗ | Looks wrong? |
| 32 | SAR From paper: Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition | verified | 91 | 2018 | Paper ↗Code ↗ | Looks wrong? |
| 33 | STAR-Net From paper: Star-net: A spatial attention residue network for scene text recognition. | verified | 89.1 | 2016 | Paper ↗Code ↗ | Looks wrong? |
| 34 | RARE From paper: Robust Scene Text Recognition with Automatic Rectification | verified | 88.6 | 2016 | Paper ↗Code ↗ | Looks wrong? |
| 35 | CRNN From paper: An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition | verified | 86.7 | 2015 | Paper ↗Code ↗ | Looks wrong? |
| 36 | CHAR From paper: Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition | verified | 79.5 | 2014 | Paper ↗Code ↗ | Looks wrong? |