Codesota · Computer Vision · Scene Text Detection · icdar-2013Tasks/Computer Vision/Scene Text Detection
Scene Text Detection · benchmark dataset · 2020 · EN

icdar-2013.

Dataset from Papers With Code

Legacy benchmark· last significant update Jan 2019

Legacy benchmark from 2013. For current OCR evaluation, use OCRBench, ICDAR 2019/2021, or DocVQA.

Submit a result
§ 01 · Leaderboard

Best published scores.

59 results indexed across 5 metrics. Shaded row marks current SOTA; ties broken by submission date.


Primary
accuracy · higher is better
All metrics
accuracy, f-measure, h-mean, precision, recall
accuracy· primary
15 rows
#ModelOrgSubmittedPaper / codeaccuracy
01JSTROSSFujitakeApr 2024JSTR: Judgment Improves Scene Text Recognition99.20
02CLIP4STR-L (RBU 6.5M)OSSZhao et al.May 2023CLIP4STR: A Simple Baseline for Scene Text Recognition w…99
03CLIP4STR-H (DFN-5B)OSSZhao et al.May 2023CLIP4STR: A Simple Baseline for Scene Text Recognition w…98.90
04DTrOCRAug 2023DTrOCR: Decoder-only Transformer for Optical Character R…98.80
05SVTRv2-BOSSDu et al.Nov 2024SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text R…98.70
06LISTEROSSCheng et al.Aug 2023LISTER: Neighbor Decoding for Length-Insensitive Scene T…98.60
07SVTRv2-SOSSDu et al.Nov 2024SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text R…98.50
08TrOCR-large 558MSep 2021TrOCR: Transformer-based Optical Character Recognition w…98.40
09TrOCR-base 334MSep 2021TrOCR: Transformer-based Optical Character Recognition w…98.40
10CPPDJul 2023Context Perception Parallel Decoder for Scene Text Recog…98.20
11MAERecOSSJiang et al.Jul 2023Revisiting Scene Text Recognition: A Data Perspective98.20
12PARSeqOSSResearchJul 2022Scene Text Recognition with Permuted Autoregressive Sequ…98.13
13SVTRv2-TOSSDu et al.Nov 2024SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text R…98
14ABINet-LVOSSFang et al.Mar 2021Read Like Humans: Autonomous, Bidirectional and Iterativ…97
15CRNNJul 2015An End-to-End Trainable Neural Network for Image-based S…86.70
f-measure
15 rows
#ModelOrgSubmittedPaper / codef-measure
01TextFuseNet (ResNeXt-101)May 2020papers-with-code · code94.61
02SPCNETNov 2018Scene Text Detection with Supervised Pyramid Context Net… · code92.10
03Mask TextSpotterJul 2018Mask TextSpotter: An End-to-End Trainable Neural Network… · code91.70
04WordSup (VGG16-synth-icdar)Aug 2017WordSup: Exploiting Word Annotations for Character based…90.34
05STN-OCRJul 2017STN-OCR: A single Neural Network for Text Detection and … · code90.30
06PixelLink+VGG16 2s MSJan 2018PixelLink: Detecting Scene Text via Instance Segmentatio… · code88.10
07TextBoxes++_MSJan 2018TextBoxes++: A Single-Shot Oriented Scene Text Detector · code88
08Corner Localization (multi-scale)Feb 2018Multi-Oriented Scene Text Detection via Corner Localizat… · code88
09Corner-based Region ProposalsApr 2018Detecting Multi-Oriented Text with Corner-based Region P… · code87.60
10SSTDSep 2017Single Shot Text Detector with Regional Attention · code87
11SegLinkMar 2017Detecting Oriented Text in Natural Images by Linking Seg… · code85.30
12Gupta et al.Apr 2016Synthetic Data for Text Localisation in Natural Images · code83
13USM (COCO TS + ICDAR–2013)Sep 2019papers-with-code · code80.40
14Neumann et al. *Apr 2015Efficient Scene Text Localization and Recognition with L…77.10
15Jaderberg et al.Dec 2014Reading Text in the Wild with Convolutional Neural Netwo…76.80
h-mean
1 row
#ModelOrgSubmittedPaper / codeh-mean
01CRAFTApr 2019Character Region Awareness for Text Detection · code95.20
precision
14 rows
#ModelOrgSubmittedPaper / codeprecision
01CRAFTApr 2019Character Region Awareness for Text Detection · code97.40
02TextFuseNet (ResNeXt-101)May 2020papers-with-code · code97.27
03Mask TextSpotterJul 2018Mask TextSpotter: An End-to-End Trainable Neural Network… · code95
04SPCNETNov 2018Scene Text Detection with Supervised Pyramid Context Net… · code93.80
05WordSup (VGG16-synth-icdar)Aug 2017WordSup: Exploiting Word Annotations for Character based…93.34
06Gupta et al.Apr 2016Synthetic Data for Text Localisation in Natural Images · code92
07Corner Localization (multi-scale)Feb 2018Multi-Oriented Scene Text Detection via Corner Localizat… · code92
08Corner-based Region ProposalsApr 2018Detecting Multi-Oriented Text with Corner-based Region P… · code91.90
09TextBoxes++_MSJan 2018TextBoxes++: A Single-Shot Oriented Scene Text Detector · code91
10PixelLink+VGG16 2s MSJan 2018PixelLink: Detecting Scene Text via Instance Segmentatio… · code88.60
11Jaderberg et al.Dec 2014Reading Text in the Wild with Convolutional Neural Netwo…88.50
12SSTDSep 2017Single Shot Text Detector with Regional Attention · code88
13SegLinkMar 2017Detecting Oriented Text in Natural Images by Linking Seg… · code87.70
14Neumann et al. *Apr 2015Efficient Scene Text Localization and Recognition with L…81.80
recall
14 rows
#ModelOrgSubmittedPaper / coderecall
01CRAFTApr 2019Character Region Awareness for Text Detection · code93.10
02TextFuseNet (ResNeXt-101)May 2020papers-with-code · code92.09
03SPCNETNov 2018Scene Text Detection with Supervised Pyramid Context Net… · code90.50
04Mask TextSpotterJul 2018Mask TextSpotter: An End-to-End Trainable Neural Network… · code88.60
05WordSup (VGG16-synth-icdar)Aug 2017WordSup: Exploiting Word Annotations for Character based…87.53
06PixelLink+VGG16 2s MSJan 2018PixelLink: Detecting Scene Text via Instance Segmentatio… · code87.50
07SSTDSep 2017Single Shot Text Detector with Regional Attention · code86
08Corner Localization (multi-scale)Feb 2018Multi-Oriented Scene Text Detection via Corner Localizat… · code84.40
09TextBoxes++_MSJan 2018TextBoxes++: A Single-Shot Oriented Scene Text Detector · code84
10Corner-based Region ProposalsApr 2018Detecting Multi-Oriented Text with Corner-based Region P… · code83.90
11SegLinkMar 2017Detecting Oriented Text in Natural Images by Linking Seg… · code83
12Gupta et al.Apr 2016Synthetic Data for Text Localisation in Natural Images · code75.50
13Neumann et al. *Apr 2015Efficient Scene Text Localization and Recognition with L…72.40
14Jaderberg et al.Dec 2014Reading Text in the Wild with Convolutional Neural Netwo…67.80
Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
§ 03 · Progress

5 steps
of state of the art.

Each row below marks a model that broke the previous record on accuracy. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · accuracy
  1. Jul 21, 2015CRNN86.70
  2. Mar 6, 2021ABINet-LVFang et al.97
  3. Sep 21, 2021TrOCR-large 558M98.40
  4. May 23, 2023CLIP4STR-L (RBU 6.5M)Zhao et al.99
  5. Apr 9, 2024JSTRFujitake99.20
Fig 3 · SOTA-setting models only. 5 entries span Jul 2015 Apr 2024.
§ 04 · Literature

25 papers
tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result Read submission guide
What a submission needs
  • 01A public checkpoint or API endpoint
  • 02A reproduction script with frozen commit + seed
  • 03Declared evaluation environment (Python, deps)
  • 04One row per metric declared by this dataset
  • 05A contact so we can follow up on discrepancies