Who leads the icdar-2013 benchmark?

JSTR currently leads icdar-2013 with an accuracy of 99.20.

What is the state-of-the-art score on icdar-2013?

The state-of-the-art result on icdar-2013 is 99.20 (accuracy), achieved by JSTR as of 2024.

How many models are tracked on icdar-2013?

Codesota tracks 31 models on icdar-2013 across 5 metrics.

When was the icdar-2013 leaderboard last updated?

The icdar-2013 leaderboard on Codesota includes results through 2024, with the earliest tracked result from 2014.

Scene Text Detection · benchmark dataset · 2020 · EN

icdar-2013.

Name: icdar-2013 Benchmark Results
Creator: Codesota
Published: 2014-01-01
License: https://creativecommons.org/licenses/by/4.0/

Dataset from Papers With Code

Legacy benchmark · last significant update Jan 2019

Legacy benchmark from 2013. For current OCR evaluation, use OCRBench, ICDAR 2019/2021, or DocVQA.

§ 01 · Leaderboard

Best published scores.

59 results indexed across 5 metrics. Shaded row marks current SOTA; ties broken by submission date.

Primary: accuracy · higher is better
All metrics: accuracy, f-measure, h-mean, precision, recall
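The ordering rule stated above (higher score first, ties broken by submission date) can be sketched as a two-part sort key. This is a minimal illustration, not Codesota's actual code: the four rows are copied from the f-measure table, the first-of-month dates are placeholders because the tables list only month and year, and the assumption that the earlier submission wins a tie is inferred from how tied rows are ordered in the tables.

```python
from datetime import date

# Rows copied from the f-measure table. Day-of-month is a placeholder (assumption),
# since the leaderboard only shows month and year.
rows = [
    {"model": "Corner Localization (multi-scale)", "submitted": date(2018, 2, 1), "score": 88.0},
    {"model": "PixelLink+VGG16 2s MS", "submitted": date(2018, 1, 1), "score": 88.10},
    {"model": "SSTD", "submitted": date(2017, 9, 1), "score": 87.0},
    {"model": "TextBoxes++_MS", "submitted": date(2018, 1, 1), "score": 88.0},
]


def rank(rows):
    """Sort by score, highest first; break ties by earlier submission date (assumed)."""
    return sorted(rows, key=lambda r: (-r["score"], r["submitted"]))


ranked = rank(rows)
for position, row in enumerate(ranked, start=1):
    print(f"{position:02d}  {row['model']:<36} {row['score']:.2f}")
```

With these inputs, the two rows tied at 88 come out with TextBoxes++_MS (Jan 2018) ahead of Corner Localization (Feb 2018), the same relative order as in the f-measure table.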

accuracy · primary

15 rows

#	Model	Org	Submitted	Paper / code	accuracy
01	JSTR · Open	Fujitake	Apr 2024	JSTR: Judgment Improves Scene Text Recognition	99.20
02	CLIP4STR-L (RBU 6.5M) · Open	Zhao et al.	May 2023	CLIP4STR: A Simple Baseline for Scene Text Recognition w…	99
03	CLIP4STR-H (DFN-5B) · Open	Zhao et al.	May 2023	CLIP4STR: A Simple Baseline for Scene Text Recognition w…	98.90
04	DTrOCR	—	Aug 2023	DTrOCR: Decoder-only Transformer for Optical Character R…	98.80
05	SVTRv2-B · Open	Du et al.	Nov 2024	SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text R…	98.70
06	LISTER · Open	Cheng et al.	Aug 2023	LISTER: Neighbor Decoding for Length-Insensitive Scene T…	98.60
07	SVTRv2-S · Open	Du et al.	Nov 2024	SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text R…	98.50
08	TrOCR-large 558M	—	Sep 2021	TrOCR: Transformer-based Optical Character Recognition w…	98.40
09	TrOCR-base 334M	—	Sep 2021	TrOCR: Transformer-based Optical Character Recognition w…	98.40
10	CPPD	—	Jul 2023	Context Perception Parallel Decoder for Scene Text Recog…	98.20
11	MAERec · Open	Jiang et al.	Jul 2023	Revisiting Scene Text Recognition: A Data Perspective	98.20
12	PARSeq · Open	Research	Jul 2022	Scene Text Recognition with Permuted Autoregressive Sequ…	98.13
13	SVTRv2-T · Open	Du et al.	Nov 2024	SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text R…	98
14	ABINet-LV · Open	Fang et al.	Mar 2021	Read Like Humans: Autonomous, Bidirectional and Iterativ…	97
15	CRNN	—	Jul 2015	An End-to-End Trainable Neural Network for Image-based S…	86.70

f-measure

15 rows

#	Model	Org	Submitted	Paper / code	f-measure
01	TextFuseNet (ResNeXt-101)	—	May 2020	papers-with-code · code	94.61
02	SPCNET	—	Nov 2018	Scene Text Detection with Supervised Pyramid Context Net… · code	92.10
03	Mask TextSpotter	—	Jul 2018	Mask TextSpotter: An End-to-End Trainable Neural Network… · code	91.70
04	WordSup (VGG16-synth-icdar)	—	Aug 2017	WordSup: Exploiting Word Annotations for Character based…	90.34
05	STN-OCR	—	Jul 2017	STN-OCR: A single Neural Network for Text Detection and … · code	90.30
06	PixelLink+VGG16 2s MS	—	Jan 2018	PixelLink: Detecting Scene Text via Instance Segmentatio… · code	88.10
07	TextBoxes++_MS	—	Jan 2018	TextBoxes++: A Single-Shot Oriented Scene Text Detector · code	88
08	Corner Localization (multi-scale)	—	Feb 2018	Multi-Oriented Scene Text Detection via Corner Localizat… · code	88
09	Corner-based Region Proposals	—	Apr 2018	Detecting Multi-Oriented Text with Corner-based Region P… · code	87.60
10	SSTD	—	Sep 2017	Single Shot Text Detector with Regional Attention · code	87
11	SegLink	—	Mar 2017	Detecting Oriented Text in Natural Images by Linking Seg… · code	85.30
12	Gupta et al.	—	Apr 2016	Synthetic Data for Text Localisation in Natural Images · code	83
13	USM (COCO TS + ICDAR–2013)	—	Sep 2019	papers-with-code · code	80.40
14	Neumann et al. *	—	Apr 2015	Efficient Scene Text Localization and Recognition with L…	77.10
15	Jaderberg et al.	—	Dec 2014	Reading Text in the Wild with Convolutional Neural Netwo…	76.80

h-mean

1 row

#	Model	Org	Submitted	Paper / code	h-mean
01	CRAFT	—	Apr 2019	Character Region Awareness for Text Detection · code	95.20

precision

14 rows

#	Model	Org	Submitted	Paper / code	precision
01	CRAFT	—	Apr 2019	Character Region Awareness for Text Detection · code	97.40
02	TextFuseNet (ResNeXt-101)	—	May 2020	papers-with-code · code	97.27
03	Mask TextSpotter	—	Jul 2018	Mask TextSpotter: An End-to-End Trainable Neural Network… · code	95
04	SPCNET	—	Nov 2018	Scene Text Detection with Supervised Pyramid Context Net… · code	93.80
05	WordSup (VGG16-synth-icdar)	—	Aug 2017	WordSup: Exploiting Word Annotations for Character based…	93.34
06	Gupta et al.	—	Apr 2016	Synthetic Data for Text Localisation in Natural Images · code	92
07	Corner Localization (multi-scale)	—	Feb 2018	Multi-Oriented Scene Text Detection via Corner Localizat… · code	92
08	Corner-based Region Proposals	—	Apr 2018	Detecting Multi-Oriented Text with Corner-based Region P… · code	91.90
09	TextBoxes++_MS	—	Jan 2018	TextBoxes++: A Single-Shot Oriented Scene Text Detector · code	91
10	PixelLink+VGG16 2s MS	—	Jan 2018	PixelLink: Detecting Scene Text via Instance Segmentatio… · code	88.60
11	Jaderberg et al.	—	Dec 2014	Reading Text in the Wild with Convolutional Neural Netwo…	88.50
12	SSTD	—	Sep 2017	Single Shot Text Detector with Regional Attention · code	88
13	SegLink	—	Mar 2017	Detecting Oriented Text in Natural Images by Linking Seg… · code	87.70
14	Neumann et al. *	—	Apr 2015	Efficient Scene Text Localization and Recognition with L…	81.80

recall

14 rows

#	Model	Org	Submitted	Paper / code	recall
01	CRAFT	—	Apr 2019	Character Region Awareness for Text Detection · code	93.10
02	TextFuseNet (ResNeXt-101)	—	May 2020	papers-with-code · code	92.09
03	SPCNET	—	Nov 2018	Scene Text Detection with Supervised Pyramid Context Net… · code	90.50
04	Mask TextSpotter	—	Jul 2018	Mask TextSpotter: An End-to-End Trainable Neural Network… · code	88.60
05	WordSup (VGG16-synth-icdar)	—	Aug 2017	WordSup: Exploiting Word Annotations for Character based…	87.53
06	PixelLink+VGG16 2s MS	—	Jan 2018	PixelLink: Detecting Scene Text via Instance Segmentatio… · code	87.50
07	SSTD	—	Sep 2017	Single Shot Text Detector with Regional Attention · code	86
08	Corner Localization (multi-scale)	—	Feb 2018	Multi-Oriented Scene Text Detection via Corner Localizat… · code	84.40
09	TextBoxes++_MS	—	Jan 2018	TextBoxes++: A Single-Shot Oriented Scene Text Detector · code	84
10	Corner-based Region Proposals	—	Apr 2018	Detecting Multi-Oriented Text with Corner-based Region P… · code	83.90
11	SegLink	—	Mar 2017	Detecting Oriented Text in Natural Images by Linking Seg… · code	83
12	Gupta et al.	—	Apr 2016	Synthetic Data for Text Localisation in Natural Images · code	75.50
13	Neumann et al. *	—	Apr 2015	Efficient Scene Text Localization and Recognition with L…	72.40
14	Jaderberg et al.	—	Dec 2014	Reading Text in the Wild with Convolutional Neural Netwo…	67.80

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
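The f-measure and h-mean columns are conventionally the harmonic mean of precision and recall. The sketch below applies that standard formula to two rows from the tables above. That the listed scores were computed exactly this way is an assumption; for CRAFT and TextFuseNet the result matches the published h-mean and f-measure to two decimals, while other rows may differ slightly because their precision and recall are themselves rounded.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (both given in percent)."""
    return 2 * precision * recall / (precision + recall)


# Precision and recall copied from the tables above.
craft = round(f_measure(97.40, 93.10), 2)        # listed h-mean: 95.20
textfusenet = round(f_measure(97.27, 92.09), 2)  # listed f-measure: 94.61

print(craft, textfusenet)
```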

§ 03 · Progress

5 steps of state of the art.

Each row below marks a model that broke the previous record on accuracy. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · accuracy

Date	Model	Org	accuracy
Jul 21, 2015	CRNN	—	86.70
Mar 6, 2021	ABINet-LV	Fang et al.	97
Sep 21, 2021	TrOCR-large 558M	—	98.40
May 23, 2023	CLIP4STR-L (RBU 6.5M)	Zhao et al.	99
Apr 9, 2024	JSTR	Fujitake	99.20

Fig 3 · SOTA-setting models only. 5 entries span Jul 2015 → Apr 2024.
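The SOTA line can be reproduced from the accuracy table with a running maximum over the entries in chronological order. The following is a minimal sketch under two assumptions that the page does not state: entries are compared at month granularity (the table lists only month and year), and when several entries share a month the highest-scoring one is considered first.

```python
# (model, (year, month), accuracy) copied from the accuracy table.
entries = [
    ("JSTR", (2024, 4), 99.20),
    ("CLIP4STR-L (RBU 6.5M)", (2023, 5), 99.0),
    ("CLIP4STR-H (DFN-5B)", (2023, 5), 98.90),
    ("DTrOCR", (2023, 8), 98.80),
    ("SVTRv2-B", (2024, 11), 98.70),
    ("LISTER", (2023, 8), 98.60),
    ("SVTRv2-S", (2024, 11), 98.50),
    ("TrOCR-large 558M", (2021, 9), 98.40),
    ("TrOCR-base 334M", (2021, 9), 98.40),
    ("CPPD", (2023, 7), 98.20),
    ("MAERec", (2023, 7), 98.20),
    ("PARSeq", (2022, 7), 98.13),
    ("SVTRv2-T", (2024, 11), 98.0),
    ("ABINet-LV", (2021, 3), 97.0),
    ("CRNN", (2015, 7), 86.70),
]


def sota_steps(entries):
    """Return the entries that strictly beat every earlier score."""
    # Oldest first; within the same month, best score first (assumption).
    ordered = sorted(entries, key=lambda e: (e[1], -e[2]))
    best = float("-inf")
    steps = []
    for model, when, score in ordered:
        if score > best:  # strict: equalling the record does not set a new one
            steps.append((model, when, score))
            best = score
    return steps


steps = sota_steps(entries)
for model, (year, month), score in steps:
    print(f"{year}-{month:02d}  {model:<24} {score:.2f}")
```

Run on the 15 accuracy rows, this yields the same five models as the SOTA line: CRNN, ABINet-LV, TrOCR-large 558M, CLIP4STR-L (RBU 6.5M), and JSTR.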

§ 04 · Literature

25 papers tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

Paper	Published	Models	Links
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition	Nov 2024	SVTRv2-B, SVTRv2-S, SVTRv2-T	arXiv
JSTR: Judgment Improves Scene Text Recognition	Apr 2024	JSTR	arXiv
DTrOCR: Decoder-only Transformer for Optical Character Recognition	Aug 2023	DTrOCR	arXiv
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition	Aug 2023	LISTER	arXiv
Context Perception Parallel Decoder for Scene Text Recognition	Jul 2023	CPPD	arXiv
Revisiting Scene Text Recognition: A Data Perspective	Jul 2023	MAERec	arXiv
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	May 2023	CLIP4STR-L (RBU 6.5M), CLIP4STR-H (DFN-5B)	arXiv
Scene Text Recognition with Permuted Autoregressive Sequence Models	Jul 2022	PARSeq	arXiv
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models	Sep 2021	TrOCR-large 558M, TrOCR-base 334M	arXiv
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition	Mar 2021	ABINet-LV	arXiv
Character Region Awareness for Text Detection	Apr 2019	CRAFT	arXiv · code
Scene Text Detection with Supervised Pyramid Context Network	Nov 2018	SPCNET	arXiv · code
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes	Jul 2018	Mask TextSpotter	arXiv · code
Detecting Multi-Oriented Text with Corner-based Region Proposals	Apr 2018	Corner-based Region Proposals	arXiv · code
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation	Feb 2018	Corner Localization (multi-scale)	arXiv · code
TextBoxes++: A Single-Shot Oriented Scene Text Detector	Jan 2018	TextBoxes++_MS	arXiv · code
PixelLink: Detecting Scene Text via Instance Segmentation	Jan 2018	PixelLink+VGG16 2s MS	arXiv · code
Single Shot Text Detector with Regional Attention	Sep 2017	SSTD	arXiv · code
WordSup: Exploiting Word Annotations for Character based Text Detection	Aug 2017	WordSup (VGG16-synth-icdar)	arXiv
STN-OCR: A single Neural Network for Text Detection and Text Recognition	Jul 2017	STN-OCR	arXiv · code
Detecting Oriented Text in Natural Images by Linking Segments	Mar 2017	SegLink	arXiv · code
Synthetic Data for Text Localisation in Natural Images	Apr 2016	Gupta et al.	arXiv · code
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition	Jul 2015	CRNN	arXiv
Efficient Scene Text Localization and Recognition with Local Character Refinement	Apr 2015	Neumann et al. *	arXiv
Reading Text in the Wild with Convolutional Neural Networks	Dec 2014	Jaderberg et al.	arXiv

§ 06 · Contribute

Have a score that beats this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.

What a submission needs

01 A public checkpoint or API endpoint
02 A reproduction script with frozen commit + seed
03 Declared evaluation environment (Python, deps)
04 One row per metric declared by this dataset
05 A contact so we can follow up on discrepancies
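As a rough pre-submission check, the five requirements above could be expressed as a small validation function. Everything about the data layout here is hypothetical: the page does not specify a submission format, so the field names (`checkpoint_url`, `commit`, `seed`, `environment`, `contact`, `scores`) and the example values are illustrative assumptions. Only the five metric names come from this dataset's listing.

```python
# Metric names as listed for this dataset.
DECLARED_METRICS = {"accuracy", "f-measure", "h-mean", "precision", "recall"}

# Hypothetical field names; the real submission format is not specified on this page.
REQUIRED_FIELDS = ("checkpoint_url", "commit", "seed", "environment", "contact")


def missing_items(submission):
    """List what a submission dict lacks; an empty list means nothing obvious is missing."""
    problems = []
    for name in REQUIRED_FIELDS:
        # Treat absent, None, and empty-string values as missing; a seed of 0 is valid.
        if submission.get(name) in (None, ""):
            problems.append(f"missing field: {name}")
    reported = set(submission.get("scores", {}))
    for metric in sorted(DECLARED_METRICS - reported):
        problems.append(f"missing metric: {metric}")
    return problems


# Illustrative submission with placeholder values.
example = {
    "checkpoint_url": "https://example.com/model.ckpt",
    "commit": "0123abc",
    "seed": 0,
    "environment": {"python": "3.11"},
    "scores": {"accuracy": 98.0},
}

problems = missing_items(example)
for problem in problems:
    print(problem)
```

For the example above, the check flags the absent contact and the four metrics without a score.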
