Codesota · Computer Vision · Scene Text Detection · ICDAR 2015
Scene Text Detection · benchmark dataset · 2015 · EN

ICDAR 2015 Incidental Scene Text.

1,000 training and 500 test images captured with wearable cameras. A widely used benchmark for scene text detection.

§ 01 · Leaderboard

Best published scores.

188 results indexed across 8 metrics. Shaded row marks current SOTA; ties broken by submission date.
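The ordering rule can be sketched as a two-part sort key. This is a minimal illustration, not the site's implementation: the `Row` type and `rank_rows` helper are hypothetical, dates are kept at month precision as shown in the tables, and the tie-break direction (earlier submission first) is an assumption, since the page does not state it.

```python
from typing import NamedTuple, Optional, Tuple


class Row(NamedTuple):
    model: str
    score: float
    submitted: Optional[Tuple[int, int]]  # (year, month); None when no date is listed


def rank_rows(rows, higher_is_better=True):
    """Sort by score, then by submission date (assumed: earlier first)."""
    def sort_key(row):
        score_key = -row.score if higher_is_better else row.score
        # Undated rows sort after dated ones within a tie.
        date_key = row.submitted if row.submitted is not None else (9999, 12)
        return (score_key, date_key)
    return sorted(rows, key=sort_key)


# Three f-measure rows from this page, two of them tied at 84.50.
rows = [
    Row("Corner Localization (multi-scale)", 84.30, (2018, 2)),
    Row("Corner-based Region Proposals", 84.50, (2018, 4)),
    Row("SLPR", 84.50, (2018, 1)),
]
ranked = rank_rows(rows)
for rank, row in enumerate(ranked, start=1):
    print(f"{rank:02d} {row.model} {row.score:.2f}")
```

Passing `higher_is_better=False` reverses the score ordering for any metric where lower values are preferred.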


Primary metric: f1 (higher is better)
All metrics: accuracy, f-measure, f-measure-generic-lexicon, f-measure-strong-lexicon, f-measure-weak-lexicon, fps, precision, recall
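As a quick check on how the precision, recall, and f-measure columns relate, the sketch below assumes that f1 and f-measure on this page denote the usual harmonic mean of precision and recall. The function name `f_measure` is illustrative and not part of any Codesota tooling.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as listed on this page for TextFuseNet (ResNeXt-101).
print(round(f_measure(93.96, 90.56), 2))  # 92.23, matching its f-measure row
```

The same calculation applied to CharNet H-88 (multi-scale), with precision 92.65 and recall 90.47, gives 91.55, which also agrees with the listed f-measure.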
accuracy
2 rows
| # | Model | Org | Submitted | Paper / code | accuracy |
|---|---|---|---|---|---|
| 01 | PGNet-A | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 62.30 |
| 02 | PGNet-E | | | | 57.40 |
f-measure
43 rows
| # | Model | Org | Submitted | Paper / code | f-measure |
|---|---|---|---|---|---|
| 01 | TextFuseNet (ResNeXt-101) | | May 2020 | papers-with-code · code | 92.23 |
| 02 | CharNet H-88 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 91.55 |
| 03 | CharNet H-88 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 90.97 |
| 04 | CharNet H-50 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 90.16 |
| 05 | SBD | | Dec 2019 | Exploring the Capacity of an Orderless Box Discretizatio… · code | 90.10 |
| 06 | CharNet H-57 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 90.06 |
| 07 | FreeReal+DBNet | OSS · SJTU | Sep 2024 | arxiv | 90 |
| 08 | TESTR | | Apr 2022 | github-readme | 90 |
| 09 | FOTS MS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 89.84 |
| 10 | CharNet H-50 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 89.70 |
| 11 | CharNet H-57 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 89.66 |
| 12 | PMTD* | | Mar 2019 | Pyramid Mask Text Detector · code | 89.33 |
| 13 | GNNets | | Sep 2019 | Geometry Normalization Networks for Accurate Scene Text … · code | 88.52 |
| 14 | FOTS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 87.99 |
| 15 | DBNet++ (ResNet-50) (1152) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 87.30 |
| 16 | DB-ResNet-50 (1152) | | Nov 2019 | Real-time Scene Text Detection with Differentiable Binar… · code | 87.30 |
| 17 | SPCNET | | Nov 2018 | Scene Text Detection with Supervised Pyramid Context Net… · code | 87.20 |
| 18 | FAST-B-1280 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 87.10 |
| 19 | SAST | | Aug 2019 | A Single-Shot Arbitrarily-Shaped Text Detector based on … · code | 86.91 |
| 20 | CRAFT | | Apr 2019 | Character Region Awareness for Text Detection · code | 86.90 |
| 21 | EK-Net++ | OSS · Research | Jan 2024 | journal-paper | 86.72 |
| 22 | FAST-B-896 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 86.30 |
| 23 | Mask TextSpotter | | Jul 2018 | Mask TextSpotter: An End-to-End Trainable Neural Network… · code | 86 |
| 24 | EK-Net | OSS · Zhu et al. | Jan 2024 | arxiv | 85.72 |
| 25 | PSENet-1s | | Mar 2019 | Shape Robust Text Detection with Progressive Scale Expan… · code | 85.70 |
| 26 | FAST-B-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 84.70 |
| 27 | SLPR | | Jan 2018 | PixelLink: Detecting Scene Text via Instance Segmentatio… · code | 84.50 |
| 28 | Corner-based Region Proposals | | Apr 2018 | Detecting Multi-Oriented Text with Corner-based Region P… · code | 84.50 |
| 29 | Corner Localization (multi-scale) | | Feb 2018 | Multi-Oriented Scene Text Detection via Corner Localizat… · code | 84.30 |
| 30 | FTSN + MNMS | | Sep 2017 | Fused Text Segmentation Networks for Multi-oriented Scen… | 84.10 |
| 31 | PixelLink+VGG16 2s | | Jul 2018 | TextSnake: A Flexible Representation for Detecting Text … · code | 83.70 |
| 32 | DBNet++ (ResNet-18) (736) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 83.10 |
| 33 | PAN | | Apr 2017 | EAST: An Efficient and Accurate Scene Text Detector · code | 82.90 |
| 34 | FAST-S-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 82.90 |
| 35 | Quad_MS | | Jan 2018 | TextBoxes++: A Single-Shot Oriented Scene Text Detector · code | 82.90 |
| 36 | TextSnake | | Aug 2019 | Efficient and Accurate Arbitrary-Shaped Text Detection w… · code | 82.60 |
| 37 | FAST-T-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 81.70 |
| 38 | EAST + PVANET2x RBOX (multi-scale) | | Sep 2017 | Single Shot Text Detector with Regional Attention · code | 80.70 |
| 39 | EAST + PVANET2x RBOX (single-scale) | | Apr 2017 | EAST: An Efficient and Accurate Scene Text Detector · code | 78.20 |
| 40 | WordSup (VGG16-synth-icdar) | | Mar 2017 | Detecting Oriented Text in Natural Images by Linking Seg… · code | 78.20 |
| 41 | SSTD | | Aug 2017 | WordSup: Exploiting Word Annotations for Character based… | 77 |
| 42 | SegLink | | Apr 2016 | Multi-Oriented Text Detection with Fully Convolutional N… · code | 75 |
| 43 | MCLAB_FCN | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 53.60 |
f-measure-generic-lexicon
18 rows
| # | Model | Org | Submitted | Paper / code | f-measure-generic-lexicon |
|---|---|---|---|---|---|
| 01 | UNITS | | Apr 2023 | Towards Unified Scene Text Spotting based on Sequence Ge… · code | 80.30 |
| 02 | A3S | | Feb 2023 | A3S: Adversarial learning of semantic representations fo… | 79.60 |
| 03 | DeepSolo (ViTAEv2-S, TextOCR) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 79.50 |
| 04 | DeepSolo (ResNet-50, TextOCR) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 79.10 |
| 05 | DeepSolo (ResNet-50) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 76.90 |
| 06 | GLASS | | Aug 2022 | GLASS: Global to Local Attention for Scene-Text Spotting · code | 76.30 |
| 07 | SRTS | | Jul 2022 | Single Shot Self-Reliant Scene Text Spotter by Decoupled… · code | 74.50 |
| 08 | MaskTextSpotter v3 | | Jul 2020 | Mask TextSpotter v3: Segmentation Proposal Network for R… · code | 74.20 |
| 09 | TESTR | | Apr 2022 | Text Spotting Transformers · code | 73.60 |
| 10 | ABCNet v2 | | May 2021 | ABCNet v2: Adaptive Bezier-Curve Network for Real-time E… · code | 73 |
| 11 | SPTS v2 | | Jan 2023 | SPTS v2: Single-Point Scene Text Spotting · code | 72.60 |
| 12 | SwinTextSpotter | | Mar 2022 | SwinTextSpotter: Scene Text Spotting via Better Synergy … · code | 70.50 |
| 13 | MANGO | | Dec 2020 | MANGO: A Mask Attention Guided One-Stage Scene Text Spot… · code | 67.30 |
| 14 | SPTS | | Dec 2021 | SPTS: Single-Point Text Spotting · code | 65.80 |
| 15 | TextDragon | | Oct 2019 | papers-with-code | 65.20 |
| 16 | TextPerceptron | | Feb 2020 | Text Perceptron: Towards End-to-End Arbitrary-Shaped Tex… · code | 65.10 |
| 17 | PGNet | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 63.50 |
| 18 | FOTS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 62.20 |
f-measure-strong-lexicon
18 rows
| # | Model | Org | Submitted | Paper / code | f-measure-strong-lexicon |
|---|---|---|---|---|---|
| 01 | UNITS | | Apr 2023 | Towards Unified Scene Text Spotting based on Sequence Ge… · code | 89 |
| 02 | DeepSolo (ViTAEv2-S, TextOCR) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 88.10 |
| 03 | DeepSolo (ResNet-50, TextOCR) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 88 |
| 04 | DeepSolo (ResNet-50) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 86.80 |
| 05 | SRTS | | Jul 2022 | Single Shot Self-Reliant Scene Text Spotter by Decoupled… · code | 85.60 |
| 06 | TESTR | | Apr 2022 | Text Spotting Transformers · code | 85.20 |
| 07 | A3S | | Feb 2023 | A3S: Adversarial learning of semantic representations fo… | 84.80 |
| 08 | GLASS | | Aug 2022 | GLASS: Global to Local Attention for Scene-Text Spotting · code | 84.70 |
| 09 | SwinTextSpotter | | Mar 2022 | SwinTextSpotter: Scene Text Spotting via Better Synergy … · code | 83.90 |
| 10 | FOTS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 83.60 |
| 11 | PGNet | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 83.30 |
| 12 | MaskTextSpotter v3 | | Jul 2020 | Mask TextSpotter v3: Segmentation Proposal Network for R… · code | 83.30 |
| 13 | ABCNet v2 | | May 2021 | ABCNet v2: Adaptive Bezier-Curve Network for Real-time E… · code | 82.70 |
| 14 | TextDragon | | Oct 2019 | papers-with-code | 82.50 |
| 15 | SPTS v2 | | Jan 2023 | SPTS v2: Single-Point Scene Text Spotting · code | 82.30 |
| 16 | MANGO | | Dec 2020 | MANGO: A Mask Attention Guided One-Stage Scene Text Spot… · code | 81.80 |
| 17 | TextPerceptron | | Feb 2020 | Text Perceptron: Towards End-to-End Arbitrary-Shaped Tex… · code | 80.50 |
| 18 | SPTS | | Dec 2021 | SPTS: Single-Point Text Spotting · code | 77.50 |
f-measure-weak-lexicon
18 rows
| # | Model | Org | Submitted | Paper / code | f-measure-weak-lexicon |
|---|---|---|---|---|---|
| 01 | UNITS | | Apr 2023 | Towards Unified Scene Text Spotting based on Sequence Ge… · code | 84.10 |
| 02 | DeepSolo (ViTAEv2-S, TextOCR) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 83.90 |
| 03 | A3S | | Feb 2023 | A3S: Adversarial learning of semantic representations fo… | 83.70 |
| 04 | DeepSolo (ResNet-50, TextOCR) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 83.50 |
| 05 | DeepSolo (ResNet-50) | | Nov 2022 | DeepSolo: Let Transformer Decoder with Explicit Points S… · code | 81.90 |
| 06 | SRTS | | Jul 2022 | Single Shot Self-Reliant Scene Text Spotter by Decoupled… · code | 81.70 |
| 07 | GLASS | | Aug 2022 | GLASS: Global to Local Attention for Scene-Text Spotting · code | 80.10 |
| 08 | TESTR | | Apr 2022 | Text Spotting Transformers · code | 79.40 |
| 09 | MANGO | | Dec 2020 | MANGO: A Mask Attention Guided One-Stage Scene Text Spot… · code | 78.90 |
| 10 | ABCNet v2 | | May 2021 | ABCNet v2: Adaptive Bezier-Curve Network for Real-time E… · code | 78.50 |
| 11 | PGNet | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 78.30 |
| 12 | TextDragon | | Oct 2019 | papers-with-code | 78.30 |
| 13 | MaskTextSpotter v3 | | Jul 2020 | Mask TextSpotter v3: Segmentation Proposal Network for R… · code | 78.10 |
| 14 | SPTS v2 | | Jan 2023 | SPTS v2: Single-Point Scene Text Spotting · code | 77.70 |
| 15 | SwinTextSpotter | | Mar 2022 | SwinTextSpotter: Scene Text Spotting via Better Synergy … · code | 77.30 |
| 16 | TextPerceptron | | Feb 2020 | Text Perceptron: Towards End-to-End Arbitrary-Shaped Tex… · code | 76.60 |
| 17 | FOTS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 74.50 |
| 18 | SPTS | | Dec 2021 | SPTS: Single-Point Text Spotting · code | 70.20 |
fps
7 rows
| # | Model | Org | Submitted | Paper / code | fps |
|---|---|---|---|---|---|
| 01 | FAST-T-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 60.90 |
| 02 | FAST-S-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 53.90 |
| 03 | DBNet++ (ResNet-18) (736) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 44 |
| 04 | FAST-B-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 42.70 |
| 05 | FAST-B-896 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 31.80 |
| 06 | FAST-B-1280 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 15.70 |
| 07 | DBNet++ (ResNet-50) (1152) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 10 |
precision
41 rows
| # | Model | Org | Submitted | Paper / code | precision |
|---|---|---|---|---|---|
| 01 | TextFuseNet (ResNeXt-101) | | May 2020 | papers-with-code · code | 93.96 |
| 02 | CharNet H-88 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 92.65 |
| 03 | SBD | | Dec 2019 | Exploring the Capacity of an Orderless Box Discretizatio… · code | 92.10 |
| 04 | EK-Net | OSS · Zhu et al. | Jan 2024 | arxiv | 92 |
| 05 | FOTS MS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 91.85 |
| 06 | DB-ResNet-50 (1152) | | Nov 2019 | Real-time Scene Text Detection with Differentiable Binar… · code | 91.80 |
| 07 | Mask TextSpotter | | Jul 2018 | Mask TextSpotter: An End-to-End Trainable Neural Network… · code | 91.60 |
| 08 | CharNet H-57 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 91.43 |
| 09 | PMTD* | | Mar 2019 | Pyramid Mask Text Detector · code | 91.30 |
| 10 | CharNet H-50 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 91.15 |
| 11 | FOTS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 91 |
| 12 | CharNet H-50 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 90.90 |
| 13 | DBNet++ (ResNet-50) (1152) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 90.90 |
| 14 | GNNets | | Sep 2019 | Geometry Normalization Networks for Accurate Scene Text … · code | 90.41 |
| 15 | TESTR | | Apr 2022 | github-readme | 90.31 |
| 16 | DBNet++ (ResNet-18) (736) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 90.10 |
| 17 | CharNet H-88 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 89.99 |
| 18 | CRAFT | | Apr 2019 | Character Region Awareness for Text Detection · code | 89.80 |
| 19 | FAST-B-1280 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 89.70 |
| 20 | Corner Localization (multi-scale) | | Feb 2018 | Multi-Oriented Scene Text Detection via Corner Localizat… · code | 89.50 |
| 21 | FAST-B-896 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 89.20 |
| 22 | CharNet H-57 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 88.88 |
| 23 | SPCNET | | Nov 2018 | Scene Text Detection with Supervised Pyramid Context Net… · code | 88.70 |
| 24 | Corner-based Region Proposals | | Apr 2018 | Detecting Multi-Oriented Text with Corner-based Region P… · code | 88.70 |
| 25 | FTSN + MNMS | | Sep 2017 | Fused Text Segmentation Networks for Multi-oriented Scen… | 88.60 |
| 26 | FAST-B-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 88 |
| 27 | Quad_MS | | Jan 2018 | TextBoxes++: A Single-Shot Oriented Scene Text Detector · code | 87.80 |
| 28 | PSENet-1s | | Mar 2019 | Shape Robust Text Detection with Progressive Scale Expan… · code | 86.90 |
| 29 | SAST | | Aug 2019 | A Single-Shot Arbitrarily-Shaped Text Detector based on … · code | 86.72 |
| 30 | FAST-S-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 86.30 |
| 31 | FAST-T-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 86 |
| 32 | PixelLink+VGG16 2s | | Jul 2018 | TextSnake: A Flexible Representation for Detecting Text … · code | 85.50 |
| 33 | SLPR | | Jan 2018 | PixelLink: Detecting Scene Text via Instance Segmentatio… · code | 85.50 |
| 34 | TextSnake | | Aug 2019 | Efficient and Accurate Arbitrary-Shaped Text Detection w… · code | 84.90 |
| 35 | PAN | | Apr 2017 | EAST: An Efficient and Accurate Scene Text Detector · code | 84 |
| 36 | EAST + PVANET2x RBOX (single-scale) | | Apr 2017 | EAST: An Efficient and Accurate Scene Text Detector · code | 83.60 |
| 37 | EAST + PVANET2x RBOX (multi-scale) | | Sep 2017 | Single Shot Text Detector with Regional Attention · code | 83.30 |
| 38 | SSTD | | Aug 2017 | WordSup: Exploiting Word Annotations for Character based… | 80 |
| 39 | WordSup (VGG16-synth-icdar) | | Mar 2017 | Detecting Oriented Text in Natural Images by Linking Seg… · code | 79.30 |
| 40 | SegLink | | Apr 2016 | Multi-Oriented Text Detection with Fully Convolutional N… · code | 73.10 |
| 41 | MCLAB_FCN | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 70.80 |
recall
41 rows
| # | Model | Org | Submitted | Paper / code | recall |
|---|---|---|---|---|---|
| 01 | CharNet H-88 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 91.98 |
| 02 | TextFuseNet (ResNeXt-101) | | May 2020 | papers-with-code · code | 90.56 |
| 03 | CharNet H-88 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 90.47 |
| 04 | CharNet H-57 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 90.45 |
| 05 | TESTR | | Apr 2022 | github-readme | 89.70 |
| 06 | CharNet H-50 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 89.44 |
| 07 | CharNet H-57 (multi-scale) | | Oct 2019 | Convolutional Character Networks · code | 88.74 |
| 08 | CharNet H-50 (single-scale) | | Oct 2019 | Convolutional Character Networks · code | 88.30 |
| 09 | SBD | | Dec 2019 | Exploring the Capacity of an Orderless Box Discretizatio… · code | 88.20 |
| 10 | FOTS MS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 87.92 |
| 11 | PMTD* | | Mar 2019 | Pyramid Mask Text Detector · code | 87.43 |
| 12 | SAST | | Aug 2019 | A Single-Shot Arbitrarily-Shaped Text Detector based on … · code | 87.09 |
| 13 | GNNets | | Sep 2019 | Geometry Normalization Networks for Accurate Scene Text … · code | 86.71 |
| 14 | SPCNET | | Nov 2018 | Scene Text Detection with Supervised Pyramid Context Net… · code | 85.80 |
| 15 | FOTS | | Jan 2018 | FOTS: Fast Oriented Text Spotting with a Unified Network · code | 85.17 |
| 16 | FAST-B-1280 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 84.60 |
| 17 | PSENet-1s | | Mar 2019 | Shape Robust Text Detection with Progressive Scale Expan… · code | 84.50 |
| 18 | CRAFT | | Apr 2019 | Character Region Awareness for Text Detection · code | 84.30 |
| 19 | DBNet++ (ResNet-50) (1152) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 83.90 |
| 20 | FAST-B-896 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 83.60 |
| 21 | SLPR | | Jan 2018 | PixelLink: Detecting Scene Text via Instance Segmentatio… · code | 83.60 |
| 22 | DB-ResNet-50 (1152) | | Nov 2019 | Real-time Scene Text Detection with Differentiable Binar… · code | 83.20 |
| 23 | PixelLink+VGG16 2s | | Jul 2018 | TextSnake: A Flexible Representation for Detecting Text … · code | 82 |
| 24 | PAN | | Apr 2017 | EAST: An Efficient and Accurate Scene Text Detector · code | 81.90 |
| 25 | FAST-B-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 81.70 |
| 26 | Mask TextSpotter | | Jul 2018 | Mask TextSpotter: An End-to-End Trainable Neural Network… · code | 81 |
| 27 | Corner-based Region Proposals | | Apr 2018 | Detecting Multi-Oriented Text with Corner-based Region P… · code | 80.70 |
| 28 | TextSnake | | Aug 2019 | Efficient and Accurate Arbitrary-Shaped Text Detection w… · code | 80.40 |
| 29 | EK-Net | OSS · Zhu et al. | Jan 2024 | arxiv | 80.24 |
| 30 | FTSN + MNMS | | Sep 2017 | Fused Text Segmentation Networks for Multi-oriented Scen… | 80 |
| 31 | FAST-S-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 79.80 |
| 32 | Corner Localization (multi-scale) | | Feb 2018 | Multi-Oriented Scene Text Detection via Corner Localizat… · code | 79.70 |
| 33 | Quad_MS | | Jan 2018 | TextBoxes++: A Single-Shot Oriented Scene Text Detector · code | 78.50 |
| 34 | EAST + PVANET2x RBOX (multi-scale) | | Sep 2017 | Single Shot Text Detector with Regional Attention · code | 78.30 |
| 35 | FAST-T-736 | | Nov 2021 | FAST: Faster Arbitrarily-Shaped Text Detector with Minim… · code | 77.90 |
| 36 | DBNet++ (ResNet-18) (736) | | Feb 2022 | Real-Time Scene Text Detection with Differentiable Binar… · code | 77.20 |
| 37 | WordSup (VGG16-synth-icdar) | | Mar 2017 | Detecting Oriented Text in Natural Images by Linking Seg… · code | 77 |
| 38 | SegLink | | Apr 2016 | Multi-Oriented Text Detection with Fully Convolutional N… · code | 76.80 |
| 39 | EAST + PVANET2x RBOX (single-scale) | | Apr 2017 | EAST: An Efficient and Accurate Scene Text Detector · code | 73.50 |
| 40 | SSTD | | Aug 2017 | WordSup: Exploiting Word Annotations for Character based… | 73 |
| 41 | MCLAB_FCN | | Apr 2021 | PGNet: Real-time Arbitrarily-Shaped Text Spotting with P… · code | 43 |
Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
§ 04 · Literature

39 papers tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

§ 06 · Contribute

Have a score that beats this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.

What a submission needs
  • 01 · A public checkpoint or API endpoint
  • 02 · A reproduction script with frozen commit + seed
  • 03 · Declared evaluation environment (Python, deps)
  • 04 · One row per metric declared by this dataset
  • 05 · A contact so we can follow up on discrepancies
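One way to gather these items before submitting is a small manifest, sketched below. The manifest layout, the `build_manifest` helper, and the example URL, commit, contact, and scores are all hypothetical placeholders; the page does not specify a submission format, so treat this as an illustration of the checklist rather than the required schema.

```python
import json
import platform
import random

# Metric names as listed on this page.
DECLARED_METRICS = {
    "accuracy", "f-measure", "f-measure-generic-lexicon",
    "f-measure-strong-lexicon", "f-measure-weak-lexicon",
    "fps", "precision", "recall",
}


def build_manifest(checkpoint_url, commit, seed, scores, contact):
    """Assemble a hypothetical submission record and list uncovered metrics."""
    # Fixes only the stdlib RNG; a real pipeline would also seed its ML framework.
    random.seed(seed)
    return {
        "checkpoint": checkpoint_url,
        "commit": commit,
        "seed": seed,
        "environment": {
            "python": platform.python_version(),
            "platform": platform.platform(),
        },
        "scores": scores,
        "missing_metrics": sorted(DECLARED_METRICS - set(scores)),
        "contact": contact,
    }


# Placeholder values for illustration only.
manifest = build_manifest(
    checkpoint_url="https://example.com/checkpoint.pt",
    commit="0000000",
    seed=7,
    scores={"precision": 90.0, "recall": 80.0, "f-measure": 84.71},
    contact="author@example.com",
)
print(json.dumps(manifest, indent=2))
```

The `missing_metrics` field makes it easy to see which declared metrics a submission does not yet report.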