Scene Text Detection2015en

ICDAR 2015 Incidental Scene Text

1000 training + 500 test images captured with wearable cameras. Industry standard for scene text detection.

Samples:1,500
Metrics:precision, recall, f1
Paper / WebsiteDownload

accuracy

#ModelScorePaper / CodeDate
1
PGNet-A
62.3Apr 2021
2
PGNet-E
57.4
-

f-measure

#ModelScorePaper / CodeDate
1
TextFuseNet (ResNeXt-101)
92.23
TextFuseNet: Scene Text Detection with Richer Fused FeaturesCode
May 2020
2
CharNet H-88 (multi-scale)
91.55Oct 2019
3
CharNet H-88 (single-scale)
90.97Oct 2019
4
CharNet H-50 (multi-scale)
90.16Oct 2019
5
SBD
90.1Dec 2019
6
CharNet H-57 (multi-scale)
90.06Oct 2019
7
FOTS MS
89.84Jan 2018
8
CharNet H-50 (single-scale)
89.7Oct 2019
9
CharNet H-57 (single-scale)
89.66Oct 2019
10
PMTD*
89.33Mar 2019
11
GNNets
88.52Sep 2019
12
FOTS
87.99Jan 2018
13
DB-ResNet-50 (1152)
87.3Nov 2019
14
DBNet++ (ResNet-50) (1152)
87.3Feb 2022
15
SPCNET
87.2Nov 2018
16
FAST-B-1280
87.1Nov 2021
17
SAST
86.91Aug 2019
18
CRAFT
86.9Apr 2019
19
FAST-B-896
86.3Nov 2021
20
Mask TextSpotter
86Jul 2018
21
PSENet-1s
85.7Mar 2019
22
FAST-B-736
84.7Nov 2021
23
SLPR
84.5Jan 2018
24
Corner-based Region Proposals
84.5Apr 2018
25
Corner Localization (multi-scale)
84.3Feb 2018
26
FTSN + MNMS
84.1Sep 2017
27
PixelLink+VGG16 2s
83.7Jul 2018
28
DBNet++ (ResNet-18) (736)
83.1Feb 2022
29
Quad_MS
82.9Jan 2018
30
FAST-S-736
82.9Nov 2021
31
PAN
82.9Apr 2017
32
TextSnake
82.6Aug 2019
33
FAST-T-736
81.7Nov 2021
34
EAST + PVANET2x RBOX (multi-scale)
80.7Sep 2017
35
EAST + PVANET2x RBOX (single-scale)
78.2Apr 2017
36
WordSup (VGG16-synth-icdar)
78.2Mar 2017
37
SSTD
77Aug 2017
38
SegLink
75Apr 2016
39
MCLAB_FCN
53.6Apr 2021

f-measure-generic-lexicon

#ModelScorePaper / CodeDate
1
UNITS
80.3Apr 2023
2
A3S
79.6Feb 2023
3
DeepSolo (ViTAEv2-S, TextOCR)
79.5Nov 2022
4
DeepSolo (ResNet-50, TextOCR)
79.1Nov 2022
5
DeepSolo (ResNet-50)
76.9Nov 2022
6
GLASS
76.3Aug 2022
7
SRTS
74.5Jul 2022
8
MaskTextSpotter v3
74.2Jul 2020
9
TESTR
73.6Apr 2022
10
ABCNet v2
73May 2021
11
SPTS v2
72.6Jan 2023
12
SwinTextSpotter
70.5Mar 2022
13
MANGO
67.3Dec 2020
14
SPTS
65.8Dec 2021
15
TextDragon
65.2
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
Oct 2019
16
TextPerceptron
65.1Feb 2020
17
PGNet
63.5Apr 2021
18
FOTS
62.2Jan 2018

f-measure-strong-lexicon

#ModelScorePaper / CodeDate
1
UNITS
89Apr 2023
2
DeepSolo (ViTAEv2-S, TextOCR)
88.1Nov 2022
3
DeepSolo (ResNet-50, TextOCR)
88Nov 2022
4
DeepSolo (ResNet-50)
86.8Nov 2022
5
SRTS
85.6Jul 2022
6
TESTR
85.2Apr 2022
7
A3S
84.8Feb 2023
8
GLASS
84.7Aug 2022
9
SwinTextSpotter
83.9Mar 2022
10
FOTS
83.6Jan 2018
11
MaskTextSpotter v3
83.3Jul 2020
12
PGNet
83.3Apr 2021
13
ABCNet v2
82.7May 2021
14
TextDragon
82.5
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
Oct 2019
15
SPTS v2
82.3Jan 2023
16
MANGO
81.8Dec 2020
17
TextPerceptron
80.5Feb 2020
18
SPTS
77.5Dec 2021

f-measure-weak-lexicon

#ModelScorePaper / CodeDate
1
UNITS
84.1Apr 2023
2
DeepSolo (ViTAEv2-S, TextOCR)
83.9Nov 2022
3
A3S
83.7Feb 2023
4
DeepSolo (ResNet-50, TextOCR)
83.5Nov 2022
5
DeepSolo (ResNet-50)
81.9Nov 2022
6
SRTS
81.7Jul 2022
7
GLASS
80.1Aug 2022
8
TESTR
79.4Apr 2022
9
MANGO
78.9Dec 2020
10
ABCNet v2
78.5May 2021
11
TextDragon
78.3
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
Oct 2019
12
PGNet
78.3Apr 2021
13
MaskTextSpotter v3
78.1Jul 2020
14
SPTS v2
77.7Jan 2023
15
SwinTextSpotter
77.3Mar 2022
16
TextPerceptron
76.6Feb 2020
17
FOTS
74.5Jan 2018
18
SPTS
70.2Dec 2021

fps

precision

#ModelScorePaper / CodeDate
1
TextFuseNet (ResNeXt-101)
93.96
TextFuseNet: Scene Text Detection with Richer Fused FeaturesCode
May 2020
2
CharNet H-88 (multi-scale)
92.65Oct 2019
3
SBD
92.1Dec 2019
4
FOTS MS
91.85Jan 2018
5
DB-ResNet-50 (1152)
91.8Nov 2019
6
Mask TextSpotter
91.6Jul 2018
7
CharNet H-57 (multi-scale)
91.43Oct 2019
8
PMTD*
91.3Mar 2019
9
CharNet H-50 (single-scale)
91.15Oct 2019
10
FOTS
91Jan 2018
11
CharNet H-50 (multi-scale)
90.9Oct 2019
12
DBNet++ (ResNet-50) (1152)
90.9Feb 2022
13
GNNets
90.41Sep 2019
14
DBNet++ (ResNet-18) (736)
90.1Feb 2022
15
CharNet H-88 (single-scale)
89.99Oct 2019
16
CRAFT
89.8Apr 2019
17
FAST-B-1280
89.7Nov 2021
18
Corner Localization (multi-scale)
89.5Feb 2018
19
FAST-B-896
89.2Nov 2021
20
CharNet H-57 (single-scale)
88.88Oct 2019
21
SPCNET
88.7Nov 2018
22
Corner-based Region Proposals
88.7Apr 2018
23
FTSN + MNMS
88.6Sep 2017
24
FAST-B-736
88Nov 2021
25
Quad_MS
87.8Jan 2018
26
PSENet-1s
86.9Mar 2019
27
SAST
86.72Aug 2019
28
FAST-S-736
86.3Nov 2021
29
FAST-T-736
86Nov 2021
30
SLPR
85.5Jan 2018
31
PixelLink+VGG16 2s
85.5Jul 2018
32
TextSnake
84.9Aug 2019
33
PAN
84Apr 2017
34
EAST + PVANET2x RBOX (single-scale)
83.6Apr 2017
35
EAST + PVANET2x RBOX (multi-scale)
83.3Sep 2017
36
SSTD
80Aug 2017
37
WordSup (VGG16-synth-icdar)
79.3Mar 2017
38
SegLink
73.1Apr 2016
39
MCLAB_FCN
70.8Apr 2021

recall

#ModelScorePaper / CodeDate
1
CharNet H-88 (single-scale)
91.98Oct 2019
2
TextFuseNet (ResNeXt-101)
90.56
TextFuseNet: Scene Text Detection with Richer Fused FeaturesCode
May 2020
3
CharNet H-88 (multi-scale)
90.47Oct 2019
4
CharNet H-57 (single-scale)
90.45Oct 2019
5
CharNet H-50 (multi-scale)
89.44Oct 2019
6
CharNet H-57 (multi-scale)
88.74Oct 2019
7
CharNet H-50 (single-scale)
88.3Oct 2019
8
SBD
88.2Dec 2019
9
FOTS MS
87.92Jan 2018
10
PMTD*
87.43Mar 2019
11
SAST
87.09Aug 2019
12
GNNets
86.71Sep 2019
13
SPCNET
85.8Nov 2018
14
FOTS
85.17Jan 2018
15
FAST-B-1280
84.6Nov 2021
16
PSENet-1s
84.5Mar 2019
17
CRAFT
84.3Apr 2019
18
DBNet++ (ResNet-50) (1152)
83.9Feb 2022
19
SLPR
83.6Jan 2018
20
FAST-B-896
83.6Nov 2021
21
DB-ResNet-50 (1152)
83.2Nov 2019
22
PixelLink+VGG16 2s
82Jul 2018
23
PAN
81.9Apr 2017
24
FAST-B-736
81.7Nov 2021
25
Mask TextSpotter
81Jul 2018
26
Corner-based Region Proposals
80.7Apr 2018
27
TextSnake
80.4Aug 2019
28
FTSN + MNMS
80Sep 2017
29
FAST-S-736
79.8Nov 2021
30
Corner Localization (multi-scale)
79.7Feb 2018
31
Quad_MS
78.5Jan 2018
32
EAST + PVANET2x RBOX (multi-scale)
78.3Sep 2017
33
FAST-T-736
77.9Nov 2021
34
DBNet++ (ResNet-18) (736)
77.2Feb 2022
35
WordSup (VGG16-synth-icdar)
77Mar 2017
36
SegLink
76.8Apr 2016
37
EAST + PVANET2x RBOX (single-scale)
73.5Apr 2017
38
SSTD
73Aug 2017
39
MCLAB_FCN
43Apr 2021

Related Papers39

DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Nov 2022Models: DeepSolo (ViTAEv2-S, TextOCR), DeepSolo (ResNet-50, TextOCR), DeepSolo (ResNet-50)
Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
Feb 2022Models: DBNet++ (ResNet-50) (1152), DBNet++ (ResNet-18) (736)
Convolutional Character Networks
Oct 2019Models: CharNet H-88 (multi-scale), CharNet H-88 (single-scale), CharNet H-50 (multi-scale) +3 more
Single Shot Text Detector with Regional Attention
Sep 2017Models: EAST + PVANET2x RBOX (multi-scale)
EAST: An Efficient and Accurate Scene Text Detector
Apr 2017Models: PAN, EAST + PVANET2x RBOX (single-scale)

Other Scene Text Detection Datasets