Codesota · Benchmark · Total-TextHome/Leaderboards/Vision & Documents/Scene Text Detection/Total-Text

Unknown

Total-Text.

Curved text benchmark. 1555 images with polygon annotations.

Paper ↗Leaderboard ↓

§ 01 · Leaderboard

Results by metric.

Found a wrong score or missing run?

Use row edits to send a sourced correction into moderation.

Add / edit result ↗Report issue ↗

Fps

Fps is the reported evaluation metric for Total-Text. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Fpsverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	FAST-T-448 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	152.8	2021	Paper ↗Code ↗	Looks wrong?
02	FAST-S-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	115.5	2021	Paper ↗Code ↗	Looks wrong?
03	FAST-B-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	93.2	2021	Paper ↗Code ↗	Looks wrong?

precision

Precision is the reported evaluation metric for Total-Text. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for precisionverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	DAT-SEG Segmentation variant. ICML 2024. Table 1 in arxiv:2405.19765.	verified	95.04	2024	Paper ↗	Looks wrong?
02	DAT-DET Detection variant. ICML 2024. Table 1 in arxiv:2405.19765.	verified	93.98	2024	Paper ↗	Looks wrong?
03	MixNet From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	93	2023	Paper ↗Code ↗	Looks wrong?
04	ERRNet (with pre-training) With pre-training. AAAI 2025. Table 1 in arxiv:2412.14692.	verified	92.6	2024	Paper ↗	Looks wrong?
05	SRFormer (ResNet-50) From paper: SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression	verified	92.2	2023	Paper ↗Code ↗	Looks wrong?
06	DPText-DETR (ResNet-50) From paper: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer	verified	91.8	2022	Paper ↗Code ↗	Looks wrong?
07	LRANet With pre-training. AAAI 2024 Oral. Table 7 in arxiv:2412.14692.	verified	90.3	2023	Paper ↗Code ↗	Looks wrong?
08	ERRNet Without pre-training. AAAI 2025. Table 1 in arxiv:2412.14692.	verified	90.1	2024	Paper ↗	Looks wrong?
09	FAST-B-800 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	90	2021	Paper ↗Code ↗	Looks wrong?
10	FAST-B-640 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	89.9	2021	Paper ↗Code ↗	Looks wrong?
11	CharNet H-88 From paper: Convolutional Character Networks	verified	89.9	2019	Paper ↗Code ↗	Looks wrong?
12	I3CL + SSL(ResNet-50) From paper: I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection	verified	89.8	2021	Paper ↗Code ↗	Looks wrong?
13	FAST-B-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	89.6	2021	Paper ↗Code ↗	Looks wrong?
14	TextMamba ResNet-50 backbone. Table I in paper. arxiv:2512.06657	verified	89.5	2024	Paper ↗	Looks wrong?
15	PAN-640 From paper: Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network	verified	89.3	2019	Paper ↗Code ↗	Looks wrong?
16	TextFuseNet (ResNeXt-101) From paper: TextFuseNet: Scene Text Detection with Richer Fused Features	verified	89.2	2020	Paper ↗Code ↗	Looks wrong?
17	DBNet++ (ResNet-50) (800) From paper: Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion	verified	88.9	2022	Paper ↗Code ↗	Looks wrong?
18	FAST-S-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	88.3	2021	Paper ↗Code ↗	Looks wrong?
19	CharNet H-88 (multi-scale) From paper: Convolutional Character Networks	verified	88	2019	Paper ↗Code ↗	Looks wrong?
20	CRAFT From paper: Character Region Awareness for Text Detection	verified	87.6	2019	Paper ↗Code ↗	Looks wrong?
21	DBNet++ (ResNet-18) (800) From paper: Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion	verified	87.4	2022	Paper ↗Code ↗	Looks wrong?
22	FAST-T-448 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	86.5	2021	Paper ↗Code ↗	Looks wrong?
23	FTSN From paper: Fused Text Segmentation Networks for Multi-oriented Scene Text Detection	verified	84.7	2017	Paper ↗	Looks wrong?
24	PSENet-4s From paper: Shape Robust Text Detection with Progressive Scale Expansion Network	verified	84.5	2019	Paper ↗Code ↗	Looks wrong?
25	SPCNET From paper: Scene Text Detection with Supervised Pyramid Context Network	verified	83	2018	Paper ↗Code ↗	Looks wrong?
26	TextSnake From paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes	verified	82.7	2018	Paper ↗Code ↗	Looks wrong?
27	TextFiled From paper: TextField: Learning A Deep Direction Field for Irregular Scene Text Detection	verified	81.2	2018	Paper ↗Code ↗	Looks wrong?

F Measure

F Measure is the reported evaluation metric for Total-Text. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for F Measureverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	DAT-SEG Segmentation variant, new SOTA on Total-Text. P=95.04, R=89.16. ICML 2024. Table 1 in arxiv:2405.19765.	verified	92.01	2024	Paper ↗	Looks wrong?
02	DAT-DET Detection variant. P=93.98, R=88.17. ICML 2024. Table 1 in arxiv:2405.19765.	verified	90.98	2024	Paper ↗	Looks wrong?
03	MixNet From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	90.5	2023	Paper ↗Code ↗	Looks wrong?
04	SRFormer (ResNet-50) From paper: SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression	verified	90	2023	Paper ↗Code ↗	Looks wrong?
05	ERRNet (with pre-training) With pre-training. P=92.6, R=87.3. AAAI 2025. Table 1 in arxiv:2412.14692.	verified	89.9	2024	Paper ↗	Looks wrong?
06	TextMamba ResNet-50 backbone. Table I in paper. arxiv:2512.06657	verified	89.2	2024	Paper ↗	Looks wrong?
07	LRANet With pre-training. P=90.3, R=87.8. AAAI 2024 Oral. Table 7 in arxiv:2412.14692.	verified	89	2023	Paper ↗Code ↗	Looks wrong?
08	DPText-DETR (ResNet-50) From paper: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer	verified	89	2022	Paper ↗Code ↗	Looks wrong?
09	ERRNet Without pre-training. P=90.1, R=86.1. AAAI 2025. Table 1 in arxiv:2412.14692.	verified	88.1	2024	Paper ↗	Looks wrong?
10	FAST-B-800 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	87.5	2021	Paper ↗Code ↗	Looks wrong?
11	TextFuseNet (ResNeXt-101) From paper: TextFuseNet: Scene Text Detection with Richer Fused Features	verified	87.5	2020	Paper ↗Code ↗	Looks wrong?
12	I3CL + SSL(ResNet-50) From paper: I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection	verified	86.9	2021	Paper ↗Code ↗	Looks wrong?
13	CharNet H-88 (multi-scale) From paper: Convolutional Character Networks	verified	86.5	2019	Paper ↗Code ↗	Looks wrong?
14	FAST-B-640 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	86.4	2021	Paper ↗Code ↗	Looks wrong?
15	DBNet++ (ResNet-50) (800) From paper: Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion	verified	86	2022	Paper ↗Code ↗	Looks wrong?
16	FAST-B-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	85.8	2021	Paper ↗Code ↗	Looks wrong?
17	SA-Text From paper: A method for detecting text of arbitrary shapes in natural scenes that improves text spotting	verified	85.6	2019	Paper ↗	Looks wrong?
18	CharNet H-88 From paper: Convolutional Character Networks	verified	85.6	2019	Paper ↗Code ↗	Looks wrong?
19	PAN-640 From paper: Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network	verified	85	2019	Paper ↗Code ↗	Looks wrong?
20	FAST-S-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	84.9	2021	Paper ↗Code ↗	Looks wrong?
21	DB-ResNet-50 (800) From paper: Real-time Scene Text Detection with Differentiable Binarization	verified	84.7	2019	Paper ↗Code ↗	Looks wrong?
22	TextCohesion From paper: TextCohesion: Detecting Text for Arbitrary Shapes	verified	84.6	2019	Paper ↗	Looks wrong?
23	CRAFT From paper: Character Region Awareness for Text Detection	verified	83.6	2019	Paper ↗Code ↗	Looks wrong?
24	DBNet++ (ResNet-18) (800) From paper: Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion	verified	83.3	2022	Paper ↗Code ↗	Looks wrong?
25	SPCNET From paper: Scene Text Detection with Supervised Pyramid Context Network	verified	82.9	2018	Paper ↗Code ↗	Looks wrong?
26	FAST-T-448 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	81.6	2021	Paper ↗Code ↗	Looks wrong?
27	FTSN From paper: Fused Text Segmentation Networks for Multi-oriented Scene Text Detection	verified	81.3	2017	Paper ↗	Looks wrong?
28	TextFiled From paper: TextField: Learning A Deep Direction Field for Irregular Scene Text Detection	verified	80.6	2018	Paper ↗Code ↗	Looks wrong?
29	PSENet-4s From paper: Shape Robust Text Detection with Progressive Scale Expansion Network	verified	79.6	2019	Paper ↗Code ↗	Looks wrong?
30	TextSnake From paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes	verified	78.4	2018	Paper ↗Code ↗	Looks wrong?

F Measure Full Lexicon

F Measure Full Lexicon is the reported evaluation metric for Total-Text. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for F Measure Full Lexiconverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	DeepSolo (ViTAEv2-S, TextOCR) From paper: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting	verified	89.6	2022	Paper ↗Code ↗	Looks wrong?
02	DeepSolo (ResNet-50, TextOCR) From paper: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting	verified	88.7	2022	Paper ↗Code ↗	Looks wrong?
03	DeepSolo (ResNet-50) From paper: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting	verified	87	2022	Paper ↗Code ↗	Looks wrong?
04	UNITS From paper: Towards Unified Scene Text Spotting based on Sequence Generation	verified	86	2023	Paper ↗Code ↗	Looks wrong?
05	A3S From paper: A3S: Adversarial learning of semantic representations for Scene-Text Spotting	verified	85.1	2023	Paper ↗	Looks wrong?
06	SwinTextSpotter From paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition	verified	84.1	2022	Paper ↗Code ↗	Looks wrong?
07	TESTR From paper: Text Spotting Transformers	verified	83.9	2022	Paper ↗Code ↗	Looks wrong?
08	MANGO From paper: MANGO: A Mask Attention Guided One-Stage Scene Text Spotter	verified	83.6	2020	Paper ↗Code ↗	Looks wrong?
09	DEER From paper: DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting	verified	83.3	2022	Paper ↗	Looks wrong?
10	GLASS From paper: GLASS: Global to Local Attention for Scene-Text Spotting	verified	83	2022	Paper ↗Code ↗	Looks wrong?
11	MaskTextSpotter v3 From paper: Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting	verified	78.4	2020	Paper ↗Code ↗	Looks wrong?
12	ABCNet v2 From paper: ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting	verified	78.1	2021	Paper ↗Code ↗	Looks wrong?

recall

Recall is the reported evaluation metric for Total-Text. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for recallverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	DAT-SEG Segmentation variant. ICML 2024. Table 1 in arxiv:2405.19765.	verified	89.16	2024	Paper ↗	Looks wrong?
02	TextMamba ResNet-50 backbone. Table I in paper. arxiv:2512.06657	verified	88.8	2024	Paper ↗	Looks wrong?
03	DAT-DET Detection variant. ICML 2024. Table 1 in arxiv:2405.19765.	verified	88.17	2024	Paper ↗	Looks wrong?
04	MixNet From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	88.1	2023	Paper ↗Code ↗	Looks wrong?
05	SRFormer (ResNet-50) From paper: SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression	verified	87.9	2023	Paper ↗Code ↗	Looks wrong?
06	LRANet With pre-training. AAAI 2024 Oral. Table 7 in arxiv:2412.14692.	verified	87.8	2023	Paper ↗Code ↗	Looks wrong?
07	ERRNet (with pre-training) With pre-training. AAAI 2025. Table 1 in arxiv:2412.14692.	verified	87.3	2024	Paper ↗	Looks wrong?
08	DPText-DETR (ResNet-50) From paper: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer	verified	86.4	2022	Paper ↗Code ↗	Looks wrong?
09	ERRNet Without pre-training. AAAI 2025. Table 1 in arxiv:2412.14692.	verified	86.1	2024	Paper ↗	Looks wrong?
10	TextFuseNet (ResNeXt-101) From paper: TextFuseNet: Scene Text Detection with Richer Fused Features	verified	85.8	2020	Paper ↗Code ↗	Looks wrong?
11	FAST-B-800 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	85.2	2021	Paper ↗Code ↗	Looks wrong?
12	CharNet H-88 (multi-scale) From paper: Convolutional Character Networks	verified	85	2019	Paper ↗Code ↗	Looks wrong?
13	I3CL + SSL(ResNet-50) From paper: I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection	verified	84.2	2021	Paper ↗Code ↗	Looks wrong?
14	FAST-B-640 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	83.2	2021	Paper ↗Code ↗	Looks wrong?
15	DBNet++ (ResNet-50) (800) From paper: Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion	verified	83.2	2022	Paper ↗Code ↗	Looks wrong?
16	SPCNET From paper: Scene Text Detection with Supervised Pyramid Context Network	verified	82.8	2018	Paper ↗Code ↗	Looks wrong?
17	FAST-B-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	82.4	2021	Paper ↗Code ↗	Looks wrong?
18	CharNet H-88 From paper: Convolutional Character Networks	verified	81.7	2019	Paper ↗Code ↗	Looks wrong?
19	FAST-S-512 From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	81.7	2021	Paper ↗Code ↗	Looks wrong?
20	PAN-640 From paper: Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network	verified	81	2019	Paper ↗Code ↗	Looks wrong?
21	TextFiled From paper: TextField: Learning A Deep Direction Field for Irregular Scene Text Detection	verified	79.9	2018	Paper ↗Code ↗	Looks wrong?
22	CRAFT From paper: Character Region Awareness for Text Detection	verified	79.9	2019	Paper ↗Code ↗	Looks wrong?
23	DBNet++ (ResNet-18) (800) From paper: Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion	verified	79.6	2022	Paper ↗Code ↗	Looks wrong?

F Measure No Lexicon

F Measure No Lexicon is the reported evaluation metric for Total-Text. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for F Measure No Lexiconverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	DeepSolo (ViTAEv2-S, TextOCR) From paper: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting	verified	83.6	2022	Paper ↗Code ↗	Looks wrong?
02	DeepSolo (ResNet-50, TextOCR) From paper: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting	verified	82.5	2022	Paper ↗Code ↗	Looks wrong?
03	DeepSolo (ResNet-50) From paper: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting	verified	79.7	2022	Paper ↗Code ↗	Looks wrong?
04	A3S From paper: A3S: Adversarial learning of semantic representations for Scene-Text Spotting	verified	79.4	2023	Paper ↗	Looks wrong?
05	UNITS From paper: Towards Unified Scene Text Spotting based on Sequence Generation	verified	78.7	2023	Paper ↗Code ↗	Looks wrong?

§ 04 · Submit a result

Add to the leaderboard.

← Back to Scene Text Detection