Codesota · Benchmark · scut-ctw1500

scut-ctw1500.

scut-ctw1500 is a scene text detection and spotting benchmark indexed on Codesota under Document OCR. This page tracks published model results, top scores per metric, and the SOTA timeline for scut-ctw1500.

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

FPS

FPS (frames per second) is one of the evaluation metrics reported for scut-ctw1500: the inference speed of a model. Codesota tracks published model scores on this metric so readers can compare results across sources and model families.

Higher is better

Trust tiers for FPS: verified, paper, vendor, community, unverified

In each table, muted rows were not state of the art when published: an earlier or same-year result had already scored better.

Rank	Model	Source / notes	Trust	Score	Year	Links
01	FAST-T-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	129.1	2021	Paper, Code
02	FAST-S-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	112.9	2021	Paper, Code
03	FAST-B-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	92.6	2021	Paper, Code
04	FAST-B-640	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	66.5	2021	Paper, Code
05	PAN	From paper: Mask R-CNN with Pyramid Attention Network for Scene Text Detection	verified	65.2	2018	Paper
06	MixNet	From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	15.2	2023	Paper, Code
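
The muted-row rule noted above the table can be read as a simple filter. The Python sketch below is one interpretation, not Codesota's implementation: the `Result` type and the `is_muted` helper are hypothetical names, it assumes higher scores are better, it compares only the rows listed in this table, and it treats ties as not muted.

```python
from typing import NamedTuple


class Result(NamedTuple):
    model: str
    score: float
    year: int


def is_muted(row: Result, rows: list[Result]) -> bool:
    """True if an earlier or same-year row already scored strictly higher."""
    return any(o.year <= row.year and o.score > row.score for o in rows)


# FPS rows copied from the table above.
fps_rows = [
    Result("FAST-T-512", 129.1, 2021),
    Result("FAST-S-512", 112.9, 2021),
    Result("FAST-B-512", 92.6, 2021),
    Result("FAST-B-640", 66.5, 2021),
    Result("PAN", 65.2, 2018),
    Result("MixNet", 15.2, 2023),
]

# Rows that were state of the art when published (i.e. not muted).
sota_when_published = [r.model for r in fps_rows if not is_muted(r, fps_rows)]
print(sota_when_published)
```

Under these assumptions only FAST-T-512 and PAN count as state of the art when published; the other four rows would be muted.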

Precision

Precision is one of the evaluation metrics reported for scut-ctw1500: the share of predicted text regions that match a ground-truth region. Codesota tracks published model scores on this metric so readers can compare results across sources and model families.

Higher is better

Trust tiers for Precision: verified, paper, vendor, community, unverified

Rank	Model	Source / notes	Trust	Score	Year	Links
01	DeepSolo (with pre-training)	Pre-trained on Synth150K+MLT17+IC13+IC15. Source: Table 7, arxiv:2305.19957	verified	92.5	2022	Paper, Code
02	DPText-DETR (ResNet50)	From paper: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer	verified	91.7	2022	Paper, Code
03	SRFormer (ResNet-50)	From paper: SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression	verified	91.6	2023	Paper, Code
04	MixNet	From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	91.4	2023	Paper, Code
05	TextMamba	ResNet-50 backbone. Source: Table I, arxiv:2512.06657	verified	91	2024	Paper
06	TextFuseNet (ResNeXt-101)	From paper: TextFuseNet: Scene Text Detection with Richer Fused Features	verified	89.7	2020	Paper, Code
07	I3CL + SSL	From paper: I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection	verified	88.4	2021	Paper, Code
08	EK-Net	ResNet-18 backbone. Source: Table 3, arxiv:2401.11704	verified	87.85	2024	Paper
09	FAST-B-640	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	87.8	2021	Paper, Code
10	PAN	From paper: Mask R-CNN with Pyramid Attention Network for Scene Text Detection	verified	86.8	2018	Paper
11	PAN-640	From paper: Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network	verified	86.4	2019	Paper, Code
12	CRAFT	From paper: Character Region Awareness for Text Detection	verified	86	2019	Paper, Code
13	FAST-B-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	85.7	2021	Paper, Code
14	FAST-S-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	85.6	2021	Paper, Code
15	FAST-T-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	85.5	2021	Paper, Code
16	PSENet-1s	From paper: Shape Robust Text Detection with Progressive Scale Expansion Network	verified	82.5	2018	Paper, Code, Source
17	SLPR	From paper: Sliding Line Point Regression for Shape Robust Scene Text Detection	verified	80.1	2018	Paper, Code
18	TextSnake	From paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes	verified	67.9	2018	Paper, Code

F Measure

F Measure is one of the evaluation metrics reported for scut-ctw1500: the harmonic mean of detection precision and recall. Codesota tracks published model scores on this metric so readers can compare results across sources and model families.

Higher is better

Trust tiers for F Measure: verified, paper, vendor, community, unverified

Rank	Model	Source / notes	Trust	Score	Year	Links
01	MixNet	From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	89.8	2023	Paper, Code
02	TextMamba	ResNet-50 backbone. Source: Table I, arxiv:2512.06657	verified	89.7	2024	Paper
03	SRFormer (ResNet-50)	From paper: SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression	verified	89.6	2023	Paper, Code
04	DeepSolo (with pre-training)	Pre-trained on Synth150K+MLT17+IC13+IC15. P=92.5, R=86.3. Source: Table 7, arxiv:2305.19957	verified	89.3	2022	Paper, Code
05	DPText-DETR (ResNet50)	From paper: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer	verified	88.8	2022	Paper, Code
06	TextFuseNet (ResNeXt-101)	From paper: TextFuseNet: Scene Text Detection with Richer Fused Features	verified	87.4	2020	Paper, Code
07	I3CL + SSL	From paper: I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection	verified	86.5	2021	Paper, Code
08	EK-Net	ResNet-18 backbone, 40.13 FPS. Source: Table 3, arxiv:2401.11704	verified	85.75	2024	Paper
09	PAN	From paper: Mask R-CNN with Pyramid Attention Network for Scene Text Detection	verified	85	2018	Paper
10	FAST-B-640	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	84.2	2021	Paper, Code
11	PAN-640	From paper: Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network	verified	83.7	2019	Paper, Code
12	CRAFT	From paper: Character Region Awareness for Text Detection	verified	83.5	2019	Paper, Code
13	DB-ResNet50 (1024)	From paper: Real-time Scene Text Detection with Differentiable Binarization	verified	83.4	2019	Paper, Code
14	FAST-B-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	82.9	2021	Paper, Code
15	FAST-S-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	82	2021	Paper, Code
16	FAST-T-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	81.5	2021	Paper, Code
17	PSENet-1s	From paper: Shape Robust Text Detection with Progressive Scale Expansion Network	verified	81.17	2018	Paper, Code, Source
18	TextSnake	From paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes	verified	75.6	2018	Paper, Code
19	SLPR	From paper: Sliding Line Point Regression for Shape Robust Scene Text Detection	verified	74.8	2018	Paper, Code
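
As a consistency check on the rows above, F-measure is conventionally the harmonic mean of precision and recall. The Python sketch below assumes that standard definition (the page does not state which formula each paper used) and recomputes a few scores from the Precision and Recall values listed on this page; `f_measure` is a hypothetical helper name.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (model, precision, recall, listed F-measure), copied from this page's tables.
rows = [
    ("DeepSolo (with pre-training)", 92.5, 86.3, 89.3),
    ("MixNet", 91.4, 88.3, 89.8),
    ("TextMamba", 91.0, 88.5, 89.7),
]

for model, p, r, listed in rows:
    computed = f_measure(p, r)
    print(f"{model}: computed {computed:.1f}, listed {listed}")
```

For these three rows the recomputed value agrees with the listed score to one decimal place. Small differences on other rows are possible, since papers may compute F-measure from unrounded precision and recall.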

Recall

Recall is one of the evaluation metrics reported for scut-ctw1500: the share of ground-truth text regions that are matched by a prediction. Codesota tracks published model scores on this metric so readers can compare results across sources and model families.

Higher is better

Trust tiers for Recall: verified, paper, vendor, community, unverified

Rank	Model	Source / notes	Trust	Score	Year	Links
01	TextMamba	ResNet-50 backbone. Source: Table I, arxiv:2512.06657	verified	88.5	2024	Paper
02	MixNet	From paper: MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild	verified	88.3	2023	Paper, Code
03	SRFormer (ResNet-50)	From paper: SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression	verified	87.7	2023	Paper, Code
04	DeepSolo (with pre-training)	Pre-trained on Synth150K+MLT17+IC13+IC15. Source: Table 7, arxiv:2305.19957	verified	86.3	2022	Paper, Code
05	DPText-DETR (ResNet50)	From paper: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer	verified	86.2	2022	Paper, Code
06	TextSnake	From paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes	verified	85.3	2018	Paper, Code
07	TextFuseNet (ResNeXt-101)	From paper: TextFuseNet: Scene Text Detection with Richer Fused Features	verified	85.1	2020	Paper, Code
08	I3CL + SSL	From paper: I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection	verified	84.6	2021	Paper, Code
09	EK-Net	ResNet-18 backbone. Source: Table 3, arxiv:2401.11704	verified	83.74	2024	Paper
10	PAN	From paper: Mask R-CNN with Pyramid Attention Network for Scene Text Detection	verified	83.2	2018	Paper
11	PAN-640	From paper: Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network	verified	81.2	2019	Paper, Code
12	CRAFT	From paper: Character Region Awareness for Text Detection	verified	81.1	2019	Paper, Code
13	FAST-B-640	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	80.9	2021	Paper, Code
14	FAST-B-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	80.2	2021	Paper, Code
15	PSENet-1s	From paper: Shape Robust Text Detection with Progressive Scale Expansion Network	verified	79.89	2018	Paper, Code, Source
16	FAST-S-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	78.7	2021	Paper, Code
17	FAST-T-512	From paper: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation	verified	77.9	2021	Paper, Code
18	SLPR	From paper: Sliding Line Point Regression for Shape Robust Scene Text Detection	verified	70.1	2018	Paper, Code

F Measure Full Lexicon

F Measure Full Lexicon is one of the evaluation metrics reported for scut-ctw1500: the end-to-end text spotting F-measure when recognition is aided by the full lexicon. Codesota tracks published model scores on this metric so readers can compare results across sources and model families.

Higher is better

Trust tiers for F Measure Full Lexicon: verified, paper, vendor, community, unverified

Rank	Model	Source / notes	Trust	Score	Year	Links
01	SPTS	From paper: SPTS: Single-Point Text Spotting	verified	83.8	2021	Paper, Code
02	A3S	From paper: A3S: Adversarial learning of semantic representations for Scene-Text Spotting	verified	82.3	2023	Paper
03	TESTR	From paper: Text Spotting Transformers	verified	81.5	2022	Paper, Code
04	DeepSolo (ResNet-50)	From paper: DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting	verified	81.4	2023	Paper, Code
05	ABINet++	From paper: ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting	verified	80.3	2022	Paper, Code
06	TPSNet	From paper: TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation	verified	79.2	2021	Paper, Code
07	MANGO	From paper: MANGO: A Mask Attention Guided One-Stage Scene Text Spotter	verified	78.7	2020	Paper, Code
08	ABCNet v2	From paper: ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting	verified	77.2	2021	Paper, Code
09	SwinTextSpotter	From paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition	verified	77	2022	Paper, Code
10	TextDragon	From paper: TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting	verified	72.4	2019	Paper

F Measure No Lexicon

F Measure No Lexicon is one of the evaluation metrics reported for scut-ctw1500: the end-to-end text spotting F-measure when no lexicon is used. Codesota tracks published model scores on this metric so readers can compare results across sources and model families.

Higher is better

Trust tiers for F Measure No Lexicon: verified, paper, vendor, community, unverified

Rank	Model	Source / notes	Trust	Score	Year	Links
01	A3S	From paper: A3S: Adversarial learning of semantic representations for Scene-Text Spotting	verified	64.4	2023	Paper
02	DeepSolo (ResNet-50)	From paper: DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting	verified	64.2	2023	Paper, Code
03	SPTS	From paper: SPTS: Single-Point Text Spotting	verified	63.6	2021	Paper, Code
04	ABINet++	From paper: ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting	verified	60.2	2022	Paper, Code
05	TPSNet	From paper: TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation	verified	59.7	2021	Paper, Code
06	MANGO	From paper: MANGO: A Mask Attention Guided One-Stage Scene Text Spotter	verified	58.9	2020	Paper, Code
07	ABCNet v2	From paper: ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting	verified	57.5	2021	Paper, Code
08	TextPerceptron	From paper: Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting	verified	57	2020	Paper, Code
09	TESTR	From paper: Text Spotting Transformers	verified	56	2022	Paper, Code
10	SwinTextSpotter	From paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition	verified	51.8	2022	Paper, Code
11	TextDragon	From paper: TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting	verified	39.7	2019	Paper
