Who leads the reuters-21578 benchmark?

ApproxRepSet currently leads reuters-21578 with a score of 97.17 on accuracy.

What is the state-of-the-art score on reuters-21578?

The state-of-the-art result on reuters-21578 is 97.17 (accuracy), achieved by ApproxRepSet as of 2020.

How many models are tracked on reuters-21578?

Codesota tracks 8 models on reuters-21578 across 2 metrics.

When was the reuters-21578 leaderboard last updated?

The reuters-21578 leaderboard on Codesota includes results through 2020, with the earliest tracked result from 2019.

Codesota · Computer Vision · Optical Character Recognition · reuters-21578Tasks/Computer Vision/Optical Character Recognition

Optical Character Recognition · benchmark dataset · 2020 · EN

reuters-21578.

Name: reuters-21578 Benchmark Results
Creator: Codesota
Published: 2019-01-01
License: https://creativecommons.org/licenses/by/4.0/

Dataset from Papers With Code

Submit a result ↵

§ 01 · Leaderboard

Best published scores.

8 results indexed across 2 metrics. Shaded row marks current SOTA; ties broken by submission date.

Primary: accuracy · higher is better
All metrics: accuracy, f1

accuracy· primary

3 rows

#	Model	Org	Submitted	Paper / code	accuracy
01	ApproxRepSet	—	Apr 2019	Rep the Set: Neural Networks for Learning Set Representa… · code	97.17
02	REL-RWMD k-NN	—	Dec 2019	Speeding up Word Mover's Distance and its variants via p… · code	95.61
03	Orthogonalized Soft VSM	—	Mar 2020	Text classification with word embedding regularization a… · code	92.65

5 rows

#	Model	Org	Submitted	Paper / code	f1
01	MAGNET	—	Feb 2020	papers-with-code · code	89.90
02	VLAWE	—	Feb 2019	Vector of Locally-Aggregated Word Embeddings (VLAWE): A … · code	89.30
03	KD-LSTMreg	—	Apr 2019	DocBERT: BERT for Document Classification · code	88.90
04	LSTM-reg (single model)	—	Jun 2019	papers-with-code · code	87
05	SCDV-MS	—	Nov 2019	Improving Document Classification with Multi-Sense Embed… · code	82.71

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.

§ 03 · Progress

1 steps
of state of the art.

Each row below marks a model that broke the previous record on accuracy. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · accuracy

Apr 3, 2019ApproxRepSet97.17

Fig 3 · SOTA-setting models only. 1 entries span Apr 2019 → Apr 2019.

§ 04 · Literature

6 papers
tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

Text classification with word embedding regularization and soft similarity measure
Mar 2020·Orthogonalized Soft VSM
arXiv ↗Code
Speeding up Word Mover's Distance and its variants via properties of distances between embeddings
Dec 2019·REL-RWMD k-NN
arXiv ↗Code
Improving Document Classification with Multi-Sense Embeddings
Nov 2019·SCDV-MS
arXiv ↗Code
DocBERT: BERT for Document Classification
Apr 2019·KD-LSTMreg
arXiv ↗Code
Rep the Set: Neural Networks for Learning Set Representations
Apr 2019·ApproxRepSet
arXiv ↗Code
Vector of Locally-Aggregated Word Embeddings (VLAWE): A Novel Document-level Representation
Feb 2019·VLAWE
arXiv ↗Code

§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result ↵Read submission guide

What a submission needs

01A public checkpoint or API endpoint
02A reproduction script with frozen commit + seed
03Declared evaluation environment (Python, deps)
04One row per metric declared by this dataset
05A contact so we can follow up on discrepancies

reuters-21578.

Best published scores.

1 stepsof state of the art.

6 paperstied to this benchmark.

Neighbouring benchmarks.

Have a score that beatsthis table?

1 steps
of state of the art.

6 papers
tied to this benchmark.

Have a score that beats
this table?