Optical Character Recognition2020en

mldoc-zero-shot-english-to-italian

Dataset from Papers With Code

Metrics:accuracy, cer, wer, f1
Current State of the Art

MultiFiT, pseudo

Unknown

76.02

accuracy

accuracy Progress Over Time

Showing 3 breakthroughs from May 2018 to Sep 2019

68.770.772.774.776.7May 2018Dec 2018Sep 2019accuracyDate

Key Milestones

May 2018
MultiCCA + CNN

From paper: A Corpus for Multilingual Document Classification in Eight Languages

69.4
Dec 2018
Massively Multilingual Sentence Embeddings

From paper: Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond

69.4
+0.1%
Sep 2019
MultiFiT, pseudoCurrent SOTA

From paper: MultiFiT: Efficient Multi-lingual Language Model Fine-tuning

76.0
+9.5%
Total Improvement
9.6%
Time Span
1y 4m
Breakthroughs
3
Current SOTA
76.0

Top Models Performance Comparison

Top 4 models ranked by accuracy

accuracy1MultiFiT, pseudo76.0100.0%2Massively Multilingual Se...69.491.3%3MultiCCA + CNN69.491.3%4BiLSTM (Europarl)60.779.9%0%25%50%75%100%% of best
Best Score
76.0
Top Model
MultiFiT, pseudo
Models Compared
4
Score Range
15.3

accuracyPrimary

Related Papers3

Other Optical Character Recognition Datasets