2,000 environmental audio recordings organized into 50 classes (animals, natural soundscapes, etc.).
Accuracy is the reported evaluation metric for ESC-50. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | BEATs (iter3+) | unverified | 98.1 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 02 | BEATs | verified | 98.1 | 2023 | Source ↗ | Looks wrong? |
| 03 | HTS-AT | verified | 97 | 2022 | Source ↗ | Looks wrong? |
| 04 | AST-P | unverified | 95.6 | 2021 | Paper ↗Code ↗ | Looks wrong? |
| 05 | AST | verified | 95.6 | 2021 | Source ↗ | Looks wrong? |
| 06 | CLAP | verified | 93.7 | 2023 | Source ↗ | Looks wrong? |
| 07 | CLAP+K2C Aug. | unverified | 91 | 2022 | Paper ↗Code ↗ | Looks wrong? |
| 08 | AST-S | unverified | 88.7 | 2021 | Paper ↗Code ↗ | Looks wrong? |