Home/Browse/audiocaps

audiocaps

Unknown

audiocaps is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for audiocaps.

Benchmark Stats

Models5
Papers5
Metrics1

SOTA History

fad

Higher is better

RankModelSourceScoreYearPaper
1AudioLDM

AudioLDM (Liu et al., ICML 2023). FAD on AudioCaps test set. Baseline comparison in AudioLDM 2 paper.

Community4.482023Source
2AudioLDM 2-Full-Large

AudioLDM 2-Full-Large (Liu et al., IEEE/ACM TASLP 2024). FAD on AudioCaps test set. Table II in paper.

Community1.862024Source
3AudioLDM 2-Full

AudioLDM 2-Full (Liu et al., IEEE/ACM TASLP 2024). FAD on AudioCaps test set. Table II in paper.

Community1.782024Source
4TANGO

TANGO (Ghosal et al., 2023). FAD on AudioCaps test set. Previous SOTA before AudioLDM 2.

Community1.732023Source
5AudioLDM 2-AC-Large

AudioLDM 2 AudioCaps-finetuned large model (Liu et al., IEEE/ACM TASLP 2024). Best FAD on AudioCaps test set. Table II in paper.

Community1.422024Source

Submit a Result