Audio Captioning2019en
AudioCaps
Audio generation quality evaluated on AudioCaps captions
Current State of the Art
AudioCaps baseline (TopDown+Align)
Kim et al.
0.369
spider
AudioCaps — spider
3 results · 1 SOTA advances · higher is better
All results
SOTA frontier
spider Progress Over Time
Showing 3 breakthroughs from May 2023 to Apr 2026
Key Milestones
Apr 2026
AudioCaps baseline (TopDown+Align)Current SOTA
Original AudioCaps baseline — seed, verify (paper reports CIDEr/METEOR/SPICE separately).
0.369
+23.0%
Total Improvement
36.2%
Time Span
3y
Breakthroughs
3
Current SOTA
0.369
Top Models Performance Comparison
Top 3 models ranked by spider
Best Score
0.369
Top Model
AudioCaps baselin...
Models Compared
3
Score Range
0.098