Codesota · Models · AudioCaps baseline (TopDown+Align) · Kim et al. · 1 result · 1 benchmark

Model card

AudioCaps baseline (TopDown+Align).

Kim et al. · open-source · Unknown params · VGGish + Top-Down attention + alignment loss · 1 current SOTA

Original AudioCaps paper baseline (NAACL 2019).

§ 02 · Benchmarks

Every benchmark on which AudioCaps baseline (TopDown+Align) has a recorded score.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	AudioCaps	Audio · Audio Captioning	SPIDEr	0.4%	#1/3	—	source ↗

Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
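The ranking rule described above can be sketched in a few lines. This is a minimal illustration, not Codesota's actual implementation: the `Result` record, the function names, the assumption that a higher metric value is better, and the reading of the number after the slash as the size of the scored pool are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional, Tuple


@dataclass
class Result:
    """One scored row (hypothetical schema, not the site's real one)."""
    model: str
    benchmark: str
    metric: str
    value: float
    reported: Optional[date] = None  # None stands for an empty Date cell


def rank_of(model: str, benchmark: str, metric: str,
            results: List[Result]) -> Tuple[int, int]:
    """Return (position, pool_size) for `model` on one benchmark + metric.

    Assumes higher values are better and that each model appears once per pool.
    """
    pool = [r for r in results
            if r.benchmark == benchmark and r.metric == metric]
    ordered = sorted(pool, key=lambda r: r.value, reverse=True)
    for position, row in enumerate(ordered, start=1):
        if row.model == model:
            return position, len(ordered)
    raise KeyError(f"{model} has no score on {benchmark}/{metric}")


def format_rank(position: int, pool_size: int) -> str:
    """Render a rank in the '#position/pool_size' style used in the table."""
    return f"#{position}/{pool_size}"


def sort_rows(rows: List[Tuple[int, Result]]) -> List[Tuple[int, Result]]:
    """Sort (rank, result) rows by rank, then newest date; undated rows last."""
    def key(row: Tuple[int, Result]) -> Tuple[int, int]:
        rank, result = row
        reported = result.reported or date.min
        return rank, -reported.toordinal()
    return sorted(rows, key=key)
```

Under these assumptions, a model holding the top value in a pool of three scored models would render as `#1/3`, matching the format of the Rank column.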

§ 03 · Strengths by area

Where AudioCaps baseline (TopDown+Align) actually performs.

Audio

benchmark

avg rank #1.0 · 1 SOTA

§ 06 · Sources & freshness

Where these numbers come from.

editorial

result

0 of 1 rows marked verified.