Text-to-Audio2023en
AudioCaps — Text-to-Audio Generation Benchmark
AudioCaps captions used as prompts for text-to-audio generation models. Standard eval for AudioLDM, AudioGen, Stable Audio.
No benchmark results indexed for this dataset yet.
Contribute results on GitHub