Text-to-Audio2023en

AudioCaps — Text-to-Audio Generation Benchmark

AudioCaps captions used as prompts for text-to-audio generation models. Standard eval for AudioLDM, AudioGen, Stable Audio.

No benchmark results indexed for this dataset yet.

Contribute results on GitHub