Skip to yearly menu bar Skip to main content


AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models

Jisheng Bai ⋅ Haohe Liu ⋅ Mou Wang ⋅ Dongyuan Shi ⋅ Wenwu Wang ⋅ Mark Plumbley ⋅ Woon-Seng Gan ⋅ Jianfeng Chen

Abstract

Video

Chat is not available.