Skip to yearly menu bar Skip to main content


What do MLLMs hear? Examining the interaction between LLM and audio encoder components in Multimodal Large Language Models

Enis Çoban ⋅ Michael Mandel ⋅ Johanna Devaney

Abstract

Video

Chat is not available.