3 Results
Poster | Thu 16:30 | Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs | Alexandros Haliassos · Rodrigo Mira · Honglie Chen · Zoe Landgraf · Stavros Petridis · Maja Pantic
Poster | Fri 11:00 | A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation | Gwanghyun Kim · Alonso Martinez · Yu-Chuan Su · Brendan Jou · Jose Lezama · Agrim Gupta · Lijun Yu · Lu Jiang · Aren Jansen · Jacob Walker · Krishna Somandepalli
Workshop | | Vision and language representations in multimodal AI models and human social brain regions during natural movie viewing | Hannah Small · Haemy Lee Masson · Leyla Isik