Poster | Tue 14:00 | AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments | Sudipta Paul · Amit Roy-Chowdhury · Anoop Cherian
Poster | Tue 9:00 | A Closer Look at Weakly-Supervised Audio-Visual Source Localization | Shentong Mo · Pedro Morgado
Poster | | Audio-Driven Co-Speech Gesture Video Generation | Xian Liu · Qianyi Wu · Hang Zhou · Yuanqi Du · Wayne Wu · Dahua Lin · Ziwei Liu
Poster | Wed 14:00 | u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality | Wei-Ning Hsu · Bowen Shi
Poster | Tue 14:00 | Learning State-Aware Visual Representations from Audible Interactions | Himangi Mittal · Pedro Morgado · Unnat Jain · Abhinav Gupta
Poster | Thu 9:00 | SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning | Changan Chen · Carl Schissler · Sanchit Garg · Philip Kobernik · Alexander Clegg · Paul Calamia · Dhruv Batra · Philip Robinson · Kristen Grauman
Poster | Wed 9:00 | Few-Shot Audio-Visual Learning of Environment Acoustics | Sagnik Majumder · Changan Chen · Ziad Al-Halah · Kristen Grauman
Poster | Wed 14:00 | Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation | Moitreya Chatterjee · Narendra Ahuja · Anoop Cherian
Poster | Tue 14:00 | Multi-modal Grouping Network for Weakly-Supervised Audio-Visual Video Parsing | Shentong Mo · Yapeng Tian
Workshop | | Domain Invariant Q-Learning for model-free robust continuous control under visual distractions | Tom Dupuis · Jaonary Rabarisoa · Quoc Cuong PHAM · David Filliat
Poster | Thu 9:00 | Energy-Based Contrastive Learning of Visual Representations | Beomsu Kim · Jong Chul Ye
Workshop | | On the Pitfalls of Visual Learning in Referential Games | Shresth Verma