Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

82 Results

<<   <   Page 3 of 7   >   >>
Poster
Wed 11:00 What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
Xin Wen · Bingchen Zhao · Yilun Chen · Jiangmiao Pang · Xiaojuan Qi
Workshop
Controlling Multimodal LLMs via Reward-guided Decoding
Oscar Mañas · Pierluca D&#x27;Oro · Koustuv Sinha · Adriana Romero · Michal Drozdzal · Aishwarya Agrawal
Workshop
LiMTR: Time Series Motion Prediction for Diverse Road Users through Multimodal Feature Integration
Camiel Oerlemans · Bram Grooten · Michiel Braat · Alaa Alassi · Emilia Silvas · Decebal Constantin Mocanu
Poster
Wed 11:00 MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning
Yifan Jiang · jiarui zhang · Kexuan Sun · Zhivar Sourati · Kian Ahrabian · Kaixin Ma · Filip Ilievski · Jay Pujara
Poster
Wed 11:00 Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation
Kehan Guo · Bozhao Nan · Yujun Zhou · Taicheng Guo · Zhichun Guo · Mihir Surve · Zhenwen Liang · Nitesh Chawla · Olaf Wiest · Xiangliang Zhang
Poster
Fri 11:00 Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
Ye Fang · Zeyi Sun · Tong Wu · Jiaqi Wang · Ziwei Liu · Gordon Wetzstein · Dahua Lin
Poster
UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-Training
Biao Gong · Shuai Tan · Yutong Feng · Xiaoying Xie · Yuyuan Li · Chaochao Chen · Kecheng Zheng · Yujun Shen · Deli Zhao
Workshop
A Formal Framework for Assessing and Mitigating Emergent Security Risks in Generative AI Models: Bridging Theory and Dynamic Risk Mitigation
aviral srivastava · Sourav Panda
Poster
Thu 16:30 Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Yang Jiao · Shaoxiang Chen · Zequn Jie · Jingjing Chen · Lin Ma · Yu-Gang Jiang
Poster
Wed 11:00 Visual Perception by Large Language Model’s Weights
Feipeng Ma · Hongwei Xue · Yizhou Zhou · Guangting Wang · Fengyun Rao · Shilin Yan · Yueyi Zhang · Siying Wu · Mike Zheng Shou · Xiaoyan Sun
Poster
Thu 11:00 Data curation via joint example selection further accelerates multimodal learning
Talfan Evans · Nikhil Parthasarathy · Hamza Merzic · Olivier Henaff
Poster
M3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation
Mingshuang Luo · RuiBing Hou · Zhuo Li · Hong Chang · Zimo Liu · Yaowei Wang · Shiguang Shan