Search All 2024 Events

82 Results

Workshop
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
Xiaotian Han · Yiren Jian · Xuefeng Hu · Haogeng Liu · Yiqi Wang · Qihang Fan · Yuang Ai · Huaibo Huang · Ran He · Zhenheng Yang · Quanzeng You
Poster
Thu 11:00 Are We on the Right Way for Evaluating Large Vision-Language Models?
Lin Chen · Jinsong Li · Xiaoyi Dong · Pan Zhang · Yuhang Zang · Zehui Chen · Haodong Duan · Jiaqi Wang · Yu Qiao · Dahua Lin · Feng Zhao
Workshop
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Jianing Yang · Xuweiyi Chen · Nikhil Madaan · Madhavan Iyengar · Shengyi Qian · David Fouhey · Joyce Chai
Poster
Fri 16:30 Needle In A Multimodal Haystack
Weiyun Wang · Shuibo Zhang · Yiming Ren · Yuchen Duan · Tiantong Li · Shuo Liu · Mengkang Hu · Zhe Chen · Kaipeng Zhang · Lewei Lu · Xizhou Zhu · Ping Luo · Yu Qiao · Jifeng Dai · Wenqi Shao · Wenhai Wang
Poster
Fri 11:00 BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity
Zahra Gharaee · Scott C. Lowe · ZeMing Gong · Pablo Millan Arias · Nicholas Pellegrino · Austin T. Wang · Joakim Bruslund Haurum · Iuliia Eyriay · Lila Kari · Dirk Steinke · Graham Taylor · Paul Fieguth · Angel Chang
Workshop
Comparison Visual Instruction Tuning
Wei Lin · Muhammad Jehanzeb Mirza · Sivan Doveh · Rogerio Feris · Raja Giryes · Sepp Hochreiter · Leonid Karlinsky
Poster
Fri 16:30 What to Say and When to Say it: Live Fitness Coaching as a Testbed for Situated Interaction
Sunny Panchal · Apratim Bhattacharyya · Guillaume Berger · Antoine Mercier · Cornelius Böhm · Florian Dietrichkeit · Reza Pourreza · Xuanlin Li · Pulkit Madan · Mingu Lee · Mark Todorovich · Ingo Bax · Roland Memisevic
Workshop
MedAIScout: Automated Retrieval of Known Machine Learning Vulnerabilities in Medical Applications
Athish Pranav Dharmalingam · Gargi Mitra
Workshop
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Cristobal Eyzaguirre · Zane Durante · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez
Poster
Thu 11:00 Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang · Chancharik Mitra · Leonid Karlinsky · Assaf Arbelle · Trevor Darrell · Roei Herzig
Workshop
Chain-of-Imagination for Reliable Instruction Following in Decision Making
Enshen Zhou · Yiran Qin · Zhenfei (Jeremy) Yin · Yuzhou Huang · Ruimao Zhang · Lu Sheng · Yu Qiao · Jing Shao
Workshop
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez