82 Results
Workshop
|
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Xiaotian Han · Yiren Jian · Xuefeng Hu · Haogeng Liu · Yiqi Wang · Qihang Fan · Yuang Ai · Huaibo Huang · Ran He · Zhenheng Yang · Quanzeng You |
||
Poster
|
Thu 11:00 |
Are We on the Right Way for Evaluating Large Vision-Language Models? Lin Chen · Jinsong Li · Xiaoyi Dong · Pan Zhang · Yuhang Zang · Zehui Chen · Haodong Duan · Jiaqi Wang · Yu Qiao · Dahua Lin · Feng Zhao |
|
Workshop
|
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Jianing Yang · Xuweiyi Chen · Nikhil Madaan · Madhavan Iyengar · Shengyi Qian · David Fouhey · Joyce Chai |
||
Poster
|
Fri 16:30 |
Needle In A Multimodal Haystack Weiyun Wang · Shuibo Zhang · Yiming Ren · Yuchen Duan · Tiantong Li · Shuo Liu · Mengkang Hu · Zhe Chen · Kaipeng Zhang · Lewei Lu · Xizhou Zhu · Ping Luo · Yu Qiao · Jifeng Dai · Wenqi Shao · Wenhai Wang |
|
Poster
|
Fri 11:00 |
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity Zahra Gharaee · Scott C. Lowe · ZeMing Gong · Pablo Millan Arias · Nicholas Pellegrino · Austin T. Wang · Joakim Bruslund Haurum · Iuliia Eyriay · Lila Kari · Dirk Steinke · Graham Taylor · Paul Fieguth · Angel Chang |
|
Workshop
|
Comparison Visual Instruction Tuning Wei Lin · Muhammad Jehanzeb Mirza · Sivan Doveh · Rogerio Feris · Raja Giryes · Sepp Hochreiter · Leonid Karlinsky |
||
Poster
|
Fri 16:30 |
What to Say and When to Say it: Live Fitness Coaching as a Testbed for Situated Interaction Sunny Panchal · Apratim Bhattacharyya · Guillaume Berger · Antoine Mercier · Cornelius Böhm · Florian Dietrichkeit · Reza Pourreza · Xuanlin Li · Pulkit Madan · Mingu Lee · Mark Todorovich · Ingo Bax · Roland Memisevic |
|
Workshop
|
MedAIScout: Automated Retrieval of Known Machine Learning Vulnerabilities in Medical Applications Athish Pranav Dharmalingam · Gargi Mitra |
||
Workshop
|
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models? Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Cristobal Eyzaguirre · Zane Durante · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez |
||
Poster
|
Thu 11:00 |
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning Brandon Huang · Chancharik Mitra · Leonid Karlinsky · Assaf Arbelle · Trevor Darrell · Roei Herzig |
|
Workshop
|
Chain-of-Imagination for Reliable Instruction Following in Decision Making Enshen Zhou · Yiran Qin · Zhenfei (Jeremy) Yin · Yuzhou Huang · Ruimao Zhang · Lu Sheng · Yu Qiao · Jing Shao |
||
Workshop
|
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez |