firstbacksecondback
198 Results
Poster
|
Wed 16:30 |
Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem Declan Campbell · Sunayana Rane · Tyler Giallanza · Camillo Nicolò De Sabbata · Kia Ghods · Amogh Joshi · Alexander Ku · Steven Frankland · Tom Griffiths · Jonathan D Cohen · Taylor Webb |
|
Workshop
|
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Peng Xia · Kangyu Zhu · Haoran Li · Tianze Wang · Weijia Shi · Sheng Wang · Linjun Zhang · James Zou · Huaxiu Yao |
||
Workshop
|
DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students’ Hand-Drawn Math Images Sami Baral · Li Lucy · Ryan Knight · Alice Ng · Luca Soldaini · Neil Heffernan · Kyle Lo |
||
Workshop
|
Decompose, Recompose, and Conquer: Multi-modal LLMs are Vulnerable to Compositional Adversarial Attacks in Multi-Image Queries Julius Broomfield · George Ingebretsen · Reihaneh Iranmanesh · Sara Pieri · Ethan Kosak-Hine · Tom Gibbs · Reihaneh Rabbany · Kellin Pelrine |
||
Poster
|
Fri 16:30 |
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models Peng Xia · Ze Chen · Juanxi Tian · Yangrui Gong · Ruibo Hou · Yue Xu · Zhenbang Wu · Zhiyuan Fan · Yiyang Zhou · Kangyu Zhu · Wenhao Zheng · Zhaoyang Wang · Xiao Wang · Xuchao Zhang · Chetan Bansal · Marc Niethammer · Junzhou Huang · Hongtu Zhu · Yun Li · Jimeng Sun · Zongyuan Ge · Gang Li · James Zou · Huaxiu Yao |
|
Poster
|
Thu 16:30 |
Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman · Moksh Jain · Luca Scimeca · Minsu Kim · Marcin Sendera · Mohsin Hasan · Luke Rowe · Sarthak Mittal · Pablo Lemos · Emmanuel Bengio · Alexandre Adam · Jarrid Rector-Brooks · Yoshua Bengio · Glen Berseth · Nikolay Malkin |