firstbacksecondback
1333 Results
Poster
|
Thu 11:00 |
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation Jiaming Liu · Mengzhen Liu · Zhenyu Wang · Pengju An · Xiaoqi Li · Kaichen Zhou · Senqiao Yang · Renrui Zhang · Yandong Guo · Shanghang Zhang |
|
Workshop
|
Integrating Visual and Linguistic Instructions for Context-Aware Navigation Agents Suhwan Choi · Yongjun Cho · Minchan Kim · Jaeyoon Jung · Myunchul Joe · Park Yu Been · Minseo Kim · Sungwoong Kim · Sungjae Lee · WHISEONG PARK · Jiwan Chung · Youngjae Yu |
||
Workshop
|
Integrating Visual and Linguistic Instructions for Context-Aware Navigation Agents Suhwan Choi · Yongjun Cho · Minchan Kim · Jaeyoon Jung · Myunchul Joe · Park Yu Been · Minseo Kim · Sungwoong Kim · Sungjae Lee · WHISEONG PARK · Jiwan Chung · Youngjae Yu |
||
Poster
|
Thu 16:30 |
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang · Shaofei Cai · Zhancun Mu · Haowei Lin · Ceyao Zhang · Xuejie Liu · Qing Li · Anji Liu · Xiaojian (Shawn) Ma · Yitao Liang |
|
Workshop
|
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou |
||
Workshop
|
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou |
||
Workshop
|
Sat 14:30 |
2:30 - 3:30 PM: Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning |
|
Workshop
|
How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? Saeid Asgari · Joseph G Lambourne · Alana Mongkhounsavath |
||
Workshop
|
How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? Saeid Asgari · Joseph G Lambourne · Alana Mongkhounsavath |
||
Workshop
|
Sat 13:00 |
Navigating Neural Fields with Vision-Language Models Neale Ratzlaff · Phillip Howard · VASUDEV LAL |
|
Poster
|
Wed 11:00 |
No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models Angéline Pouget · Lucas Beyer · Emanuele Bugliarello · Xiao Wang · Andreas Steiner · Xiaohua Zhai · Ibrahim Alabdulmohsin |
|
Poster
|
Wed 16:30 |
VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance Divyansh Srivastava · Ge Yan · Lily Weng |