1323 Results
Poster | Wed 11:00 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | CHENYU YANG · Xizhou Zhu · Jinguo Zhu · Weijie Su · Junjie Wang · Xuan Dong · Wenhai Wang · Bin Li · Jie Zhou · Yu Qiao · Jifeng Dai
Poster | Fri 11:00 | Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Peter Tong · Ellis Brown · Penghao Wu · Sanghyun Woo · Adithya Jairam Vedagiri IYER · Sai Charitha Akula · Shusheng Yang · Jihan Yang · Manoj Middepogu · Ziteng Wang · Xichen Pan · Rob Fergus · Yann LeCun · Saining Xie
Poster | Thu 16:30 | Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality | Tianle Zhang · Langtian Ma · Yuchen Yan · yuchen zhang · yue yang · Ziyao Guo · Wenqi Shao · Kai Wang · Yang You · Yu Qiao · Ping Luo · Kaipeng Zhang