firstbacksecondback
1333 Results
Poster
|
Fri 16:30 |
LAM3D: Large Image-Point Clouds Alignment Model for 3D Reconstruction from Single Image Ruikai Cui · Xibin Song · Weixuan Sun · Senbo Wang · Weizhe Liu · Shenzhou Chen · Taizhang Shang · YANG LI · Nick Barnes · Hongdong Li · Pan Ji |
|
Poster
|
Wed 11:00 |
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning CHENYU YANG · Xizhou Zhu · Jinguo Zhu · Weijie Su · Junjie Wang · Xuan Dong · Wenhai Wang · Bin Li · Jie Zhou · Yu Qiao · Jifeng Dai |
|
Poster
|
Fri 11:00 |
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Peter Tong · Ellis Brown · Penghao Wu · Sanghyun Woo · Adithya Jairam Vedagiri IYER · Sai Charitha Akula · Shusheng Yang · Jihan Yang · Manoj Middepogu · Ziteng Wang · Xichen Pan · Rob Fergus · Yann LeCun · Saining Xie |
|
Workshop
|
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents Nalin Tiwary · Vardhan Dongre · Sanil Chawla · Ashwin Lamani · Dilek Tur |
||
Poster
|
Wed 16:30 |
Efficient Temporal Action Segmentation via Boundary-aware Query Voting Peiyao Wang · Yuewei Lin · Erik Blasch · jie wei · Haibin Ling |
|
Poster
|
Fri 11:00 |
Recovering Complete Actions for Cross-dataset Skeleton Action Recognition Hanchao Liu · Yujiang Li · Tai-Jiang Mu · Shi-min Hu |
|
Poster
|
Wed 16:30 |
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition Yuhang Wen · Mengyuan Liu · Songtao Wu · Beichen Ding |
|
Poster
|
Thu 16:30 |
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality Tianle Zhang · Langtian Ma · Yuchen Yan · yuchen zhang · yue yang · Ziyao Guo · Wenqi Shao · Kai Wang · Yang You · Yu Qiao · Ping Luo · Kaipeng Zhang |
|
Poster
|
Fri 11:00 |
Action Imitation in Common Action Space for Customized Action Image Synthesis wang lin · Jingyuan Chen · Jiaxin Shi · Zirun Guo · Yichen Zhu · Zehan Wang · Tao Jin · Zhou Zhao · Fei Wu · Shuicheng Yan · Hanwang Zhang |
|
Workshop
|
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Thomas Schmied · Thomas Adler · Vihang Patil · Maximilian Beck · Korbinian Pöppel · Johannes Brandstetter · Günter Klambauer · Razvan Pascanu · Sepp Hochreiter |
||
Poster
|
Fri 11:00 |
Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection? qingsong zhao · Yi Wang · Jilan Xu · Yinan He · Zifan Song · Limin Wang · Yu Qiao · Cairong Zhao |
|
Poster
|
Wed 11:00 |
Learning Action and Reasoning-Centric Image Editing from Videos and Simulation Benno Krojer · Dheeraj Vattikonda · Luis Lara · Varun Jampani · Eva Portelance · Chris Pal · Siva Reddy |