Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

5 Results

<<   <   Page 1 of 1   >>   >
Workshop
MobileFlow: A Multimodal LLM For Mobile GUI Agent
Songqin Nong · Jiali Zhu · Rui Wu · Jiongchao Jin · Shuo Shan · Xiutian Huang · Wenhao Xu
Workshop
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent
Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou
Workshop
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent
Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou
Workshop
GUI-WORLD: A GUI-oriented Video Dataset for Multimodal LLM-based Agents
Dongping Chen · Yue Huang · Siyuan Wu · Jingyu Tang · Huichi Zhou · Qihui Zhang · Zhigang He · Yilin Bai · Gao Chujie · Liuyi Chen · Yiqiang Li · Chenlong Wang · Yue Yu · Tianshuo Zhou · Zhen Li · Yi Gui · Yao Wan · Pan Zhou · Jianfeng Gao · Lichao Sun
Workshop
CRAB: Cross-platfrom agent benchmark for multi-modal embodied language model agents
Tianqi Xu · Linyao Chen · Dai-Jie Wu · Yanjun Chen · Zecheng Zhang · Xiang Yao · Zhiqiang Xie · Yongchao Chen · Shilong Liu · Bochen Qian · Philip Torr · Bernard Ghanem · Guohao Li