firstbacksecondback
5 Results
Workshop
|
MobileFlow: A Multimodal LLM For Mobile GUI Agent Songqin Nong · Jiali Zhu · Rui Wu · Jiongchao Jin · Shuo Shan · Xiutian Huang · Wenhao Xu |
||
Workshop
|
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou |
||
Workshop
|
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou |
||
Workshop
|
GUI-WORLD: A GUI-oriented Video Dataset for Multimodal LLM-based Agents Dongping Chen · Yue Huang · Siyuan Wu · Jingyu Tang · Huichi Zhou · Qihui Zhang · Zhigang He · Yilin Bai · Gao Chujie · Liuyi Chen · Yiqiang Li · Chenlong Wang · Yue Yu · Tianshuo Zhou · Zhen Li · Yi Gui · Yao Wan · Pan Zhou · Jianfeng Gao · Lichao Sun |
||
Workshop
|
CRAB: Cross-platfrom agent benchmark for multi-modal embodied language model agents Tianqi Xu · Linyao Chen · Dai-Jie Wu · Yanjun Chen · Zecheng Zhang · Xiang Yao · Zhiqiang Xie · Yongchao Chen · Shilong Liu · Bochen Qian · Philip Torr · Bernard Ghanem · Guohao Li |