firstbacksecondback
231 Results
Workshop
|
Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning Guy Azran · Mohamad Hosein Danesh · Stefano Albrecht · Sarah Keren |
||
Poster
|
Wed 15:00 |
Efficient Exploration in Continuous-time Model-based Reinforcement Learning Lenart Treven · Jonas Hübotter · Bhavya · Florian Dorfler · Andreas Krause |
|
Workshop
|
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents Jingqing Ruan · YiHong Chen · Bin Zhang · Zhiwei Xu · Tianpeng Bao · du qing · shi shiwei · Hangyu Mao · Xingyu Zeng · Rui Zhao |
||
Workshop
|
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents Jingqing Ruan · YiHong Chen · Bin Zhang · Zhiwei Xu · Tianpeng Bao · du qing · shi shiwei · Hangyu Mao · Xingyu Zeng · Rui Zhao |
||
Workshop
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal · Tarun Chiruvolu · Devendra Singh Chaplot · Russ Salakhutdinov |
||
Workshop
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal · Tarun Chiruvolu · Devendra Singh Chaplot · Russ Salakhutdinov |
||
Workshop
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal · Tarun Chiruvolu · Devendra Singh Chaplot · Russ Salakhutdinov |
||
Poster
|
Tue 15:15 |
Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning Lin Guan · Karthik Valmeekam · Sarath Sreedharan · Subbarao Kambhampati |
|
Poster
|
Tue 15:15 |
Model-Free Reinforcement Learning with the Decision-Estimation Coefficient Dylan J Foster · Noah Golowich · Jian Qian · Alexander Rakhlin · Ayush Sekhari |
|
Workshop
|
Sat 6:55 |
Learning Abstract World Models for Value-preserving Planning with Options Rafael Rodriguez Sanchez · George Konidaris |
|
Workshop
|
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models Mianchu Wang · Rui Yang · Xi Chen · Meng Fang |
||
Poster
|
Tue 8:45 |
Online Nonstochastic Model-Free Reinforcement Learning Udaya Ghai · Arushi Gupta · Wenhan Xia · Karan Singh · Elad Hazan |