firstbacksecondback
130 Results
Poster
|
Thu 11:00 |
Reinforcement Learning with LTL and ωω-Regular Objectives via Optimality-Preserving Translation to Average Rewards Xuan Bach Le · Dominik Wagner · Leon Witzman · Alexander Rabinovich · Luke Ong |
|
Poster
|
Wed 11:00 |
Chain of Thoughtlessness? An Analysis of CoT in Planning Kaya Stechly · Karthik Valmeekam · Subbarao Kambhampati |
|
Poster
|
Thu 16:30 |
Operator World Models for Reinforcement Learning Pietro Novelli · Marco Pratticò · Massimiliano Pontil · Carlo Ciliberto |
|
Poster
|
Fri 16:30 |
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning Andreas Schlaginhaufen · Maryam Kamgarpour |
|
Poster
|
Wed 11:00 |
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Ye Tian · Baolin Peng · Linfeng Song · Lifeng Jin · Dian Yu · Lei Han · Haitao Mi · Dong Yu |
|
Poster
|
Thu 16:30 |
The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure Tyler Sam · Yudong Chen · Christina Yu |
|
Poster
|
Thu 11:00 |
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates Jincheng Mei · Bo Dai · Alekh Agarwal · Sharan Vaswani · Anant Raj · Csaba Szepesvari · Dale Schuurmans |
|
Poster
|
Wed 11:00 |
Exploratory Retrieval-Augmented Planning For Continual Embodied Instruction Following Minjong Yoo · Jinwoo Jang · Wei-Jin Park · Honguk Woo |
|
Poster
|
Wed 11:00 |
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces Angeliki Kamoutsi · Peter Schmitt-Förster · Tobias Sutter · Volkan Cevher · John Lygeros |
|
Poster
|
Thu 11:00 |
Foundations of Multivariate Distributional Reinforcement Learning Harley Wiltzer · Jesse Farebrother · Arthur Gretton · Mark Rowland |
|
Poster
|
Wed 16:30 |
Reward Machines for Deep RL in Noisy and Uncertain Environments Andrew Li · Zizhao Chen · Toryn Klassen · Pashootan Vaezipoor · Rodrigo Toro Icarte · Sheila McIlraith |
|
Poster
|
Wed 16:30 |
SeeA*: Efficient Exploration-Enhanced A* Search by Selective Sampling Dengwei Zhao · Shikui Tu · Lei Xu |