Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

13 Results

<<   <   Page 1 of 2   >   >>
Poster
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan-Ming Luo · Zuolin Tu · Zefang Huang · Yang Yu
Workshop
Faster, More Efficient RLHF through Off-Policy Asynchronous Learning
Michael Noukhovitch · Shengyi Huang · Sophie Xhonneux · Arian Hosseini · Rishabh Agarwal · Aaron Courville
Poster
Wed 16:30 Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett · Nathan Kallus · Miruna Oprescu · Wen Sun · Kaiwen Wang
Poster
Thu 11:00 On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang · Nan Jiang
Poster
Fri 11:00 RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon · Shie Mannor · Constantine Caramanis · Yonathan Efroni
Poster
Fri 16:30 Improved off-policy training of diffusion samplers
Marcin Sendera · Minsu Kim · Sarthak Mittal · Pablo Lemos · Luca Scimeca · Jarrid Rector-Brooks · Alexandre Adam · Yoshua Bengio · Nikolay Malkin
Poster
Wed 16:30 Off-policy estimation with adaptively collected data: the power of online learning
Jeonghwan Lee · Cong Ma
Poster
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Shuguang Yu · Shuxing Fang · Ruixin Peng · Zhengling Qi · Fan Zhou · Chengchun Shi
Workshop
Improved Off-policy Reinforcement Learning in Biological Sequence Design
Hyeonah Kim · Minsu Kim · Taeyoung Yun · Sanghyeok Choi · Emmanuel Bengio · Alex Hernandez-Garcia · Jinkyoo Park
Poster
Thu 16:30 Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation
Shreyas Chaudhari · Ameet Deshpande · Bruno C. da Silva · Philip Thomas
Poster
Wed 16:30 Off-Policy Selection for Initiating Human-Centric Experimental Design
Ge Gao · Xi Yang · Qitong Gao · Song Ju · Miroslav Pajic · Min Chi
Poster
Fri 11:00 Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi · Imad Aouali · Pierre Alquier · Nicolas Chopin