Skip to yearly menu bar Skip to main content


Search All 2022 Events
 

24 Results

<<   <   Page 1 of 2   >   >>
Workshop
Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
Payal Bawa · Rafael Oliveira · Fabio Ramos
Workshop
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs
Yi Wan · Richard Sutton
Workshop
Efficient Multi-Horizon Learning for Off-Policy Reinforcement Learning
Raja Farrukh Ali · Nasik Muhammad Nafi · Kevin Duong · William Hsu
Poster
Thu 14:00 Action-modulated midbrain dopamine activity arises from distributed control policies
Jack Lindsey · Ashok Litwin-Kumar
Poster
Wed 14:00 Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Haanvid Lee · Jongmin Lee · Yunseon Choi · Wonseok Jeon · Byung-Jun Lee · Yung-Kyun Noh · Kee-Eung Kim
Poster
Thu 14:00 Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions
Audrey Huang · Nan Jiang
Workshop
MOPA: a Minimalist Off-Policy Approach to Safe-RL
Hao Sun · Ziping Xu · Zhenghao Peng · Meng Fang · Bo Dai · Bolei Zhou
Workshop
AsymQ: Asymmetric Q-loss to mitigate overestimation bias in off-policy reinforcement learning
Qinsheng Zhang · Arjun Krishna · Sehoon Ha · Yongxin Chen
Poster
Thu 14:00 The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
Yunhao Tang · Remi Munos · Mark Rowland · Bernardo Avila Pires · Will Dabney · Marc Bellemare
Poster
Thu 9:00 Markovian Interference in Experiments
Vivek Farias · Andrew Li · Tianyi Peng · Andrew Zheng
Poster
Wed 9:00 Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin · Oya Celiktutan
Poster
Wed 9:00 Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe · Rei Sato · Kazuto Fukuchi · Jun Sakuma · Youhei Akimoto