Getting Started
Schedule
Tutorials
Main Conference
Invited Talks
Panels
Papers
Oral-equivalent Papers
Competitions
Datasets and Benchmarks
Journal Track
Outstanding Paper Awards
Workshops
Community
Affinity Events
Socials
Mentorship
Town Hall
Careers / Recruiting
Help
Presenters Instructions
Moderators Instructions
FAQ
Helpdesk in RocketChat
Topia Poster Sessions
Organizers
Login
firstbacksecondback
Search All 2022 Events
Results
<<
<
Page 1 of 3
>
>>
Workshop
Balanced Off-Policy Evaluation for Personalized Pricing
Adam N. Elmachtoub · Vishal Gupta · YUNFAN ZHAO
Poster
Thu 14:00
Off-Policy Evaluation with Policy-Dependent Optimization Response
Wenshuo Guo · Michael Jordan · Angela Zhou
Workshop
Efficient Multi-Horizon Learning for Off-Policy Reinforcement Learning
Raja Farrukh Ali · Nasik Muhammad Nafi · Kevin Duong · William Hsu
Workshop
Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
Payal Bawa · Rafael Oliveira · Fabio Ramos
Workshop
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs
Yi Wan · Richard Sutton
Poster
Thu 14:00
Action-modulated midbrain dopamine activity arises from distributed control policies
Jack Lindsey · Ashok Litwin-Kumar
Poster
Thu 14:00
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions
Audrey Huang · Nan Jiang
Poster
Wed 14:00
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Haanvid Lee · Jongmin Lee · Yunseon Choi · Wonseok Jeon · Byung-Jun Lee · Yung-Kyun Noh · Kee-Eung Kim
Poster
Thu 9:00
Markovian Interference in Experiments
Vivek Farias · Andrew Li · Tianyi Peng · Andrew Zheng
Poster
Thu 14:00
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
Yunhao Tang · Remi Munos · Mark Rowland · Bernardo Avila Pires · Will Dabney · Marc Bellemare
Poster
Wed 9:00
Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin · Oya Celiktutan
Poster
Tue 9:00
The Pitfalls of Regularization in Off-Policy TD Learning
Gaurav Manek · J. Zico Kolter
NeurIPS uses cookies to remember that you are logged in. By using our websites, you agree to the placement of these cookies.
Our Privacy Policy »
Accept Cookies