91 Results
Poster | Wed 18:30 | Dynamic-Depth Context Tree Weighting | Joao V Messias · Shimon Whiteson
Poster | Wed 18:30 | Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation | Zhaohan Guo · Philip S. Thomas · Emma Brunskill
Poster | Wed 18:30 | A multi-agent reinforcement learning model of common-pool resource appropriation | Julien Pérolat · Joel Leibo · Vinicius Zambaldi · Charles Beattie · Karl Tuyls · Thore Graepel
Poster | Wed 18:30 | Reinforcement Learning under Model Mismatch | Aurko Roy · Huan Xu · Sebastian Pokutta
Poster | Wed 18:30 | Learning Unknown Markov Decision Processes: A Thompson Sampling Approach | Yi Ouyang · Mukul Gagrani · Ashutosh Nayyar · Rahul Jain
Poster | Tue 18:30 | Cold-Start Reinforcement Learning with Softmax Policy Gradient | Nan Ding · Radu Soricut
Poster | Wed 18:30 | A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning | Marc Lanctot · Vinicius Zambaldi · Audrunas Gruslys · Angeliki Lazaridou · Karl Tuyls · Julien Perolat · David Silver · Thore Graepel
Poster | Wed 18:30 | Adaptive Batch Size for Safe Policy Gradients | Matteo Papini · Matteo Pirotta · Marcello Restelli
Poster | Wed 18:30 | Compatible Reward Inverse Reinforcement Learning | Alberto Maria Metelli · Matteo Pirotta · Marcello Restelli
Workshop | Sat 9:40 | Landmark Options Via Reflection (LOVR) in Multi-task Lifelong Reinforcement Learning | Nicholas Denis
Oral | Wed 15:05 | Imagination-Augmented Agents for Deep Reinforcement Learning | Sébastien Racanière · Theophane Weber · David Reichert · Lars Buesing · Arthur Guez · Danilo Jimenez Rezende · Adrià Puigdomènech Badia · Oriol Vinyals · Nicolas Heess · Yujia Li · Razvan Pascanu · Peter Battaglia · Demis Hassabis · David Silver · Daan Wierstra
Poster | Tue 18:30 | QMDP-Net: Deep Learning for Planning under Partial Observability | Peter Karkus · David Hsu · Wee Sun Lee