firstbacksecondback
130 Results
Poster
|
Wed 14:00 |
Reward learning from human preferences and demonstrations in Atari Borja Ibarz · Jan Leike · Tobias Pohlen · Geoffrey Irving · Shane Legg · Dario Amodei |
|
Poster
|
Wed 14:00 |
Transfer of Value Functions via Variational Methods Andrea Tirinzoni · Rafael Rodriguez Sanchez · Marcello Restelli |
|
Workshop
|
Sat 5:00 |
Wordplay: Reinforcement and Language Learning in Text-based Games Adam Trischler · Angeliki Lazaridou · Yonatan Bisk · Wendy Tay · Nate Kushman · Marc-Alexandre Côté · Alessandro Sordoni · Daniel Ricks · Tom Zahavy · Hal Daumé III |
|
Poster
|
Wed 7:45 |
Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing Zehong Hu · Yitao Liang · Jie Zhang · Zhao Li · Yang Liu |
|
Poster
|
Wed 14:00 |
Genetic-Gated Networks for Deep Reinforcement Learning Simyung Chang · John Yang · Jaeseok Choi · Nojun Kwak |
|
Poster
|
Wed 14:00 |
Deep Reinforcement Learning of Marked Temporal Point Processes Utkarsh Upadhyay · Abir De · Manuel Gomez Rodriguez |
|
Poster
|
Wed 7:45 |
Fast deep reinforcement learning using online adjustments from the past Steven Hansen · Alexander Pritzel · Pablo Sprechmann · Andre Barreto · Charles Blundell |
|
Poster
|
Wed 14:00 |
Learning to Share and Hide Intentions using Information Regularization DJ Strouse · Max Kleiman-Weiner · Josh Tenenbaum · Matt Botvinick · David Schwab |
|
Poster
|
Wed 14:00 |
Learning Others' Intentional Models in Multi-Agent Settings Using Interactive POMDPs Yanlin Han · Piotr Gmytrasiewicz |
|
Poster
|
Wed 14:00 |
Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making Nishant Desai · Andrew Critch · Stuart J Russell |
|
Poster
|
Thu 14:00 |
Learning Loop Invariants for Program Verification Xujie Si · Hanjun Dai · Mukund Raghothaman · Mayur Naik · Le Song |
|
Poster
|
Wed 14:00 |
Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning Ofir Marom · Benjamin Rosman |