firstbacksecondback
91 Results
Poster
|
Wed 18:30 |
Bandits Dueling on Partially Ordered Sets Julien Audiffren · Liva Ralaivola |
|
Poster
|
Wed 18:30 |
Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols Serhii Havrylov · Ivan Titov |
|
Poster
|
Wed 18:30 |
Deep Reinforcement Learning from Human Preferences Paul Christiano · Jan Leike · Tom Brown · Miljan Martic · Shane Legg · Dario Amodei |
|
Poster
|
Mon 18:30 |
One-Shot Imitation Learning Yan Duan · Marcin Andrychowicz · Bradly Stadie · OpenAI Jonathan Ho · Jonas Schneider · Ilya Sutskever · Pieter Abbeel · Wojciech Zaremba |
|
Poster
|
Wed 18:30 |
Online Reinforcement Learning in Stochastic Games Chen-Yu Wei · Yi-Te Hong · Chi-Jen Lu |
|
Poster
|
Wed 18:30 |
Imagination-Augmented Agents for Deep Reinforcement Learning Sébastien Racanière · Theophane Weber · David Reichert · Lars Buesing · Arthur Guez · Danilo Jimenez Rezende · Adrià Puigdomènech Badia · Oriol Vinyals · Nicolas Heess · Yujia Li · Razvan Pascanu · Peter Battaglia · Demis Hassabis · David Silver · Daan Wierstra |
|
Demonstration
|
Tue 19:00 |
A Deep Reinforcement Learning Chatbot Iulian Vlad Serban · Chinnadhurai Sankar · Mathieu Germain · Saizheng Zhang · Zhouhan Lin · Sandeep Subramanian · Taesup Kim · Michael Pieper · Sarath Chandar · Nan Rosemary Ke · Sai Rajeswar Mudumba · Alexandre de Brébisson · Jose Sotelo · Dendi A Suhubdy · Vincent Michalski · Joelle Pineau · Yoshua Bengio |