firstbacksecondback
87 Results
Poster
|
Tue 18:30 |
Hindsight Experience Replay Marcin Andrychowicz · Filip Wolski · Alex Ray · Jonas Schneider · Rachel Fong · Peter Welinder · Bob McGrew · Josh Tobin · OpenAI Pieter Abbeel · Wojciech Zaremba |
|
Poster
|
Tue 18:30 |
Finite sample analysis of the GTD Policy Evaluation Algorithms in Markov Setting Yue Wang · Wei Chen · Yuting Liu · Zhi-Ming Ma · Tie-Yan Liu |
|
Demonstration
|
Tue 19:00 |
A Deep Reinforcement Learning Chatbot Iulian Vlad Serban · Chinnadhurai Sankar · Mathieu Germain · Saizheng Zhang · Zhouhan Lin · Sandeep Subramanian · Taesup Kim · Michael Pieper · Sarath Chandar · Nan Rosemary Ke · Sai Rajeswar Mudumba · Alexandre de Brébisson · Jose Sotelo · Dendi A Suhubdy · Vincent Michalski · Joelle Pineau · Yoshua Bengio |