firstbacksecondback
130 Results
Poster
|
Tue 14:00 |
Learning to Solve SMT Formulas Mislav Balunovic · Pavol Bielik · Martin Vechev |
|
Poster
|
Wed 14:00 |
REFUEL: Exploring Sparse Features in Deep Reinforcement Learning for Fast Disease Diagnosis Yu-Shao Peng · Kai-Fu Tang · Hsuan-Tien Lin · Edward Chang |
|
Poster
|
Thu 14:00 |
Non-delusional Q-learning and value-iteration Tyler Lu · Dale Schuurmans · Craig Boutilier |
|
Poster
|
Wed 14:00 |
Meta-Gradient Reinforcement Learning Zhongwen Xu · Hado van Hasselt · David Silver |
|
Poster
|
Wed 14:00 |
Meta-Reinforcement Learning of Structured Exploration Strategies Abhishek Gupta · Russell Mendonca · YuXuan Liu · Pieter Abbeel · Sergey Levine |
|
Poster
|
Wed 14:00 |
Playing hard exploration games by watching YouTube Yusuf Aytar · Tobias Pfaff · David Budden · Thomas Paine · Ziyu Wang · Nando de Freitas |
|
Poster
|
Wed 7:45 |
Synthesize Policies for Transfer and Adaptation across Tasks and Environments Hexiang Hu · Liyu Chen · Boqing Gong · Fei Sha |
|
Poster
|
Wed 14:00 |
Scalar Posterior Sampling with Applications Georgios Theocharous · Zheng Wen · Yasin Abbasi Yadkori · Nikos Vlassis |
|
Poster
|
Wed 14:00 |
On Learning Intrinsic Rewards for Policy Gradient Methods Zeyu Zheng · Junhyuk Oh · Satinder Singh |
|
Poster
|
Tue 7:45 |
The Importance of Sampling inMeta-Reinforcement Learning Bradly Stadie · Ge Yang · Rein Houthooft · Peter Chen · Yan Duan · Yuhuai Wu · Pieter Abbeel · Ilya Sutskever |
|
Poster
|
Thu 14:00 |
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion Jacob Buckman · Danijar Hafner · George Tucker · Eugene Brevdo · Honglak Lee |
|
Poster
|
Wed 14:00 |
Iterative Value-Aware Model Learning Amir-massoud Farahmand |