firstbacksecondback
25 Results
Poster
|
Wed 14:00 |
A Bayesian Approach to Generative Adversarial Imitation Learning Wonseok Jeon · Seokin Seo · Kee-Eung Kim |
|
Poster
|
Wed 14:00 |
Learning Task Specifications from Demonstrations Marcell Vazquez-Chanlatte · Susmit Jha · Ashish Tiwari · Mark Ho · Sanjit Seshia |
|
Poster
|
Wed 14:00 |
Context-dependent upper-confidence bounds for directed exploration Raksha Kumaraswamy · Matthew Schlegel · Adam White · Martha White |
|
Poster
|
Wed 7:45 |
Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing Zehong Hu · Yitao Liang · Jie Zhang · Zhao Li · Yang Liu |
|
Poster
|
Wed 7:45 |
Exploration in Structured Reinforcement Learning Jungseul Ok · Alexandre Proutiere · Damianos Tranos |
|
Poster
|
Wed 14:00 |
From Stochastic Planning to Marginal MAP Hao(Jackson) Cui · Radu Marinescu · Roni Khardon |
|
Poster
|
Wed 14:00 |
Occam's razor is insufficient to infer the preferences of irrational agents Stuart Armstrong · Sören Mindermann |
|
Poster
|
Wed 14:00 |
Bayesian Control of Large MDPs with Unknown Dynamics in Data-Poor Environments Mahdi Imani · Seyede Fatemeh Ghoreishi · Ulisses M. Braga-Neto |
|
Poster
|
Wed 14:00 |
Constrained Cross-Entropy Method for Safe Reinforcement Learning Min Wen · Ufuk Topcu |
|
Poster
|
Wed 14:00 |
A Lyapunov-based Approach to Safe Reinforcement Learning Yinlam Chow · Ofir Nachum · Edgar Duenez-Guzman · Mohammad Ghavamzadeh |
|
Poster
|
Thu 14:00 |
Non-delusional Q-learning and value-iteration Tyler Lu · Dale Schuurmans · Craig Boutilier |
|
Poster
|
Wed 14:00 |
Maximum Causal Tsallis Entropy Imitation Learning Kyungjae Lee · Sungjoon Choi · Songhwai Oh |