Timezone: »
Deep RePReL--Combining Planning and Deep RL for acting in relational domains
Harsha Kokel · Arjun Manoharan · Sriraam Natarajan · Balaraman Ravindran · Prasad Tadepalli
Event URL: https://openreview.net/forum?id=ffLKUFlsFK0 »
We consider the problem of combining a symbolic planner and a Deep RL agent to achieve the best of both worlds -- the generalization ability of the planner with the effective learning ability of Deep RL. To this effect, we extend a previous work of Kokel et al. ICAPS 2021, RePReL, to Deep RL. As we demonstrate in experiments in two relational worlds, this combination enables effective learning, transfer and generalization when compared to the use of only Deep RL.
Author Information
Harsha Kokel (University of Texas, Dallas)
Arjun Manoharan (Indian Institute of Technology Madras)
Sriraam Natarajan (Indiana University)
Balaraman Ravindran (Indian Institute of Technology Madras)
Prasad Tadepalli (Oregon State University)
More from the Same Authors
-
2021 Spotlight: Optimal Policies Tend To Seek Power »
Alex Turner · Logan Smith · Rohin Shah · Andrew Critch · Prasad Tadepalli -
2021 : Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning »
Videh Nema · Balaraman Ravindran -
2021 : Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning »
Videh Nema · Balaraman Ravindran -
2021 : Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning »
Videh Nema · Balaraman Ravindran -
2022 : Guiding Offline Reinforcement Learning Using a Safety Expert »
Richa Verma · Kartik Bharadwaj · Harshad Khadilkar · Balaraman Ravindran -
2022 : Lagrangian Model Based Reinforcement Learning »
Adithya Ramesh · Balaraman Ravindran -
2022 Poster: Parametrically Retargetable Decision-Makers Tend To Seek Power »
Alex Turner · Prasad Tadepalli -
2022 Poster: ORIENT: Submodular Mutual Information Measures for Data Subset Selection under Distribution Shift »
Athresh Karanam · Krishnateja Killamsetty · Harsha Kokel · Rishabh Iyer -
2021 : Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning »
Kushal Chauhan · Soumya Chatterjee · Pradeep Shenoy · Balaraman Ravindran -
2021 Poster: Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models »
Matej Zečević · Devendra Dhami · Athresh Karanam · Sriraam Natarajan · Kristian Kersting -
2021 Poster: One Explanation is Not Enough: Structured Attention Graphs for Image Classification »
Vivswan Shitole · Fuxin Li · Minsuk Kahng · Prasad Tadepalli · Alan Fern -
2021 Poster: Optimal Policies Tend To Seek Power »
Alex Turner · Logan Smith · Rohin Shah · Andrew Critch · Prasad Tadepalli -
2020 Poster: Avoiding Side Effects in Complex Environments »
Alex Turner · Neale Ratzlaff · Prasad Tadepalli -
2020 Spotlight: Avoiding Side Effects in Complex Environments »
Alex Turner · Neale Ratzlaff · Prasad Tadepalli -
2019 : Coffee Break & Poster Session 2 »
Juho Lee · Yoonho Lee · Yee Whye Teh · Raymond A. Yeh · Yuan-Ting Hu · Alex Schwing · Sara Ahmadian · Alessandro Epasto · Marina Knittel · Ravi Kumar · Mohammad Mahdian · Christian Bueno · Aditya Sanghi · Pradeep Kumar Jayaraman · Ignacio Arroyo-Fernández · Andrew Hryniowski · Vinayak Mathur · Sanjay Singh · Shahrzad Haddadan · Vasco Portilheiro · Luna Zhang · Mert Yuksekgonul · Jhosimar Arias Figueroa · Deepak Maurya · Balaraman Ravindran · Frank NIELSEN · Philip Pham · Justin Payan · Andrew McCallum · Jinesh Mehta · Ke SUN -
2018 : Spotlights 2 »
Mausam · Ankit Anand · Parag Singla · Tarik Koc · Tim Klinger · Habibeh Naderi · Sungwon Lyu · Saeed Amizadeh · Kshitij Dwivedi · Songpeng Zu · Wei Feng · Balaraman Ravindran · Edouard Pineau · Abdulkadir Celikkanat · Deepak Venugopal -
2017 Tutorial: Statistical Relational Artificial Intelligence: Logic, Probability and Computation »
Luc De Raedt · David Poole · Kristian Kersting · Sriraam Natarajan -
2014 Poster: An Autoencoder Approach to Learning Bilingual Word Representations »
Sarath Chandar · Stanislas Lauly · Hugo Larochelle · Mitesh Khapra · Balaraman Ravindran · Vikas C Raykar · Amrita Saha -
2013 Poster: Symbolic Opportunistic Policy Iteration for Factored-Action MDPs »
Aswin Raghavan · Roni Khardon · Alan Fern · Prasad Tadepalli -
2012 Poster: A Bayesian Approach for Policy Learning from Trajectory Preference Queries »
Aaron Wilson · Alan Fern · Prasad Tadepalli -
2011 Poster: Autonomous Learning of Action Models for Planning »
Neville Mehta · Prasad Tadepalli · Alan Fern -
2011 Poster: Inverting Grice's Maxims to Learn Rules from Natural Language Extractions »
M. Shahed Sorower · Thomas Dietterich · Janardhan Rao Doppa · Walker Orr · Prasad Tadepalli · Xiaoli Fern -
2010 Poster: A Computational Decision Theory for Interactive Assistants »
Alan Fern · Prasad Tadepalli