Paper Presentation
in
Workshop: The Future of Interactive Machine Learning
Efficient Exploration in Monte Carlo Tree Search using Human Action Abstractions, Kaushik Subramanian, Jonathan Scholz, Charles Isbell and Andrea Thomaz
Monte Carlo Tree Search (MCTS) is a family of methods for planning in large domains. It focuses on finding a good action for a particular state, making its complexity independent of the size of the state space. However, such methods scale exponentially with the branching factor, so effective application of MCTS requires good heuristics to arbitrate action selection during learning. In this paper we present a policy-guided approach that uses action abstractions, derived from human input, with MCTS to facilitate efficient exploration. Drawing on existing work in hierarchical reinforcement learning and interactive machine learning, we show how multi-step actions, represented as stochastic policies, can serve as good action-selection heuristics. We demonstrate the efficacy of our approach in the PacMan domain and highlight its advantages over traditional MCTS.
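The abstract does not spell out how the human-derived stochastic policy enters the tree search, so the Python sketch below shows one common way such a prior can bias UCT-style action selection: a PUCT-like exploration bonus scaled by the prior's probability for each action. The class and function names, the bonus formula, and the toy action set are illustrative assumptions, not the paper's exact formulation.

import math
import random

class Node:
    """Per-state search statistics for a flat UCT-style selection step."""
    def __init__(self, actions):
        self.actions = actions
        self.visits = 0
        self.child_visits = {a: 0 for a in actions}
        self.child_value = {a: 0.0 for a in actions}

def select_action(node, prior_policy, c=1.4):
    """Pick the action maximizing a UCB score whose exploration bonus is
    weighted by a stochastic prior policy (here, assumed to come from
    human-provided action abstractions)."""
    def score(a):
        n = node.child_visits[a]
        q = node.child_value[a] / n if n > 0 else 0.0
        # Prior probability scales the exploration term (PUCT-style),
        # steering simulations toward human-preferred actions.
        bonus = c * prior_policy[a] * math.sqrt(node.visits + 1) / (1 + n)
        return q + bonus
    return max(node.actions, key=score)

def update(node, action, reward):
    """Back up the simulation outcome into the node's statistics."""
    node.visits += 1
    node.child_visits[action] += 1
    node.child_value[action] += reward

if __name__ == "__main__":
    # Toy single-state example: the prior concentrates exploration on
    # "north", which here also happens to be the rewarding action.
    actions = ["north", "south", "east", "west"]
    prior = {"north": 0.6, "south": 0.1, "east": 0.2, "west": 0.1}
    node = Node(actions)
    for _ in range(1000):
        a = select_action(node, prior)
        reward = random.gauss(1.0 if a == "north" else 0.0, 0.1)
        update(node, a, reward)
    print(node.child_visits)  # visits concentrate on the prior-favored action

In a full MCTS this selection rule would be applied at every node along the simulated path; the sketch collapses that to a single node purely to show how the prior reshapes exploration relative to plain UCT.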