Timezone: »
Point-based algorithms have been surprisingly successful in computing approximately optimal policies for partially observable Markov decision processes (POMDPs) in high dimensional belief spaces. In this work, we seek to understand the belief-space properties that allow some POMDP problems to be approximated efficiently and thus help to explain the point-based algorithms' success often observed in the experiments. We show that an approximately optimal POMDP solution can be computed in time polynomial in the covering number of a reachable belief space, the subset of the belief space reachable from a given belief point. We also show that under the weaker condition of having a small covering number for an optimal reachable space, the subset of the belief space reachable under an optimal policy, computing an approximately optimal solution is NP-hard. However, given a set of points from an optimal reachable space that covers it well, an approximate solution can be computed in polynomial time. The covering number highlights several interesting properties that help reduce the complexity of POMDP problems in practice, such as fully observed state variables, beliefs with sparse support, smooth beliefs, and circulant state-transition matrices.
Author Information
David Hsu (National University of Singapore)
Wee Sun Lee (National University of Singapore)
Wee Sun Lee is a professor in the Department of Computer Science, National University of Singapore. He obtained his B.Eng from the University of Queensland in 1992 and his Ph.D. from the Australian National University in 1996. He has been a research fellow at the Australian Defence Force Academy, a fellow of the Singapore-MIT Alliance, and a visiting scientist at MIT. His research interests include machine learning, planning under uncertainty, and approximate inference. His works have won the Test of Time Award at Robotics: Science and Systems (RSS) 2021, the RoboCup Best Paper Award at International Conference on Intelligent Robots and Systems (IROS) 2015, the Google Best Student Paper Award, Uncertainty in AI (UAI) 2014 (as faculty co-author), as well as several competitions and challenges. He has been an area chair for machine learning and AI conferences such as the Neural Information Processing Systems (NeurIPS), the International Conference on Machine Learning (ICML), the AAAI Conference on Artificial Intelligence (AAAI), and the International Joint Conference on Artificial Intelligence (IJCAI). He was a program, conference and journal track co-chair for the Asian Conference on Machine Learning (ACML), and he is currently the co-chair of the steering committee of ACML.
Nan Rong (Cornell University)
Related Events (a corresponding poster, oral, or spotlight)
-
2007 Poster: What makes some POMDP problems easy to approximate? »
Tue. Dec 4th 06:30 -- 06:40 PM Room
More from the Same Authors
-
2022 : Efficient Offline Policy Optimization with a Learned Model »
Zichen Liu · Siyi Li · Wee Sun Lee · Shuicheng Yan · Zhongwen Xu -
2022 Poster: Receding Horizon Inverse Reinforcement Learning »
Yiqing Xu · Wei Gao · David Hsu -
2021 : Part 4: Appendix: Proofs and Derivations »
Wee Sun Lee -
2021 : Part 3: Graph Neural Networks and Attention Networks »
Wee Sun Lee -
2021 : Part 2: Markov Decision Process »
Wee Sun Lee -
2021 Tutorial: Message Passing In Machine Learning »
Wee Sun Lee -
2021 : Part 1: Message Passing Overview and Probabilistic Graphical Models »
Wee Sun Lee -
2020 Poster: Factor Graph Neural Networks »
Zhen Zhang · Fan Wu · Wee Sun Lee -
2018 Workshop: Reinforcement Learning under Partial Observability »
Joni Pajarinen · Chris Amato · Pascal Poupart · David Hsu -
2017 Poster: QMDP-Net: Deep Learning for Planning under Partial Observability »
Peter Karkus · David Hsu · Wee Sun Lee -
2015 Poster: Adaptive Stochastic Optimization: From Sets to Paths »
Zhan Wei Lim · David Hsu · Wee Sun Lee -
2013 Poster: DESPOT: Online POMDP Planning with Regularization »
Adhiraj Somani · Nan Ye · David Hsu · Wee Sun Lee -
2013 Poster: Learning with Invariance via Linear Functionals on Reproducing Kernel Hilbert Space »
Xinhua Zhang · Wee Sun Lee · Yee Whye Teh -
2013 Spotlight: Learning with Invariance via Linear Functionals on Reproducing Kernel Hilbert Space »
Xinhua Zhang · Wee Sun Lee · Yee Whye Teh -
2013 Poster: Active Learning for Probabilistic Hypotheses Using the Maximum Gibbs Error Criterion »
Nguyen Viet Cuong · Wee Sun Lee · Nan Ye · Kian Ming Adam Chai · Hai Leong Chieu -
2011 Poster: Monte Carlo Value Iteration with Macro-Actions »
Zhan Wei Lim · David Hsu · Wee Sun Lee -
2010 Session: Oral Session 2 »
Wee Sun Lee -
2009 Poster: Conditional Random Fields with High-Order Features for Sequence Labeling »
Nan Ye · Wee Sun Lee · Hai Leong Chieu · Dan Wu -
2007 Poster: Cooled and Relaxed Survey Propagation for MRFs »
Hai Leong Chieu · Wee Sun Lee · Yee Whye Teh -
2007 Spotlight: Cooled and Relaxed Survey Propagation for MRFs »
Hai Leong Chieu · Wee Sun Lee · Yee Whye Teh -
2006 Poster: Hyperparameter Learning for Graph Based Semi-supervised Learning Algorithms »
Xinhua Zhang · Wee Sun Lee