We study few-shot prompting of pretrained large language models (LLMs) towards solving PDDL planning problems. We are interested in two questions: (1) To what extent can LLMs solve PDDL planning problems on their own? (2) How and to what extent can LLMs be used to guide AI planners? Recent work by Valmeekam et al. (2022) presents negative evidence for (1) in the classic blocks world domain. We confirm this finding, but expand the inquiry to 18 domains and find more mixed results with a few clear successes. For (2), we propose a simple mechanism for using good-but-imperfect LLM outputs to aid a heuristic-search planner. We also find that the LLM performance is due not only to syntactic pattern matching, but also to its commonsense understanding of English terms that appear in the PDDL.
Author Information
Tom Silver (MIT)
Varun Hariprasad
Reece Shuttleworth (Computer Science and Artificial Intelligence Laboratory, Electrical Engineering & Computer Science)
Nishanth Kumar (Massachusetts Institute of Technology)
Nishanth Kumar is a Ph.D. student in the LIS Group at MIT CSAIL, where his research is supported by an NSF GRFP fellowship. Nishanth's research interests lie in enabling robots to exhibit long-horizon, multi-task intelligent behavior in real-world scenarios. To this end, his work seeks to synthesize ideas from a number of sub-fields of AI, including Reinforcement Learning, Task and Motion Planning, Program Synthesis, and Neurosymbolic AI. Previously, Nishanth obtained a Bachelor of Science in Computer Engineering from Brown University, where he was a Goldwater Scholar and a CRA Outstanding Undergrad Researcher Award Finalist, and was named the Outstanding Senior in Computer Engineering upon graduation.
Tomás Lozano-Pérez (Massachusetts Institute of Technology)
Leslie Kaelbling (MIT)
More from the Same Authors
- 2020: Robotic gripper design with Evolutionary Strategies and Graph Element Networks
  Ferran Alet · Maria Bauza · Adarsh K Jeewajee · Max Thomsen · Alberto Rodriguez · Leslie Kaelbling · Tomás Lozano-Pérez
- 2022 Poster: PDSketch: Integrated Domain Programming, Learning, and Planning
  Jiayuan Mao · Tomás Lozano-Pérez · Josh Tenenbaum · Leslie Kaelbling
- 2021 Poster: Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization
  Clement Gehring · Kenji Kawaguchi · Jiaoyang Huang · Leslie Kaelbling
- 2021 Poster: Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time
  Ferran Alet · Maria Bauza · Kenji Kawaguchi · Nurullah Giray Kuru · Tomás Lozano-Pérez · Leslie Kaelbling
- 2020 Poster: Online Bayesian Goal Inference for Boundedly Rational Planning Agents
  Tan Zhi-Xuan · Jordyn Mann · Tom Silver · Josh Tenenbaum · Vikash Mansinghka
- 2020 Poster: Adversarially-learned Inference via an Ensemble of Discrete Undirected Graphical Models
  Adarsh Keshav Jeewajee · Leslie Kaelbling
- 2020: Doing for our robots what nature did for us
  Leslie Kaelbling
- 2019 Poster: Neural Relational Inference with Fast Modular Meta-learning
  Ferran Alet · Erica Weng · Tomás Lozano-Pérez · Leslie Kaelbling
- 2018: Discussion Panel: Ryan Adams, Nicolas Heess, Leslie Kaelbling, Shie Mannor, Emo Todorov (moderator: Roy Fox)
  Ryan Adams · Nicolas Heess · Leslie Kaelbling · Shie Mannor · Emo Todorov · Roy Fox
- 2018: On the Value of Knowing What You Don't Know: Learning to Sample and Sampling to Learn for Robot Planning (Leslie Kaelbling)
  Leslie Kaelbling
- 2018: Leslie Kaelbling
  Leslie Kaelbling
- 2018 Workshop: Infer to Control: Probabilistic Reinforcement Learning and Structured Control
  Leslie Kaelbling · Martin Riedmiller · Marc Toussaint · Igor Mordatch · Roy Fox · Tuomas Haarnoja
- 2018: Talk 8: Leslie Kaelbling - Learning models of very large hybrid domains
  Leslie Kaelbling
- 2018 Poster: Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior
  Zi Wang · Beomjoon Kim · Leslie Kaelbling
- 2018 Spotlight: Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior
  Zi Wang · Beomjoon Kim · Leslie Kaelbling
- 2015 Poster: Bayesian Optimization with Exponential Convergence
  Kenji Kawaguchi · Leslie Kaelbling · Tomás Lozano-Pérez
- 2008 Poster: Multi-Agent Filtering with Infinitely Nested Beliefs
  Luke Zettlemoyer · Brian Milch · Leslie Kaelbling
- 2008 Spotlight: Multi-Agent Filtering with Infinitely Nested Beliefs
  Luke Zettlemoyer · Brian Milch · Leslie Kaelbling
- 2007 Workshop: The Grammar of Vision: Probabilistic Grammar-Based Models for Visual Scene Understanding and Object Categorization
  Virginia Savova · Josh Tenenbaum · Leslie Kaelbling · Alan Yuille