Timezone: »
We present a novel linear program for the approximation of the dynamic programming cost-to-go function in high-dimensional stochastic control problems. LP approaches to approximate DP naturally restrict attention to approximations that are lower bounds to the optimal cost-to-go function. Our program -- the `smoothed approximate linear program -- relaxes this restriction in an appropriate fashion while remaining computationally tractable. Doing so appears to have several advantages: First, we demonstrate superior bounds on the quality of approximation to the optimal cost-to-go function afforded by our approach. Second, experiments with our approach on a challenging problem (the game of Tetris) show that the approach outperforms the existing LP approach (which has previously been shown to be competitive with several ADP algorithms) by an order of magnitude.
Author Information
Vijay Desai
Vivek Farias (Massachusetts Institute of Technology)
Ciamac C Moallemi (Columbia University)
Related Events (a corresponding poster, oral, or spotlight)
-
2009 Poster: A Smoothed Approximate Linear Program »
Wed. Dec 9th 03:00 -- 07:59 AM Room
More from the Same Authors
-
2021 Spotlight: Fair Exploration via Axiomatic Bargaining »
Jackie Baek · Vivek Farias -
2021 : Learning Treatment Effects in Panels with General Intervention Patterns »
Vivek Farias · Andrew Li · Tianyi Peng -
2021 : The Limits to Learning a Diffusion Model »
Jackie Baek · Vivek Farias · ANDREEA GEORGESCU · Retsef Levi · Tianyi Peng · Joshua Wilde · Andrew Zheng -
2021 : The Limits to Learning a Diffusion Model »
Jackie Baek · Vivek Farias · ANDREEA GEORGESCU · Retsef Levi · Tianyi Peng · Joshua Wilde · Andrew Zheng -
2022 Poster: Markovian Interference in Experiments »
Vivek Farias · Andrew Li · Tianyi Peng · Andrew Zheng -
2021 Oral: Learning Treatment Effects in Panels with General Intervention Patterns »
Vivek Farias · Andrew Li · Tianyi Peng -
2021 Poster: Fair Exploration via Axiomatic Bargaining »
Jackie Baek · Vivek Farias -
2021 Poster: Learning Treatment Effects in Panels with General Intervention Patterns »
Vivek Farias · Andrew Li · Tianyi Peng -
2019 Poster: Thompson Sampling with Information Relaxation Penalties »
Seungki Min · Costis Maglaras · Ciamac C Moallemi -
2016 Poster: Optimistic Gittins Indices »
Eli Gutin · Vivek Farias -
2012 Poster: Non-parametric Approximate Dynamic Programming via the Kernel Method »
Nikhil Bhat · Ciamac C Moallemi · Vivek Farias -
2009 Poster: A Data-Driven Approach to Modeling Choice »
Vivek Farias · Srikanth Jagabathula · Devavrat Shah -
2009 Spotlight: A Data-Driven Approach to Modeling Choice »
Vivek Farias · Srikanth Jagabathula · Devavrat Shah