Timezone: »
Poster
Non-parametric Approximate Dynamic Programming via the Kernel Method
Nikhil Bhat · Ciamac C Moallemi · Vivek Farias
Thu Dec 06 02:00 PM -- 12:00 AM (PST) @ Harrah’s Special Events Center 2nd Floor
This paper presents a novel non-parametric approximate dynamic programming (ADP) algorithm that enjoys graceful, dimension-independent approximation and sample complexity guarantees. In particular, we establish both theoretically and computationally that our proposal can serve as a viable alternative to state-of-the-art parametric ADP algorithms, freeing the designer from carefully specifying an approximation architecture. We accomplish this by developing a kernel-based mathematical program for ADP. Via a computational study on a controlled queueing network, we show that our non-parametric procedure is competitive with parametric ADP approaches.
Author Information
Nikhil Bhat (Columbia University)
Ciamac C Moallemi (Columbia University)
Vivek Farias (Massachusetts Institute of Technology)
More from the Same Authors
-
2021 Spotlight: Fair Exploration via Axiomatic Bargaining »
Jackie Baek · Vivek Farias -
2021 : Learning Treatment Effects in Panels with General Intervention Patterns »
Vivek Farias · Andrew Li · Tianyi Peng -
2021 : The Limits to Learning a Diffusion Model »
Jackie Baek · Vivek Farias · ANDREEA GEORGESCU · Retsef Levi · Tianyi Peng · Joshua Wilde · Andrew Zheng -
2021 : The Limits to Learning a Diffusion Model »
Jackie Baek · Vivek Farias · ANDREEA GEORGESCU · Retsef Levi · Tianyi Peng · Joshua Wilde · Andrew Zheng -
2022 Poster: Markovian Interference in Experiments »
Vivek Farias · Andrew Li · Tianyi Peng · Andrew Zheng -
2021 Oral: Learning Treatment Effects in Panels with General Intervention Patterns »
Vivek Farias · Andrew Li · Tianyi Peng -
2021 Poster: Fair Exploration via Axiomatic Bargaining »
Jackie Baek · Vivek Farias -
2021 Poster: Learning Treatment Effects in Panels with General Intervention Patterns »
Vivek Farias · Andrew Li · Tianyi Peng -
2019 Poster: Thompson Sampling with Information Relaxation Penalties »
Seungki Min · Costis Maglaras · Ciamac C Moallemi -
2016 Poster: Optimistic Gittins Indices »
Eli Gutin · Vivek Farias -
2009 Poster: A Data-Driven Approach to Modeling Choice »
Vivek Farias · Srikanth Jagabathula · Devavrat Shah -
2009 Spotlight: A Data-Driven Approach to Modeling Choice »
Vivek Farias · Srikanth Jagabathula · Devavrat Shah -
2009 Poster: A Smoothed Approximate Linear Program »
Vijay Desai · Vivek Farias · Ciamac C Moallemi -
2009 Spotlight: A Smoothed Approximate Linear Program »
Vijay Desai · Vivek Farias · Ciamac C Moallemi