Timezone: »
A dynamic treatment regime (DTR) consists of a sequence of decision rules, one per stage of intervention, that dictates how to determine the treatment assignment to patients based on evolving treatments and covariates' history. These regimes are particularly effective for managing chronic disorders and is arguably one of the key aspects towards more personalized decision-making. In this paper, we investigate the online reinforcement learning (RL) problem for selecting optimal DTRs provided that observational data is available. We develop the first adaptive algorithm that achieves near-optimal regret in DTRs in online settings, without any access to historical data. We further derive informative bounds on the system dynamics of the underlying DTR from confounded, observational data. Finally, we combine these results and develop a novel RL algorithm that efficiently learns the optimal DTR while leveraging the abundant, yet imperfect confounded observations.
Author Information
Junzhe Zhang (Columbia University)
Elias Bareinboim (Columbia University)
More from the Same Authors
-
2019 Poster: Efficient Identification in Linear Structural Causal Models with Instrumental Cutsets »
Daniel Kumor · Bryant Chen · Elias Bareinboim -
2019 Poster: Characterization and Learning of Causal Graphs with Latent Variables from Soft Interventions »
Murat Kocaoglu · Amin Jaber · Karthikeyan Shanmugam · Elias Bareinboim -
2019 Poster: Identification of Conditional Causal Effects under Markov Equivalence »
Amin Jaber · Jiji Zhang · Elias Bareinboim -
2019 Spotlight: Identification of Conditional Causal Effects under Markov Equivalence »
Amin Jaber · Jiji Zhang · Elias Bareinboim -
2018 : Datasets and Benchmarks for Causal Learning »
Csaba Szepesvari · Isabelle Guyon · Nicolai Meinshausen · David Blei · Elias Bareinboim · Bernhard Schölkopf · Pietro Perona -
2018 : Causality and Transfer Learning »
Elias Bareinboim -
2018 Poster: Structural Causal Bandits: Where to Intervene? »
Sanghack Lee · Elias Bareinboim -
2018 Poster: Equality of Opportunity in Classification: A Causal Approach »
Junzhe Zhang · Elias Bareinboim -
2017 Poster: Experimental Design for Learning Causal Graphs with Latent Variables »
Murat Kocaoglu · Karthikeyan Shanmugam · Elias Bareinboim -
2016 : The Data-Fusion Problem: Causal Inference and Reinforcement Learning »
Elias Bareinboim -
2015 Poster: Bandits with Unobserved Confounders: A Causal Approach »
Elias Bareinboim · Andrew Forney · Judea Pearl -
2014 Poster: Transportability from Multiple Environments with Limited Experiments: Completeness Results »
Elias Bareinboim · Judea Pearl -
2014 Spotlight: Transportability from Multiple Environments with Limited Experiments: Completeness Results »
Elias Bareinboim · Judea Pearl -
2013 Poster: Transportability from Multiple Environments with Limited Experiments »
Elias Bareinboim · Sanghack Lee · Vasant Honavar · Judea Pearl -
2013 Tutorial: Causes and Counterfactuals: Concepts, Principles and Tools. »
Judea Pearl · Elias Bareinboim