Timezone: »
Offline reinforcement learning methods have been used to learn policies from observational data for recommending treatment of chronic diseases and interventions in critical care. In these formulations, treatments can be recommended for each patient individually without regard to treatment availability because resources are plentiful and patients are independent of one another. However, in many decision making problems, such as recommending care in resource poor settings, the space of available actions is constrained and the policy must take these constraints into account. We consider the problem of learning policies for personalized treatment when there are limited resources and actions taken for one patient affect the actions available for other patients.
One such sequential decision making problem is hospital bed assignment. Hospitals are complex systems, in which not only the medical care, but also the physical hospital environment affect patients’ outcomes. For CDI, one of the most common healthcare acquired infections, the history of a patient’s bed and room can contribute to their risk of infection because c. diff. spores can linger on surfaces. We consider the problem of assigning patients to hospital beds with the objective of reducing the incidence of Clostridioides difficile infection (CDI) while taking into account the limited availability of beds. Our algorithm first learns a Q-function for assigning beds to an individual patient ignoring bed availability. We use this Q-function to assign patients to beds in order of their risk level, taking the highest value action among those available for each patient. We test our algorithm on simulated data as well as a real dataset of hospitalizations from a large urban hospital.
Author Information
Hallee Wong (MIT)
Maggie Makar (University of Michigan)
Aniruddh Raghu (MIT)
John Guttag (Massachusetts Institute of Technology)
More from the Same Authors
-
2022 : Probabilistic Interactive Segmentation for Medical Images »
Hallee Wong · John Guttag · Adrian Dalca -
2022 : Conditional Contrastive Networks »
Emily Mu · John Guttag -
2022 : Conditional differential measurement error: partial identifiability and estimation »
Pengrun Huang · Maggie Makar -
2022 : UniverSeg: Universal Medical Image Segmentation »
Victor Butoi · Jose Javier Gonzalez Ortiz · Tianyu Ma · John Guttag · Mert Sabuncu · Adrian Dalca -
2022 : Probabilistic Interactive Segmentation for Medical Images »
Hallee Wong · John Guttag · Adrian Dalca -
2023 : Improving Domain Generalization in Contrastive Learning via Domain-Aware Temperature Control »
Robert Lewis · Katie Matton · Rosalind Picard · John Guttag -
2023 : Improving Domain Generalization in Contrastive Learning Using Adaptive Temperature Control »
Katie Matton · Robert Lewis · Rosalind Picard · John Guttag -
2023 Poster: Scale-Space Hypernetworks for Efficient Biomedical Image Analysis »
Jose Javier Gonzalez Ortiz · John Guttag · Adrian Dalca -
2022 : At the Intersection of Conceptual Art and Deep Learning: The End of Signature »
Kathleen Lewis · Divya Shanmugam · Jose Javier Gonzalez Ortiz · Agnieszka Kurant · John Guttag -
2022 : Contrastive Learning of Electrodermal Activity Representations for Stress Detection »
Katie Matton · Robert Lewis · John Guttag · Rosalind Picard -
2022 : Contrastive Pre-Training for Multimodal Medical Time Series »
Aniruddh Raghu · Payal Chandak · Ridwan Alam · John Guttag · Collin Stultz -
2022 : Contrastive Pre-Training for Multimodal Medical Time Series »
Aniruddh Raghu · Payal Chandak · Ridwan Alam · John Guttag · Collin Stultz -
2022 Poster: Learning Concept Credible Models for Mitigating Shortcuts »
Jiaxuan Wang · Sarah Jabbour · Maggie Makar · Michael Sjoding · Jenna Wiens -
2022 Poster: Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare »
Shengpu Tang · Maggie Makar · Michael Sjoding · Finale Doshi-Velez · Jenna Wiens -
2022 Poster: Causally motivated multi-shortcut identification and removal »
Jiayun Zheng · Maggie Makar -
2019 Poster: Learning Conditional Deformable Templates with Convolutional Networks »
Adrian Dalca · Marianne Rakic · John Guttag · Mert Sabuncu