Case Study: Applying Decision Focused Learning in the Real World
Shresth Verma · Aditya Mate · Kai Wang · Aparna Taneja · Milind Tambe
Event URL: https://openreview.net/forum?id=Nmfuvm6yTVI
Many real-world optimization problems with unknown parameters are solved using the predict-then-optimize framework, in which a learnt model predicts the parameters of an optimization problem that is subsequently solved using an optimization algorithm. However, this approach maximises predictive accuracy rather than the quality of the final solution. Decision Focused Learning (DFL) solves this objective mismatch by integrating the optimization problem into the learning pipeline. Previous work has shown the applicability of DFL only in simulation settings. In our work, we consider the optimization problem of scheduling limited live service calls in Maternal and Child Health Awareness Programs and model it using Restless Multi-Armed Bandits (RMAB). We present results from a large-scale field study consisting of 9000 beneficiaries and demonstrate that DFL cuts $\sim 200\%$ more call engagement drops than previous methods. Through detailed post-hoc analysis, we show that high predictive accuracy of problem parameters is not sufficient to ensure a well-performing system. We also demonstrate that DFL makes optimal decision choices by learning a better decision boundary between the RMAB actions, and by correctly predicting the parameters which contribute most to the final decision outcome.
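The objective mismatch described in the abstract can be illustrated with a small sketch. This is not the authors' method and does not model RMABs: it assumes a toy "pick the top-k items by predicted benefit" step as a stand-in for the optimization problem, and every number and function name (`mse`, `top_k`, `decision_value`) is hypothetical. It only shows that a predictor with lower mean squared error can still lead to a worse decision than a less accurate one.

```python
# Toy illustration (hypothetical data): lower prediction error does not
# guarantee a better downstream decision.

def mse(pred, true):
    """Mean squared error between predicted and true parameters."""
    return sum((p - t) ** 2 for p, t in zip(pred, true)) / len(true)

def top_k(pred, k):
    """Stand-in 'optimizer': choose the k items with the highest predictions."""
    order = sorted(range(len(pred)), key=lambda i: pred[i], reverse=True)
    return order[:k]

def decision_value(pred, true, k):
    """True benefit obtained when acting on the predicted parameters."""
    return sum(true[i] for i in top_k(pred, k))

# Hypothetical true benefit of intervening on each of three items.
true_benefit = [0.5, 0.4, 0.1]

# Predictor A: very close to the truth, but it swaps the order of items 0 and 1.
pred_a = [0.44, 0.46, 0.10]
# Predictor B: much larger errors, but the ranking is preserved.
pred_b = [0.90, 0.60, 0.30]

k = 1  # budget: only one item can be selected
for name, pred in [("A", pred_a), ("B", pred_b)]:
    print(name, "MSE:", round(mse(pred, true_benefit), 4),
          "decision value:", decision_value(pred, true_benefit, k))
```

In this sketch, predictor A has the smaller error yet selects the item with lower true benefit, while predictor B gets the ranking right and so makes the better choice. Training against the decision value instead of the prediction error is the general idea behind DFL; the RMAB setting in the paper is considerably more involved than this top-k example.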
Author Information
Shresth Verma (Google Research India)
Aditya Mate (Harvard University)
Kai Wang (Harvard University)
Aparna Taneja (Google)
Milind Tambe (Harvard University/Google Research)
More from the Same Authors
- 2021 Spotlight: Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning
  Kai Wang · Sanket Shah · Haipeng Chen · Andrew Perrault · Finale Doshi-Velez · Milind Tambe
- 2021: Your Bandit Model is Not Perfect: Introducing Robustness to Restless Bandits Enabled by Deep Reinforcement Learning
  Jackson Killian · Lily Xu · Arpita Biswas · Milind Tambe
- 2022: On the Pitfalls of Visual Learning in Referential Games
  Shresth Verma
- 2022: Invited Talk: Milind Tambe
  Milind Tambe
- 2022 Poster: Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses
  Sanket Shah · Kai Wang · Bryan Wilder · Andrew Perrault · Milind Tambe
- 2021: Restless Bandits in the Field: Real-World Study for Improving Maternal and Child Health Outcomes
  Aditya Mate
- 2021: Invite Talk Q&A
  Milind Tambe · Tejumade Afonja · Paula Rodriguez Diaz
- 2021: Invited Talk: AI for Social Impact: Results from Deployments for Public Health
  Milind Tambe
- 2021 Poster: Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning
  Kai Wang · Sanket Shah · Haipeng Chen · Andrew Perrault · Finale Doshi-Velez · Milind Tambe
- 2020: Q/A and Panel Discussion for People-Earth with Dan Kammen and Milind Tambe
  Daniel Kammen · Milind Tambe · Giulio De Leo · Mayur Mudigonda · Surya Karthik Mukkavilli
- 2020: Q/A and Discussion
  Surya Karthik Mukkavilli · Mayur Mudigonda · Milind Tambe
- 2020: Milind Tambe
  Milind Tambe
- 2020 Poster: Automatically Learning Compact Quality-aware Surrogates for Optimization Problems
  Kai Wang · Bryan Wilder · Andrew Perrault · Milind Tambe
- 2020 Spotlight: Automatically Learning Compact Quality-aware Surrogates for Optimization Problems
  Kai Wang · Bryan Wilder · Andrew Perrault · Milind Tambe
- 2020 Poster: Collapsing Bandits and Their Application to Public Health Intervention
  Aditya Mate · Jackson Killian · Haifeng Xu · Andrew Perrault · Milind Tambe