Timezone: »
Finding systematic deviations of an outcome of interest in data and models is an important goal of trustworthy and socially responsible AI. To understand systematic deviations at a subgroup level, it is important to look beyond \emph{predefined} groups and consider all possible subgroups for analysis. Of course this exhaustive enumeration is not possible and there needs to be a balance of exploratory and confirmation analysis in socially-responsible AI. In this paper we compare recently proposed methods for detecting systematic deviations in an outcome of interest at the subgroup level across three socially-relevant data sets. Furthermore, we show the importance of looking through all possible subgroups for systematic deviations by comparing detected patterns using only protected attributes against patterns detected using the entire search space. One interesting pattern found in the OULAD dataset is that while having a high course load and not being from the highest socio-economic decile of UK regions makes students 2.3 times more likely to fail or withdraw from courses, being from Ireland or Wales mitigates this risk by 37%. This pattern may have been missed if we focused our analysis on the protected groups of gender and disability only. Python code for all methods, including the most recently proposed "AutoStrat" are available on open-sourced code repositories.
Author Information
Adebayo Oshingbesan (IBM Research)
Winslow Omondi (International Business Machines)
Girmaw Abebe Tadesse (IBM Research | Africa)
Celia Cintas (IBM Research Africa)
Skyler D. Speakman (IBM Research | Africa)
Skyler Speakman is a Research Scientist at IBM Research -- Africa. His projects use data science to impact the lives of millions of people on the continent. He believes that data collected through phones and drones will fundamentally change service delivery and African development in the next decade. Skyler completed a Ph.D. in Information Systems at Carnegie Mellon University as well as a M.S. in Machine Learning. He also holds masters in Mathematics, Statistics, and Public Policy. He lives in Nairobi, Kenya with his wife and two young sons.
More from the Same Authors
-
2021 : Automated Supervised Feature Selection for Differentiated Patterns of Care »
Catherine Wanjiru · William Ogallo · Girmaw Abebe Tadesse · Charles Wachira · Isaiah Onando Mulang' · Aisha Walcott-Bryant -
2021 : Post-discovery Analysis of Anomalous Subsets »
Isaiah Onando Mulang' · William Ogallo · Girmaw Abebe Tadesse · Aisha Walcott-Bryant -
2022 : Panel »
Pin-Yu Chen · Alex Gittens · Bo Li · Celia Cintas · Hilde Kuehne · Payel Das -
2020 : Unsupervised Discovery of Subgroups with Anomalous Maternal and Neonatal Outcomes with WHO´s Safe Childbirth Checklist as Intervention - Girmaw Abebe Tadesse »
Girmaw Abebe Tadesse -
2020 : Spotlight on women at IBM Research »
Lisa Amini · Francesca Rossi · Celia Cintas · Payel Das -
2019 : Poster Session »
Nathalie Baracaldo · Seth Neel · Tuyen Le · Dan Philps · Suheng Tao · Sotirios Chatzis · Toyo Suzumura · Wei Wang · WENHANG BAO · Solon Barocas · Manish Raghavan · Samuel Maina · Reginald Bryant · Kush Varshney · Skyler D. Speakman · Navdeep Gill · Nicholas Schmidt · Kevin Compher · Naveen Sundar Govindarajulu · Vivek Sharma · Praneeth Vepakomma · Tristan Swedish · Jayashree Kalpathy-Cramer · Ramesh Raskar · Shihao Zheng · Mykola Pechenizkiy · Marco Schreyer · Li Ling · Chirag Nagpal · Robert Tillman · Manuela Veloso · Hanjie Chen · Xintong Wang · Michael Wellman · Matthew van Adelsberg · Ben Wood · Hans Buehler · Mahmoud Mahfouz · Antonios Alexos · Megan Shearer · Antigoni Polychroniadou · Lucia Larise Stavarache · Dmitry Efimov · Johnston P Hall · Yukun Zhang · Emily Diana · Sumitra Ganesh · Vineeth Ravi · · Swetasudha Panda · Xavier Renard · Matthew Jagielski · Yonadav Shavit · Joshua Williams · Haoran Wei · Shuang (Sophie) Zhai · Xinyi Li · Hongda Shen · Daiki Matsunaga · Jaesik Choi · Alexis Laignelet · Batuhan Guler · Jacobo Roa Vicens · Ajit Desai · Jonathan Aigrain · Robert Samoilescu -
2017 : Skyler Speakman (IBM Research Africa): Three Population Covariate Shift for Mobile Phone-based Credit Scoring »
Skyler D. Speakman