
Shared Interest: Human Annotations vs. AI Saliency
Angie Boggust · Benjamin Hoover · Arvind Satyanarayan · Hendrik Strobelt

Tue Dec 08 09:00 AM -- 09:20 AM & Wed Dec 09 09:00 AM -- 09:20 AM (PST)
Event URL: http://shared-interest.csail.mit.edu

As deep learning is applied to high-stakes scenarios, it is increasingly important that a model not only makes accurate decisions but also does so for the right reasons. Common explainability methods provide pixel attributions as an explanation for a model's decision on a single image; however, using input-level explanations to understand patterns in model behavior is challenging for large datasets, as it requires selecting and analyzing an interesting subset of inputs. Using human-generated ground truth object locations, we introduce metrics for ranking inputs based on the correspondence between the input’s ground truth location and the explainability method’s explanation region. Our methodology is agnostic to model architecture, explanation method, and dataset, allowing it to be applied to many tasks. We demonstrate our method on two high-profile scenarios: a widely used image classification model and a melanoma prediction model, showing that it surfaces patterns in model behavior by aligning model explanations with human annotations.
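
The abstract does not name the specific metrics, so the following is only a minimal sketch of the general idea, assuming intersection over union (IoU) as the correspondence measure. The function names (`iou`, `rank_by_alignment`, `box`) and the toy regions are hypothetical and are not taken from the authors' implementation. Each region is represented as a set of (row, col) pixel coordinates.

```python
def iou(ground_truth, saliency):
    """Intersection over union of two sets of (row, col) pixel coordinates.

    Assumption: IoU stands in for the abstract's unspecified metrics.
    """
    union = ground_truth | saliency
    if not union:
        return 0.0  # both regions empty: treat as no correspondence
    return len(ground_truth & saliency) / len(union)


def rank_by_alignment(examples):
    """Score (name, ground_truth, saliency) triples; least aligned first."""
    scored = [(iou(gt, sal), name) for name, gt, sal in examples]
    return sorted(scored)


def box(r0, c0, r1, c1):
    """Hypothetical helper: pixels in the half-open box [r0, r1) x [c0, c1)."""
    return {(r, c) for r in range(r0, r1) for c in range(c0, c1)}


# Toy inputs: one human annotation compared with three explanation regions.
examples = [
    ("aligned", box(0, 0, 4, 4), box(0, 0, 4, 4)),
    ("partial", box(0, 0, 4, 4), box(2, 2, 6, 6)),
    ("disjoint", box(0, 0, 4, 4), box(5, 5, 8, 8)),
]

ranking = rank_by_alignment(examples)
for score, name in ranking:
    print(f"{name}: {score:.3f}")
```

Sorting from lowest to highest score puts the inputs where the explanation region disagrees most with the human annotation at the top, which is one way to pick an interesting subset from a large dataset for closer inspection.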

Author Information

Angie Boggust (MIT)
Benjamin Hoover (IBM Research)
Arvind Satyanarayan (MIT CSAIL)
Hendrik Strobelt (IBM Research)
