Timezone: »
Large language models can produce powerful contextual representations that lead to improvements across many NLP tasks. Though these models can comprise undesired inductive biases, it is challenging to identify what information they encode in their learned representations. Since the model-internal reasoning process is often guided by a sequence of learned self-attention mechanisms, it is paramount to be able to explore what the attention has learned. While static analyses for this can lead to targeted insights, interactive tools can be more dynamic and help humans gain an intuition for the model-internal reasoning process. We present exBERT, a tool that helps gather insights into the meaning of contextual representations. exBERT matches a human-specified input to similar contexts in a large annotated dataset. By aggregating these annotations across all similar contexts, exBERT can help to explain what each attention-head has learned.
Author Information
Benjamin Hoover (IBM)
Hendrik Strobelt (IBM Research)

Hendrik Strobelt is Senior Research Scientist at IBM Research (Cambridge, MA) and the Explainability Lead at the MIT-IBM Watson AI Lab. His recent research is on visualization for and human collaboration with AI models to foster explainability and intuition. His work involves NLP models and generative models while he is advocating to utilize a mix of data modalities to solve real-world problems. His research is applied to tasks in machine learning, in NLP, in the biomedical domain, and in chemistry. Hendrik joined IBM in 2017 after postdoctoral positions at Harvard SEAS and NYU Tandon. He received a Ph.D. (Dr. rer. nat.) from the University of Konstanz in computer science (Visualization) and holds an MSc (Diplom) in computer science from TU Dresden. His work has been published at venues like IEEE VIS, ICLR, ACM Siggraph, ACL, NeurIPS, ICCV, PNAS, Nature BME, or Science Advances. He received multiple best paper/honorable mention awards at EuroVis, BioVis, VAST, CHI, ACL Demo, or NeurIPS demo. He received the Lohrmann medal from TU Dresden as the highest student honor. Hendrik has served in program committees and organization committees for IEEE VIS, BioVis, EuroVis. He served on organization committees for IEEE VIS, VISxAI, ICLR, ICML, NeurIPS. Hendrik is visiting researcher at MIT CSAIL.
Sebastian Gehrmann (Harvard)
More from the Same Authors
-
2022 : A Universal Abstraction for Hierarchical Hopfield Networks »
Benjamin Hoover · Duen Horng Chau · Hendrik Strobelt · Dmitry Krotov -
2022 : A Universal Abstraction for Hierarchical Hopfield Networks »
Benjamin Hoover · Duen Horng Chau · Hendrik Strobelt · Dmitry Krotov -
2023 Poster: Energy-based Attention for Associative Memory »
Benjamin Hoover · Yuchen Liang · Bao Pham · Rameswar Panda · Hendrik Strobelt · Duen Horng Chau · Mohammed Zaki · Dmitry Krotov -
2022 : A Universal Abstraction for Hierarchical Hopfield Networks »
Benjamin Hoover · Duen Horng Chau · Hendrik Strobelt · Dmitry Krotov -
2021 : Interactive Exploration for 60 Years of AI Research »
Hendrik Strobelt · Benjamin Hoover -
2020 Poster: CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models »
Vijil Chenthamarakshan · Payel Das · Samuel Hoffman · Hendrik Strobelt · Inkit Padhi · Kar Wai Lim · Benjamin Hoover · Matteo Manica · Jannis Born · Teodoro Laino · Aleksandra Mojsilovic -
2020 Demonstration: LMdiff: A Visual Diff Tool to Compare LanguageModels »
Hendrik Strobelt · Benjamin Hoover · Arvind Satyanarayan · Sebastian Gehrmann -
2020 Social: A Decemberfest on Trustworthy AI Research - An overview and panel discussion with virtual drinks and Bretzels »
Vanessa Faber · Hendrik Strobelt -
2020 Demonstration: Shared Interest: Human Annotations vs. AI Saliency »
Angie Boggust · Benjamin Hoover · Arvind Satyanarayan · Hendrik Strobelt