Sun 8:50 a.m. - 9:00 a.m.
|
Opening Remarks
(
Intro
)
>
SlidesLive Video
|
馃敆
|
Sun 9:00 a.m. - 9:30 a.m.
|
Cynthia Rudin: The Marriage of Noise and Simplicity
(
Invited Talk
)
>
SlidesLive Video
|
Cynthia Rudin
馃敆
|
Sun 9:30 a.m. - 10:00 a.m.
|
Rich Caruana: The Unexpected Success of GlassBox Learning with Tabular Data
(
Invited Talk
)
>
SlidesLive Video
|
Rich Caruana
馃敆
|
Sun 10:00 a.m. - 11:15 a.m.
|
Poster Session
(
Poster
)
>
|
馃敆
|
Sun 11:15 a.m. - 12:00 p.m.
|
Panel Discussion: Moderator - Kamalika Chaudhuri
(
Panel
)
>
SlidesLive Video
|
馃敆
|
Sun 12:00 p.m. - 1:00 p.m.
|
Lunch
(
Lunch
)
>
|
馃敆
|
Sun 1:00 p.m. - 1:30 p.m.
|
Contributed Talks 1
(
Contributed talks
)
>
SlidesLive Video
|
馃敆
|
Sun 1:30 p.m. - 2:00 p.m.
|
Jiaxin Zhang: Building AI-Native Customer Experiences with Confidence at Intuit
(
Invited talk
)
>
SlidesLive Video
|
Jiaxin Zhang
馃敆
|
Sun 2:00 p.m. - 2:30 p.m.
|
Tong Wang: Using Advanced LLMs to Enhance Smaller LLMs - An Interpretable Knowledge Distillation Approach
(
Invited talk
)
>
SlidesLive Video
|
Tong Wang
馃敆
|
Sun 2:30 p.m. - 3:00 p.m.
|
Coffee Break
(
Coffee Break
)
>
|
馃敆
|
Sun 3:00 p.m. - 3:30 p.m.
|
Neel Nanda: Sparse Autoencoders - Assessing the evidence
(
Invited Talk
)
>
SlidesLive Video
|
Neel Nanda
馃敆
|
Sun 3:30 p.m. - 4:00 p.m.
|
Contributed Talks 2
(
Contributed talks
)
>
SlidesLive Video
|
馃敆
|
Sun 4:00 p.m. - 4:45 p.m.
|
Poster Session 2
(
Poster Session
)
>
|
馃敆
|
Sun 4:45 p.m. - 5:00 p.m.
|
Concluding Remarks
(
Concluding Remarks
)
>
SlidesLive Video
|
馃敆
|
-
|
Clustering and Alignment: Understanding the Training Dynamics in Modular Addition
(
Poster
)
>
link
|
Tiberiu Mu葯at
馃敆
|
-
|
How Do Training Methods Influence the Utilization of Vision Models?
(
Poster
)
>
link
|
Paul Gavrikov 路 Shashank Agnihotri 路 Margret Keuper 路 Janis Keuper
馃敆
|
-
|
[published paper track (COLT 2024)] A Theory of Interpretable Approximations
(
Poster
)
>
link
|
Marco Bressan 路 Nicol貌 Cesa-Bianchi 路 Emmanuel Esposito 路 Yishay Mansour 路 Shay Moran 路 Maximilian Thiessen
馃敆
|
-
|
Enhancing patient stratification and interpretability through class-contrastive and feature attribution techniques
(
Poster
)
>
link
|
Sharday Olowu 路 Neil Lawrence 路 Soumya Banerjee
馃敆
|
-
|
ProtoS-ViT: Visual foundation models for sparse self-explainable classifications
(
Poster
)
>
link
|
Hugues Turbe 路 Mina Bjelogrlic 路 Gianmarco Mengaldo 路 Christian Lovis
馃敆
|
-
|
Competence-Based Analysis of Language Models
(
Poster
)
>
link
|
Adam Davies 路 Jize Jiang 路 Cheng Xiang Zhai
馃敆
|
-
|
Residual Stream Analysis with Multi-Layer SAEs
(
Poster
)
>
link
|
Tim Lawson 路 Lucy Farnik 路 Conor Houghton 路 Laurence Aitchison
馃敆
|
-
|
PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans
(
Poster
)
>
link
|
Giang Nguyen 路 Valerie Chen 路 Mohammad Reza Taesiri 路 Anh Nguyen
馃敆
|
-
|
Latent Concept-based Explanation of NLP Models
(
Poster
)
>
link
|
Xuemin Yu 路 Fahim Dalvi 路 Nadir Durrani 路 Marzia Nouri 路 Hassan Sajjad
馃敆
|
-
|
Measuring the Reliability of Causal Probing Methods: Tradeoffs, Limitations, and the Plight of Nullifying Interventions
(
Poster
)
>
link
|
Marc Canby 路 Adam Davies 路 Chirag Rastogi 路 Julia C Hockenmaier
馃敆
|
-
|
Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks
(
Poster
)
>
link
|
Alba Carballo Castro 路 Sonia Laguna 路 Moritz Vandenhirtz 路 Julia Vogt
馃敆
|
-
|
Subgroup Discovery with the Cox Model
(
Poster
)
>
link
|
Zachary Izzo 路 Iain Melvin
馃敆
|
-
|
Explainable AI-based analysis of human pancreas sections detects traits of type 2 diabetes
(
Poster
)
>
link
|
25 presenters
Lukas Klein 路 Sebastian Ziegler 路 Felicia Gerst 路 Yanni Morgenroth 路 Karol Gotkowski 路 Eyke Sch枚niger 路 Nicole Kipke 路 Annika Seiler 路 Ellen Geibelt 路 Martin Heni 路 Silvia Wagner 路 Silvio Nadalin 路 Falko Fend 路 Daniela Aust 路 Andre Mihaljevic 路 Daniel Hartmann 路 Jurgen Weitz 路 Reiner Schwartzenberg 路 Marius Distler 路 Andreas Birkefeld 路 Susanne Ullrich 路 Paul Jaeger 路 Fabian Isensee 路 Michele Solimena 路 Robert Wagner
馃敆
|
-
|
Explainable Concept Generation through Vision-Language Preference Learning
(
Poster
)
>
link
|
Aditya Taparia 路 Som Sagar 路 Ransalu Senanayake
馃敆
|
-
|
Disentangling Mean Embeddings for Better Diagnostics of Image Generators
(
Poster
)
>
link
|
Sebastian Gruber 路 Pascal Tobias Ziegler 路 Florian Buettner
馃敆
|
-
|
You can remove GPT2's LayerNorm by fine-tuning
(
Poster
)
>
link
|
Stefan Heimersheim
馃敆
|
-
|
Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models
(
Poster
)
>
link
|
Konstantin Donhauser 路 Gemma Moran 路 Aditya Ravuri 路 Kian Kenyon-Dean 路 Kristina Ulicna 路 Cian Eastwood 路 Jason Hartford
馃敆
|
-
|
Words in Motion: Interpreting Motion Forecasting Transformers by Controlling Representations
(
Poster
)
>
link
|
Omer Sahin Tas 路 Royden Wagner
馃敆
|
-
|
Position: XAI needs formal notions of explanation correctness
(
Poster
)
>
link
|
Stefan Haufe 路 Rick Wilming 路 Benedict Clark 路 Rustam Zhumagambetov 路 Danny Panknin 路 Ahcene Boubekki
馃敆
|
-
|
Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
(
Poster
)
>
link
|
Bharat Chandra Yalavarthi 路 Nalini Ratha
馃敆
|
-
|
Error-controlled interaction discovery in deep neural networks
(
Poster
)
>
link
|
Winston Chen 路 Yifan Jiang 路 William Stafford Noble 路 Yang Lu
馃敆
|
-
|
Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits
(
Poster
)
>
link
|
Zhuokai Zhao 路 Takumi Matsuzawa 路 William Irvine 路 Michael Maire 路 Gordon Kindlmann
馃敆
|
-
|
This Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations
(
Poster
)
>
link
|
Chiyu Ma 路 Brandon Zhao 路 Chaofan Chen 路 Cynthia Rudin
馃敆
|
-
|
Bivariate Decision Trees: Smaller, Interpretable, More Accurate
(
Poster
)
>
link
|
Rasul Kairgeldin 路 Miguel A. Carreira-Perpinan
馃敆
|
-
|
ConceptDrift: Uncovering Biases through the Lens of Foundational Models
(
Poster
)
>
link
|
Cristian D Paduraru 路 Elena Burceanu 路 Antonio Barbalau 路 Andrei Nicolicioiu 路 Radu Filipescu
馃敆
|
-
|
A is for Absorption: Studying Sparse Autoencoder Feature Splitting and Absorption in Spelling Tasks
(
Poster
)
>
link
|
James Wilken-Smith 路 Tom谩拧 Dulka 路 David Chanin 路 Hardik Bhatnagar 路 Joseph Bloom
馃敆
|
-
|
Deep quantum graph dreaming: deciphering neural network insights into quantum experiments
(
Poster
)
>
link
|
Tareq Jaouni 路 S枚ren Arlt 路 Carlos Ruiz-Gonzalez 路 Ebrahim Karimi 路 Xuemei Gu 路 Mario Krenn
馃敆
|
-
|
CoS: Enhancing Personalization and Mitigating Bias with Context Steering
(
Poster
)
>
link
|
Sashrika Pandey 路 Jerry He 路 Mariah Schrum 路 Anca Dragan
馃敆
|
-
|
A Concept-Based Explainability Framework for Large Multimodal Models
(
Poster
)
>
link
|
Jayneel Parekh 路 Pegah KHAYATAN 路 Mustafa Shukor 路 Alasdair Newson 路 Matthieu Cord
馃敆
|
-
|
Riemann Sum Optimization for Accurate Integrated Gradients Computation
(
Poster
)
>
link
|
Swadesh Swain 路 Shree Singhi
馃敆
|
-
|
Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations
(
Poster
)
>
link
|
Kola Ayonrinde 路 Michael Pearce
馃敆
|
-
|
Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope Theory
(
Poster
)
>
link
|
Pasan Dissanayake 路 Sanghamitra Dutta
馃敆
|
-
|
Interpretable AI in Human-Machine Systems: Insights from Human-in-the-Loop Product Recommendation Engines
(
Poster
)
>
link
|
Pooria Assadi 路 NIMA SAFAEI
馃敆
|
-
|
SignAttention: On the Interpretability of Transformer Models for Sign Language Translation
(
Poster
)
>
link
|
Pedro Alejandro Dal Bianco 路 Oscar Stanchi 路 Facundo Manuel Quiroga 路 Franco Ronchetti 路 Enzo Ferrante
馃敆
|
-
|
Interactive Semantic Interventions for VLMs: A Human-in-the-Loop Approach to Interpretability
(
Poster
)
>
link
|
Lukas Klein 路 Kenza Amara 路 Carsten L眉th 路 Antonio Foncubierta-Rodriguez 路 Hendrik Strobelt 路 Mennatallah El-Assady 路 Paul Jaeger
馃敆
|
-
|
Right on Time: Revising Time Series Models by Constraining their Explanations
(
Poster
)
>
link
|
Maurice Kraus 路 David Steinmann 路 Antonia W眉st 路 Andre Kokozinski 路 Kristian Kersting
馃敆
|
-
|
Position: In Defense of Post-hoc Explainability
(
Poster
)
>
link
|
Nick Oh
馃敆
|
-
|
Isometry pursuit
(
Poster
)
>
link
|
Samson Koelle 路 Marina Meila
馃敆
|
-
|
From Flexibility to Manipulation: The Slippery Slope of Parameterizing Interpretability Evaluation
(
Poster
)
>
link
|
Kristoffer Wickstr酶m 路 Marina H枚hne 路 Anna Hedstr枚m
馃敆
|
-
|
Your Theory Is Wrong: Using Linguistic Frameworks for LLM Probing
(
Poster
)
>
link
|
Victoria Firsanova
馃敆
|
-
|
Can sparse autoencoders be used to decompose and interpret steering vectors?
(
Poster
)
>
link
|
Harry Mayne 路 Yushi Yang 路 Adam Mahdi
馃敆
|
-
|
Policy-shaped prediction: improving world modeling through interpretability
(
Poster
)
>
link
|
Miles Hutson 路 Isaac Kauvar 路 Nick Haber
馃敆
|
-
|
A Mechanism for Storing Positional Information Without Positional Embeddings
(
Poster
)
>
link
|
Chunsheng Zuo 路 Pavel Guerzhoy 路 Michael Guerzhoy
馃敆
|
-
|
What do we even know about interpretability?
(
Poster
)
>
link
|
Julian Skirzynski 路 Berk Ustun 路 Elena Glassman
馃敆
|
-
|
GAMformer: Exploring In-Context Learning for Generalized Additive Models
(
Poster
)
>
link
|
Andreas Mueller 路 Julien Siems 路 Harsha Nori 路 Rich Caruana 路 Frank Hutter
馃敆
|
-
|
The effect of whitening on explanation performance
(
Poster
)
>
link
|
Benedict Clark 路 Stoyan Karastoyanov 路 Rick Wilming 路 Stefan Haufe
馃敆
|