Skip to yearly menu bar Skip to main content

Workshop

I Can’t Believe It’s Not Better: Understanding Deep Learning Through Empirical Falsification

Arno Blaas · Sahra Ghalebikesabi · Javier Antorán · Fan Feng · Melanie F. Pradier · Ian Mason · David Rohde

Project Page [ Contact: cant.believe.it.is.not.better+workshop@gmail.com ]

Abstract

Deep learning has flourished in the last decade. Recent breakthroughs have shown stunning results, and yet, researchers still cannot fully explain why neural networks generalise so well or why some architectures or optimizers work better than others. There is a lack of understanding of existing deep learning systems, which led NeurIPS 2017 test of time award winners Rahimi & Recht to compare machine learning with alchemy and to call for the return of the 'rigour police'.

Despite excellent theoretical work in the field, deep neural networks are so complex that they might not be able to be fully comprehended with theory alone. Unfortunately, the experimental alternative - rigorous work that neither proves a theorem nor proposes a new method - is currently under-valued in the machine learning community.

To change this, this workshop aims to promote the method of empirical falsification.

We solicit contributions which explicitly formulate a hypothesis related to deep learning or its applications (based on first principles or prior work), and then empirically falsify it through experiments. We further encourage submissions to go a layer deeper and investigate the causes of an initial idea not working as expected. This workshop will showcase how negative results offer important learning opportunities for deep learning researchers, possibly far greater than the incremental improvements found in conventional machine learning papers!

Why empirical falsification? In the words of Karl Popper, "It is easy to obtain confirmations, or verifications, for nearly every theory—if we look for confirmations. Confirmations should count only if they are the result of risky predictions."
We believe that similarly to physics, which seeks to understand nature, the complexity of deep neural networks makes any understanding about them built inductively likely to be brittle.

The most reliable method with which physicists can probe nature is by experimentally validating (or not) the falsifiable predictions made by their existing theories. We posit the same could be the case for deep learning and believe that the task of understanding deep neural networks would benefit from adopting the approach of empirical falsification.

Video

Chat is not available.

Schedule

Timezone: America/Los_Angeles

6:15 AM

Welcome and Opening Remarks

Video

6:25 AM

Introduction to ICBINB

Video

6:30 AM

Jeffrey Bowers: Researchers Comparing DNNs to Brains Need to Adopt Standard Methods of Science.

Jeffrey Bowers

Video

6:55 AM

Jeffrey Bowers: Researchers Comparing DNNs to Brains Need to Adopt Standard Methods of Science.

Jeffrey Bowers

7:00 AM

Lawrence Udeigwe: On the Elements of Theory in Neuroscience.

Lawrence Udeigwe

Video

7:25 AM

Lawrence Udeigwe: On the Elements of Theory in Neuroscience.

Lawrence Udeigwe

7:30 AM

Spotlight 1 - Elre Talea Oldewage: Adversarial Attacks are a Surprisingly Strong Baseline for Poisoning Few-Shot Meta-Learners

Elre Oldewage

Video

7:35 AM

Spotlight 2 - Abhishek Moturu: Volume-based Performance not Guaranteed by Promising Patch-based Results in Medical Imaging

Abhishek Moturu

Video

7:40 AM

Spotlight 3 -Rebecca Saul: Lempel-Ziv Networks

Rebecca Saul

Video

7:45 AM

Spotlight 4 -Tatiana Likhomanenko: Continuous Soft Pseudo-Labeling in ASR

Tatiana Likhomanenko

Video

7:50 AM

Spotlight 5 - Gabriel Loaiza-Ganem: Denoising Deep Generative Models

Gabriel Loaiza-Ganem

Video

7:55 AM

Spotlight 6 - Sheheryar Zaidi: When Does Re-initialization Work?

Sheheryar Zaidi

Video

8:00 AM

Coffee Break (and Poster Session Set Up)

8:30 AM

Poster Session

9:30 AM

ICBINB Virtual Meet-Up

9:30 AM

ICBINB In-Person Meet-Up

10:00 AM

Lunch break

11:00 AM

Kathrin Grosse: On the Limitations of Bayesian Uncertainty in Adversarial Settings.

Kathrin Grosse

Video

11:25 AM

Kathrin Grosse: On the Limitations on Bayesian Uncertainty in Adversarial Settings.

Kathrin Grosse

11:30 AM

Andrew Gordon Wilson: When Bayesian Orthodoxy Can Go Wrong: Model Selection and Out-of-Distribution Generalization

Andrew Gordon Wilson

Video

11:55 AM

Andrew Gordon Wilson: When Bayesian Orthodoxy Can Go Wrong: Model Selection and Out-of-Distribution Generalization

Andrew Gordon Wilson

12:00 PM

Kun Zhang: Causal Principles Meet Deep Learning: Successes and Challenges.

Kun Zhang

Video

12:25 PM

Kun Zhang: Causal Principles Meet Deep Learning: Successes and Challenges.

Kun Zhang

12:30 PM

Piersilvio De Bartolomeis: Certified defences hurt generalisation

Piersilvio De Bartolomeis

Video

12:40 PM

Simran Kaur: On the Maximum Hessian Eigenvalue and Generalization

Simran Kaur

Video

12:50 PM

Taiga Abe: The Best Deep Ensembles Sacrifice Predictive Diversity

Video

1:00 PM

Coffee break

1:30 PM

Fanny Yang: Surprising failures of standard practices in ML when the sample size is small.

Fanny Yang

Video

1:55 PM

Fanny Yang: Surprising failures of standard practices in ML when the sample size is small.

Fanny Yang

2:00 PM

Panel Discussion - What Role Should Empiricism Play in Building AI?

Video

2:50 PM

Closing remarks & awards

Video

Exploring the Long-Term Generalization of Counting Behavior in RNNs

Nadine El-Naggar · Pranava Madhyastha · Tillman Weyde

Scaling Laws Beyond Backpropagation

Matthew Filipovich · Alessandro Cappelli · Daniel Hesslow · Julien Launay

Dynamic Statistical Learning with Engineered Features Outperforms Deep Neural Networks for Smart Building Cooling Load Predictions

Yiren Liu · S. Joe Qin · Xiangyu Zhao · Yixiao HUANG · Shenglong Yao · Guo Han

Spread Love Not Hate: Undermining the Importance of Hateful Pre-training for Hate Speech Detection

Omkar Gokhale · Aditya Kane · Shantanu Patankar · Tanmay Chavan · Raviraj Joshi

On Equivalences between Weight and Function-Space Langevin Dynamics

Ziyu Wang · Yuhao Zhou · Ruqi Zhang · Jun Zhu

Pitfalls of conditional computation for multi-modal learning

Ivaxi Sheth · Mohammad Havaei · Samira Ebrahimi Kahou

The Effect of Data Dimensionality on Neural Network Prunability

Zachary Ankner · Alex Renda · Gintare Karolina Dziugaite · Jonathan Frankle · Tian Jin

Spike-and-Slab Probabilistic Backpropagation: When Smarter Approximations Make No Difference

Evan Ott · Sinead Williamson

Can We Forecast And Detect Earthquakes From Heterogeneous Multivariate Time Series Data?

Asadullah Hill Galib · Luke Cullen · Andy Smith · Debvrat Varshney · Edward Brown · Peter Chi · Xiangning Chu · Filip Svoboda

Surgical Fine-Tuning Improves Adaptation to Distribution Shifts

Yoonho Lee · Annie Chen · Fahim Tajwar · Ananya Kumar · Huaxiu Yao · Percy Liang · Chelsea Finn

Models with Conditional Computation Learn Suboptimal Solutions

Mohammed Muqeeth · Haokun Liu · Colin Raffel

On The Diversity of ASR Hypotheses In Spoken Language Understanding

Surya Kant Sahu · Swaraj Dalmia

Lessons from Developing Multimodal Models with Code and Developer Interactions

Nicholas Botzer · Yasanka Horawalavithana · Tim Weninger · Svitlana Volkova

An Empirical Analysis of the Advantages of Finite v.s. Infinite Width Bayesian Neural Networks

Jiayu Yao · Yaniv Yacoby · Beau Coker · Weiwei Pan · Finale Doshi-Velez

Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations

Yongyi Yang · Jacob Steinhardt · Wei Hu

Model Stitching: Looking For Functional Similarity Between Representations

Adriano Hernandez · Rumen Dangovski · Peter Y. Lu

On the Sparsity of Image Super-resolution Network

Chenyu Dong · Hailong Ma · Jinjin Gu · Ruofan Zhang · Jieming Li · Chun Yuan

Paradigmatic Revolutions in Computer Vision

Andreas Kriegler

The curse of (non)convexity: The case of an Optimization-Inspired Data Pruning algorithm

Fadhel Ayed · Soufiane Hayou

An Empirical Study on Clustering Pretrained Embeddings: Is Deep Strictly Better?

Tyler Scott · Ting Liu · Michael Mozer · Andrew Gallagher

When Are Graph Neural Networks Better Than Structure-Agnostic Methods?

Diana Gomes · Fred RL · Kyriakos Efthymiadis · Ann Nowe · Peter Vrancx

The (Un)Scalability of Heuristic Approximators for NP-Hard Search Problems

Sumedh Dattaguru Pendurkar · Taoan Huang · Sven Koenig · Guni Sharon

DARTFormer: Finding The Best Type Of Attention

Jason Brown · Yiren Zhao · I Shumailov · Robert Mullins

Exploring the Sharpened Cosine Similarity

Skyler Wu · Fred Lu · Edward Raff · James Holt

Are you using test log-likelihood correctly?

Sameer Deshpande · Soumya Ghosh · Tin Nguyen · Tamara Broderick

Identifying the Context Shift between Test Benchmarks and Production Data

Matt Groh

On the performance of Direct Loss Minimization for Bayesian Neural Networks

Yadi Wei · Roni Khardon

Analysing the Relations of Misclassified Inputs Between Models

Hadar Shavit

How many trained neural networks are needed for influence estimation in modern deep learning?

Sasha (Alexandre) Doubov · Tianshi Cao · David Acuna · Sanja Fidler

Much Easier Said Than Done: Falsifying the Causal Relevance of Decoding Methods

Lucas Hayne · Abhijit Suresh · Hunar Jain · Rahul Kumar Mohan Kumar · R. McKell Carter

Evaluating Robust Perceptual Losses for Image Reconstruction

Tobias Uelwer · Felix Michels · Oliver De Candido

The Best Deep Ensembles Sacrifice Predictive Diversity

Taiga Abe · Estefany Kelly Buchanan · Geoff Pleiss · John Cunningham

Volume-based Performance not Guaranteed by Promising Patch-based Results in Medical Imaging

Abhishek Moturu · Sayali Joshi · Andrea Doria · Anna Goldenberg

Continuous Soft Pseudo-Labeling in ASR

Tatiana Likhomanenko · Ronan Collobert · Navdeep Jaitly · Samy Bengio

On the Maximum Hessian Eigenvalue and Generalization

Simran Kaur · Jeremy M Cohen · Zachary Lipton

Denoising Deep Generative Models

Gabriel Loaiza-Ganem · Brendan Ross · Luhuan Wu · John Cunningham · Jesse Cresswell · Anthony Caterini

Adversarial Attacks are a Surprisingly Strong Baseline for Poisoning Few-Shot Meta-Learners

Elre Oldewage · John Bronskill · Richard Turner

Certified defences hurt generalisation

Piersilvio De Bartolomeis · Jacob Clarysse · Fanny Yang · Amartya Sanyal

When Does Re-initialization Work?

Sheheryar Zaidi · Tudor Berariu · Hyunjik Kim · Jorg Bornschein · Claudia Clopath · Yee Whye Teh · Razvan Pascanu

Lempel-Ziv Networks

Rebecca Saul · Mohammad Mahmudul Alam · John Hurwitz · Edward Raff · Tim Oates · James Holt