Skip to yearly menu bar Skip to main content


( events)   Timezone:  
Workshop
Thu Dec 08 11:00 PM -- 09:30 AM (PST) @ Room 113
Reliable Machine Learning in the Wild
Dylan Hadfield-Menell · Adrian Weller · David Duvenaud · Jacob Steinhardt · Percy Liang





Workshop Home Page

When will a system that has performed well in the past continue to do so in the future? How do we design such systems in the presence of novel and potentially adversarial input distributions? What techniques will let us safely build and deploy autonomous systems on a scale where human monitoring becomes difficult or infeasible? Answering these questions is critical to guaranteeing the safety of emerging high stakes applications of AI, such as self-driving cars and automated surgical assistants. This workshop will bring together researchers in areas such as human-robot interaction, security, causal inference, and multi-agent systems in order to strengthen the field of reliability engineering for machine learning systems. We are interested in approaches that have the potential to provide assurances of reliability, especially as systems scale in autonomy and complexity. We will focus on four aspects — robustness (to adversaries, distributional shift, model mis-specification, corrupted data); awareness (of when a change has occurred, when the model might be mis-calibrated, etc.); adaptation (to new situations or objectives); and monitoring (allowing humans to meaningfully track the state of the system). Together, these will aid us in designing and deploying reliable machine learning systems.

Opening Remarks (Talk)
Rules for Reliable Machine Learning (Invited Talk)
What's your ML Test Score? A rubric for ML production systems (Contributed Talk)
Poster Spotlights I (Spotlight)
Robust Learning and Inference (Invited Talk)
Automated versus do-it-yourself methods for causal inference: Lessons learned from a data analysis competition (Invited Talk)
Robust Covariate Shift Classification Using Multiple Feature Views (Contributed Talk)
Poster Spotlights II (Spotlight)
Doug Tygar (Invited Talk)
Adversarial Examples and Adversarial Training (Invited Talk)
Summoning Demons: The Pursuit of Exploitable Bugs in Machine Learning (Contributed Talk)
Poster Spotlights III (Spotlight)
Poster Session
Learning Reliable Objectives (Invited Talk)
Building and Validating the AI behind the Next-Generation Aircraft Collision Avoidance System (Invited Talk)
Online Prediction with Selfish Experts (Contributed Talk)
TensorFlow Debugger: Debugging Dataflow Graphs for Machine Learning (Contributed Talk)
What are the challenges to making machine learning reliable in practice? (Panel Discussion)