Timezone: »
Deep neural networks (DNNs) have proven to be powerful predictors and are widely used for various tasks. Credible uncertainty estimation of their predictions, however, is crucial for their deployment in many risk-sensitive applications. In this paper we present a novel and simple attack, which unlike adversarial attacks, does not cause incorrect predictions but instead cripples the network's capacity for uncertainty estimation. The result is that after the attack, the DNN is more confident of its incorrect predictions than about its correct ones without having its accuracy reduced. We present two versions of the attack. The first scenario focuses on a black-box regime (where the attacker has no knowledge of the target network) and the second scenario attacks a white-box setting. The proposed attack is only required to be of minuscule magnitude for its perturbations to cause severe uncertainty estimation damage, with larger magnitudes resulting in completely unusable uncertainty estimations.We demonstrate successful attacks on three of the most popular uncertainty estimation methods: the vanilla softmax score, Deep Ensembles and MC-Dropout. Additionally, we show an attack on SelectiveNet, the selective classification architecture. We test the proposed attack on several contemporary architectures such as MobileNetV2 and EfficientNetB0, all trained to classify ImageNet.
Author Information
Ido Galil (Technion)
Ran El-Yaniv (Technion & Deci.AI)
More from the Same Authors
-
2023 Poster: Window-Based Distribution Shift Detection for Deep Neural Networks »
Guy Bar-Shalom · Yonatan Geifman · Ran El-Yaniv -
2022 Poster: TransBoost: Improving the Best ImageNet Performance using Deep Transduction »
Omer Belhasin · Guy Bar-Shalom · Ran El-Yaniv -
2019 Poster: Deep Active Learning with a Neural Architecture Search »
Yonatan Geifman · Ran El-Yaniv -
2018 Poster: Deep Anomaly Detection Using Geometric Transformations »
Izhak Golan · Ran El-Yaniv -
2017 Poster: Multi-Objective Non-parametric Sequential Prediction »
Guy Uziel · Ran El-Yaniv -
2017 Poster: Selective Classification for Deep Neural Networks »
Yonatan Geifman · Ran El-Yaniv -
2012 Poster: Pointwise Tracking the Optimal Regression Function »
Yair Wiener · Ran El-Yaniv -
2012 Spotlight: Pointwise Tracking the Optimal Regression Function »
Yair Wiener · Ran El-Yaniv -
2011 Poster: Selective Prediction of Financial Trends with Hidden Markov Models »
Dmitry Pidan · Ran El-Yaniv -
2011 Poster: Agnostic Selective Classification »
Yair Wiener · Ran El-Yaniv -
2006 Poster: Optimal Single-Class Classification Strategies »
Ran El-Yaniv · Mordechai Nisenson