Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

13 Results

<<   <   Page 1 of 2   >   >>
Poster
Fri 16:30 High-dimensional (Group) Adversarial Training in Linear Regression
Yiling Xie · Xiaoming Huo
Workshop
Does Refusal Training in LLMs Generalize to the Past Tense?
Maksym Andriushchenko · Nicolas Flammarion
Poster
Fri 16:30 Stability and Generalization of Adversarial Training for Shallow Neural Networks with Smooth Activation
Kaibo Zhang · Yunjuan Wang · Raman Arora
Poster
Fri 11:00 Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
Honghao Wei · Xiyue Peng · Arnob Ghosh · Xin Liu
Workshop
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki · Boyi Wei · Yangsibo Huang · Peter Henderson · Florian Tramer · Javier Rando
Workshop
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki · Boyi Wei · Yangsibo Huang · Peter Henderson · Florian Tramer · Javier Rando
Workshop
Cold Posterior Effect towards Adversarial Robustness
Bruce Rushing · Antonios Alexos · Harrison Espino · Nicholas Cohen · Pierre Baldi
Poster
Thu 11:00 Efficient Adversarial Training in LLMs with Continuous Attacks
Sophie Xhonneux · Alessandro Sordoni · Stephan Günnemann · Gauthier Gidel · Leo Schwinn
Workshop
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data
Binghui Li · Yuanzhi Li
Workshop
Adversarial Training based Domain Adaptation for Cross-Subject Emotion Recognition
Sungpil Woo · MUHAMMAD ZUBAIR · Sunhwan Lim · Daeyoung Kim
Poster
Wed 11:00 Self-Supervised Adversarial Training via Diverse Augmented Queries and Self-Supervised Double Perturbation
Ruize Zhang · Sheng Tang · Juan Cao
Poster
Fri 11:00 Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models
Yimeng Zhang · Xin Chen · Jinghan Jia · Yihua Zhang · Chongyu Fan · Jiancheng Liu · Mingyi Hong · Ke Ding · Sijia Liu