

Response-Based Knowledge Distillation for Multilingual Jailbreak Prevention Unwittingly Compromises Safety

Max Zhang ⋅ Derek Liu ⋅ Kai Zhang ⋅ Joshua Franco ⋅ Haihao Liu ⋅ Kevin Zhu

Abstract
