

Response-Based Knowledge Distillation for Multilingual Jailbreak Prevention Unwittingly Compromises Safety

Max Zhang · Derek Liu · Kai Zhang · Joshua Franco · Haihao Liu · Kevin Zhu

Abstract
