Skip to yearly menu bar Skip to main content


A Granular Study of Safety Pretraining under Model Abliteration

Shashank Agnihotri ⋅ Jonas Jakubassa ⋅ Priyam Dey ⋅ Sachin Goyal ⋅ Bernt Schiele ⋅ Venkatesh Babu Radhakrishnan ⋅ Margret Keuper

Abstract

Chat is not available.