

Poster

Representation Noising: A Defence Mechanism Against Harmful Finetuning

Domenic Rosati ⋅ Jan Wehner ⋅ Kai Williams ⋅ Lukasz Bartoszcze ⋅ Robie Gonzales ⋅ Carsten Maple ⋅ Subhabrata Majumdar ⋅ Hassan Sajjad ⋅ Frank Rudzicz
2024 Poster

Abstract

