

Jailbreaking Language Models at Scale via Persona Modulation

Rusheb Shah ⋅ Quentin Feuillade Montixi ⋅ Soroush Pour ⋅ Arush Tagade ⋅ Javier Rando

Abstract
