Second-order Jailbreaks: Generative Agents Successfully Manipulate Through an Intermediary

Mikhail Terekhov ⋅ Romain Graux ⋅ Eduardo Neville ⋅ Denis Rosset ⋅ Gabin Kolly

Abstract
