Skip to yearly menu bar Skip to main content


Poster

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Patrick Chao ⋅ Edoardo Debenedetti ⋅ Alexander Robey ⋅ Maksym Andriushchenko ⋅ Francesco Croce ⋅ Vikash Sehwag ⋅ Edgar Dobriban ⋅ Nicolas Flammarion ⋅ George J. Pappas ⋅ Florian Tramer ⋅ Hamed Hassani ⋅ Eric Wong
2024 Poster

Abstract

Video

Chat is not available.