Skip to yearly menu bar Skip to main content


Jailbreak-Zero: A Path to Pareto Optimal Red Teaming for Large Language Models

Kai Hu ⋅ Abhinav Aggarwal ⋅ Mehran Khodabandeh ⋅ David Zhang ⋅ Eric Hsin ⋅ Li Chen ⋅ Ankit Jain ⋅ Matt Fredrikson ⋅ Akash Bharadwaj

Abstract

Chat is not available.