Skip to yearly menu bar Skip to main content


ShieldBench: A Comprehensive Benchmark for Evaluating the Persistence of LLM Safety Interventions

Mert Ogul ⋅ Rishitha Voleti ⋅ Shanduojiao Jiang ⋅ Kevin Zhu

Abstract

Chat is not available.