Skip to yearly menu bar Skip to main content


ShieldBench: A Comprehensive Benchmark for Evaluating the Persistence of LLM Safety Interventions

Mert Ogul · Rishitha Voleti · Shanduojiao Jiang · Kevin Zhu

Abstract

Chat is not available.