

Curiosity-driven Red-teaming for Large Language Models

Zhang-Wei Hong ⋅ Idan Shenfeld ⋅ Tsun-Hsuan Johnson Wang ⋅ Yung-Sung Chuang ⋅ Aldo Pareja ⋅ Jim Glass ⋅ Akash Srivastava ⋅ Pulkit Agrawal

Abstract
