Skip to yearly menu bar Skip to main content


Curiosity-driven Red teaming for Large Language Models

Zhang-Wei Hong · Idan Shenfeld · Tsun-Hsuan Johnson Wang · Yung-Sung Chuang · Aldo Pareja · Jim Glass · Akash Srivastava · Pulkit Agrawal

Abstract

Chat is not available.