Skip to yearly menu bar Skip to main content


Contributed Talk 2: Failures to Find Transferable Image Jailbreaks Between Vision-Language Models

Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez

Abstract

Video

Chat is not available.