Skip to yearly menu bar Skip to main content


Contributed Talk 2: Failures to Find Transferable Image Jailbreaks Between Vision-Language Models

Rylan Schaeffer ⋅ Dan Valentine ⋅ Luke Bailey ⋅ James Chua ⋅ Zane Durante ⋅ Cristobal Eyzaguirre ⋅ Joe Benton ⋅ Brando Miranda ⋅ Henry Sleight ⋅ Tony Wang ⋅ John Hughes ⋅ Rajashree Agrawal ⋅ Mrinank Sharma ⋅ Scott Emmons ⋅ Sanmi Koyejo ⋅ Ethan Perez

Abstract

Video

Chat is not available.