10 Results
Workshop | Sun 10:10 | Invited talk 2: Danqi Chen on Uncovering Simple Failures in Generative Models and How to Fix Them Danqi Chen
Workshop | | LLM-Assisted Red Teaming of Diffusion Models through "Failures Are Fated, But Can Be Faded" Som Sagar · Aditya Taparia · Ransalu Senanayake
Workshop | | Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez
Workshop | | Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez
Workshop | Sun 10:55 | Contributed Talk 2: Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez
Workshop | | Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Rylan Schaeffer · Dan Valentine · Luke Bailey · James Chua · Zane Durante · Cristobal Eyzaguirre · Joe Benton · Brando Miranda · Henry Sleight · Tony Wang · John Hughes · Rajashree Agrawal · Mrinank Sharma · Scott Emmons · Sanmi Koyejo · Ethan Perez
Workshop | | Interactive Semantic Interventions for VLMs: Breaking VLMs with Human Ingenuity Lukas Klein · Kenza Amara · Carsten Lüth · Hendrik Strobelt · Mennatallah El-Assady · Paul Jaeger
Workshop | | Interactive Semantic Interventions for VLMs: A Causality-Inspired Investigation of VLM Failures Lukas Klein · Kenza Amara · Carsten Lüth · Hendrik Strobelt · Mennatallah El-Assady · Paul Jaeger
Workshop | Sat 15:45 | Interactive Semantic Interventions for VLMs: A Human-in-the-Loop Approach to Interpretability Lukas Klein · Kenza Amara · Carsten Lüth · Hendrik Strobelt · Mennatallah El-Assady · Paul Jaeger
Workshop | | Developing a Foundation Model for Predicting Material Failure Agnese Marcato · Javier E. Santos · Aleksandra Pachalieva · Kai Gao · Ryley Hill · Esteban Rougier · Qinjun Kang · Jeffrey Hyman · Abigail Hunter · Janel Chua · Earl Lawrence · Hari Viswanathan · Daniel O'Malley