Adversarial training for high-stakes reliability

Daniel Ziegler ⋅ Seraphina Nix ⋅ Lawrence Chan ⋅ Tim Bauman ⋅ Peter Schmidt-Nielsen ⋅ Tao Lin ⋅ Adam Scherlis ⋅ Noa Nabeshima ⋅ Benjamin Weinstein-Raun ⋅ Daniel de Haas ⋅ Buck Shlegeris ⋅ Nate Thomas
2022 Poster

Abstract
