Skip to yearly menu bar Skip to main content


Spotlight Poster

CTRL-ALT-DECEIT Sabotage Evaluations for Automated AI R&D

Francis Ward ⋅ Teun van der Weij ⋅ Hanna Gábor ⋅ Sam Martin ⋅ Raja Moreno ⋅ Harel Lidar ⋅ Louis Makower ⋅ Thomas Jodrell ⋅ Lauren Robson
2025 Spotlight Poster

Abstract

Video

Chat is not available.