Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Towards Safe & Trustworthy Agents

Getting By Goal Misgeneralization With a Little Help From a Mentor

Tu Trinh ⋅ Mohamad Hosein Danesh ⋅ Khanh Nguyen ⋅ Benjamin Plaut

Abstract

Chat is not available.