Skip to yearly menu bar Skip to main content


Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

Sonia Murthy ⋅ Rosie Zhao ⋅ Jennifer Hu ⋅ Sham Kakade ⋅ Markus Wulfmeier ⋅ Peng Qian ⋅ Tomer Ullman

Abstract

Video

Chat is not available.