Skip to yearly menu bar Skip to main content


Evaluation Awareness Scales Predictably in Open-Weights Large Language Models

Maheep Chaudhary · Ian Su · Nikhil Hooda · Nishith Shankar · Julia Tan · Kevin Zhu · Ashwinee Panda · Ryan Lagasse · Vasu Sharma

Abstract

Chat is not available.