Skip to yearly menu bar Skip to main content


Evaluation Awareness Scales Predictably in Open-Weights Large Language Models

Maheep Chaudhary ⋅ Ian Su ⋅ Nikhil Hooda ⋅ Nishith Shankar ⋅ Julia Tan ⋅ Kevin Zhu ⋅ Ashwinee Panda ⋅ Ryan Lagasse ⋅ Vasu Sharma

Abstract

Chat is not available.