Skip to yearly menu bar Skip to main content


MISR: Measuring Instrumental Self-Reasoning in Frontier Models

Kai Fronsdal · David Lindner

Abstract

Chat is not available.