

Can LLMs Reliably Evaluate Themselves? A Probabilistic VC Framework

Jae Oh Woo ⋅ Mengdie (Flora) Wang ⋅ Rahul Ghosh ⋅ Baishali Chaudhury ⋅ Mun Kim

Abstract
