Skip to yearly menu bar Skip to main content


Internal Value Functions: Leveraging Hidden States for Efficient Test-Time Scaling in Large Reasoning Models

Duc Khiem Pham · Sai Muralidhar Jayanthi · Saket Dingliwal · Bhavana Ganesh · Karthik Valmeekam · Xiangchen Song · Vivek Govindan · Beidi Chen · Sravan Babu Bodapati · Aram Galstyan

Abstract

Chat is not available.