Skip to yearly menu bar Skip to main content


Poster

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Rafael Rafailov ⋅ Yaswanth Chittepu ⋅ Ryan Park ⋅ Harshit Sushil Sikchi ⋅ Joey Hejna ⋅ Brad Knox ⋅ Chelsea Finn ⋅ Scott Niekum
2024 Poster

Abstract

Video

Chat is not available.