Skip to yearly menu bar Skip to main content


Model Soup for Better RLHF: Weight Space Averaging to Improve Alignment in LLMs

Atoosa Chegini ⋅ Hamid Kazemi ⋅ Iman Mirzadeh ⋅ Dong Yin ⋅ Maxwell Horton ⋅ Moin Nabi ⋅ Mehrdad Farajtabar ⋅ Keivan Alizadeh vahid

Abstract

Chat is not available.