Skip to yearly menu bar Skip to main content


Model Soup for Better RLHF: Weight Space Averaging to Improve Alignment in LLMs

Atoosa Chegini · Hamid Kazemi · Iman Mirzadeh · Dong Yin · Maxwell Horton · Moin Nabi · Mehrdad Farajtabar · Keivan Alizadeh vahid

Abstract

Chat is not available.