Skip to yearly menu bar Skip to main content


Poster

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

Tomasz Korbak ⋅ Hady Elsahar ⋅ Germán Kruszewski ⋅ Marc Dymetman
2022 Poster

Abstract

Video

Chat is not available.