Poster | Thu 9:00 | On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting | Tomasz Korbak · Hady Elsahar · Germán Kruszewski · Marc Dymetman