Skip to yearly menu bar Skip to main content


Poster

Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization

Subhojyoti Mukherjee ⋅ Viet Lai ⋅ Raghavendra Addanki ⋅ Ryan Rossi ⋅ Seunghyun Yoon ⋅ Trung Bui ⋅ Anup B. Rao ⋅ Jayakumar Subramanian ⋅ Branislav Kveton
2025 Poster

Abstract

Video

Chat is not available.