Skip to yearly menu bar Skip to main content


Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization

Hao Sun · Thomas Pouplin · Nicolás Astorga · Tennison Liu · Mihaela van der Schaar

Abstract

Chat is not available.