Search All 2024 Events
 

2 Results

Workshop
The Crucial Role of Samplers in Online Direct Preference Optimization
Ruizhe Shi · Runlong Zhou · Simon Du
Workshop
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning
Yihe Deng · Paul Mineiro