Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

55 Results

<<   <   Page 4 of 5   >   >>
Poster
Wed 16:30 Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs
Xuan Zhang · Chao Du · Tianyu Pang · Qian Liu · Wei Gao · Min Lin
Poster
Thu 11:00 Discovering Preference Optimization Algorithms with and for Large Language Models
Chris Lu · Samuel Holt · Claudio Fanconi · Alex Chan · Jakob Foerster · Mihaela van der Schaar · Robert Lange
Workshop
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Hritik Bansal · Ashima Suvarna · Gantavya Bhatt · Nanyun Peng · Kai-Wei Chang · Aditya Grover
Workshop
P3O: Pessimistic Preference-based Policy Optimization for Robust Alignment from Preferences
Dhawal Gupta · Christoph Dann · Alekh Agarwal
Workshop
Sat 12:00 Uncertainty-Penalized Directed Preference Optimization
Sam Houliston · Alexander Immer · Alizée Pace · Gunnar Rätsch
Workshop
Bayesian Optimization of High-dimensional Outputs with Human Feedback
Qing Feng · Zhiyuan Jerry Lin · Yujia Zhang · Ben Letham · Jelena Markovic-Voronov · Ryan-Rhys Griffiths · Peter Frazier · Eytan Bakshy
Workshop
Common Pitfalls of Margin-based Preference Optimization in Language Model Alignment
Hui Yuan · Yifan Zeng · Yue Wu · Huazheng Wang · Mengdi Wang · Liu Leqi
Poster
Thu 11:00 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment
Teng Xiao · Yige Yuan · Huaisheng Zhu · Mingxiao Li · Vasant Honavar
Poster
Thu 16:30 Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
Yuanpu Cao · Tianrong Zhang · Bochuan Cao · Ziyi Yin · Lu Lin · Fenglong Ma · Jinghui Chen
Poster
Wed 16:30 Decision-Focused Learning with Directional Gradients
Michael Huang · Vishal Gupta
Poster
Thu 16:30 Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Aaron Mishkin · Ahmed Khaled · Yuanhao Wang · Aaron Defazio · Robert Gower
Workshop
Directly Optimizing for Synthesizability in Generative Molecular Design using Retrosynthesis Models
Jeff Guo · Philippe Schwaller