55 Results
Poster | Wed 16:30 | Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs | Xuan Zhang · Chao Du · Tianyu Pang · Qian Liu · Wei Gao · Min Lin
Poster | Thu 11:00 | Discovering Preference Optimization Algorithms with and for Large Language Models | Chris Lu · Samuel Holt · Claudio Fanconi · Alex Chan · Jakob Foerster · Mihaela van der Schaar · Robert Lange
Workshop | | Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization | Hritik Bansal · Ashima Suvarna · Gantavya Bhatt · Nanyun Peng · Kai-Wei Chang · Aditya Grover
Workshop | | P3O: Pessimistic Preference-based Policy Optimization for Robust Alignment from Preferences | Dhawal Gupta · Christoph Dann · Alekh Agarwal
Workshop | Sat 12:00 | Uncertainty-Penalized Directed Preference Optimization | Sam Houliston · Alexander Immer · Alizée Pace · Gunnar Rätsch
Workshop | | Bayesian Optimization of High-dimensional Outputs with Human Feedback | Qing Feng · Zhiyuan Jerry Lin · Yujia Zhang · Ben Letham · Jelena Markovic-Voronov · Ryan-Rhys Griffiths · Peter Frazier · Eytan Bakshy
Workshop | | Common Pitfalls of Margin-based Preference Optimization in Language Model Alignment | Hui Yuan · Yifan Zeng · Yue Wu · Huazheng Wang · Mengdi Wang · Liu Leqi
Poster | Thu 11:00 | Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment | Teng Xiao · Yige Yuan · Huaisheng Zhu · Mingxiao Li · Vasant Honavar
Poster | Thu 16:30 | Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization | Yuanpu Cao · Tianrong Zhang · Bochuan Cao · Ziyi Yin · Lu Lin · Fenglong Ma · Jinghui Chen
Poster | Wed 16:30 | Decision-Focused Learning with Directional Gradients | Michael Huang · Vishal Gupta
Poster | Thu 16:30 | Directional Smoothness and Gradient Methods: Convergence and Adaptivity | Aaron Mishkin · Ahmed Khaled · Yuanhao Wang · Aaron Defazio · Robert Gower
Workshop | | Directly Optimizing for Synthesizability in Generative Molecular Design using Retrosynthesis Models | Jeff Guo · Philippe Schwaller