ORPO-Distill: Mixed-Policy Preference Optimization for Cross-Architecture LLM Distillation

Aasheesh Singh · Vishal Vaddina · Dagnachew Birru

Abstract
