Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Pluralistic Alignment Workshop

RLDF: Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs

Ruoxi Cheng ⋅ Haoxuan Ma ⋅ Shuirong Cao ⋅ Jiaqi Li ⋅ Aihua Pei ⋅ Zhiqiang wang ⋅ Pengliang Ji ⋅ Haoyu Wang ⋅ Jiaqi Huo

Abstract

Chat is not available.