Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Pluralistic Alignment Workshop

RLDF: Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs

Ruoxi Cheng · Haoxuan Ma · Shuirong Cao · Jiaqi Li · Aihua Pei · Zhiqiang wang · Pengliang Ji · Haoyu Wang · Jiaqi Huo

Abstract

Chat is not available.