Skip to yearly menu bar Skip to main content


Poster
in
Workshop: System-2 Reasoning at Scale

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Yuxi Xie · Anirudh Goyal · Wenyue Zheng · Min-Yen Kan · Timothy Lillicrap · Kenji Kawaguchi · Michael Qizhe Shieh

Abstract

Chat is not available.