Skip to yearly menu bar Skip to main content


Poster

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Dan Zhang ⋅ Sining Zhoubian ⋅ Ziniu Hu ⋅ Yisong Yue ⋅ Yuxiao Dong ⋅ Jie Tang
2024 Poster

Abstract

Video

Chat is not available.