firstbacksecondback
17 Results
Workshop
|
A Linear Network Theory of Iterated Learning Devon Jarvis · Richard Klein · Benjamin Rosman · Andrew Saxe |
||
Poster
|
Fri 11:00 |
Bias Amplification in Language Model Evolution: An Iterated Learning Perspective Yi Ren · Shangmin Guo · Linlu Qiu · Bailin Wang · Danica J. Sutherland |
|
Workshop
|
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack Leo McKee-Reid · Joe Needham · Maria Martinez · Christoph Sträter · Mikita Balesni |
||
Poster
|
Fri 16:30 |
Truncated Variance Reduced Value Iteration Yujia Jin · Ishani Karmarkar · Aaron Sidford · Jiayi Wang |
|
Workshop
|
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning Yuxi Xie · Anirudh Goyal · Wenyue Zheng · Min-Yen Kan · Timothy Lillicrap · Kenji Kawaguchi · Michael Qizhe Shieh |