firstbacksecondback
115 Results
Workshop
|
Discovering Temporally-Aware Reinforcement Learning Algorithms Matthew T Jackson · Chris Lu · Louis Kirsch · Robert Lange · Shimon Whiteson · Jakob Foerster |
||
Poster
|
Thu 8:45 |
Gradient Informed Proximal Policy Optimization Sanghyun Son · Laura Zheng · Ryan Sullivan · Yi-Ling Qiao · Ming Lin |
|
Poster
|
Wed 8:45 |
Multi-Step Generalized Policy Improvement by Leveraging Approximate Models Lucas N. Alegre · Ana Bazzan · Ann Nowe · Bruno C. da Silva |
|
Poster
|
Tue 8:45 |
ChessGPT: Bridging Policy Learning and Language Modeling Xidong Feng · Yicheng Luo · Ziyan Wang · Hongrui Tang · Mengyue Yang · Kun Shao · David Mguni · Yali Du · Jun Wang |
|
Poster
|
Tue 15:15 |
HyTrel: Hypergraph-enhanced Tabular Data Representation Learning Pei Chen · Soumajyoti Sarkar · Leonard Lausen · Balasubramaniam Srinivasan · Sheng Zha · Ruihong Huang · George Karypis |
|
Poster
|
Wed 8:45 |
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach Riccardo Poiani · Nicole Nobili · Alberto Maria Metelli · Marcello Restelli |
|
Workshop
|
Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models Theodore Wolf · Nantas Nardelli · John Shawe-Taylor · Maria Perez-Ortiz |
||
Poster
|
Tue 8:45 |
Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes Emmeran Johnson · Ciara Pike-Burke · Patrick Rebeschini |
|
Poster
|
Tue 8:45 |
Conservative Offline Policy Adaptation in Multi-Agent Games Chengjie Wu · Pingzhong Tang · Jun Yang · Yujing Hu · Tangjie Lv · Changjie Fan · Chongjie Zhang |
|
Poster
|
Wed 15:00 |
Interactive Multi-fidelity Learning for Cost-effective Adaptation of Language Model with Sparse Human Supervision Jiaxin Zhang · Zhuohang Li · Kamalika Das · Sricharan Kumar |
|
Poster
|
Wed 8:45 |
Direct Preference-based Policy Optimization without Reward Modeling Gaon An · Junhyeok Lee · Xingdong Zuo · Norio Kosaka · Kyung-Min Kim · Hyun Oh Song |
|
Workshop
|
TOD-Flow: Modeling the Structure of Task-Oriented Dialogues Sungryull Sohn · Yiwei Lyu · Anthony Liu · Lajanugen Logeswaran · Dong-Ki Kim · Dongsub Shim · Honglak Lee |