firstbacksecondback
156 Results
Poster
|
Wed 9:00 |
Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games Ioannis Anagnostides · Gabriele Farina · Ioannis Panageas · Tuomas Sandholm |
|
Workshop
|
FedSynth: Gradient Compression via Synthetic Data in Federated Learning Shengyuan Hu · Jack Goetz · Kshitiz Malik · Hongyuan Zhan · Zhe Liu · Yue Liu |
||
Workshop
|
Contextual Transformer for Offline Meta Reinforcement Learning Runji Lin · Ye Li · Xidong Feng · Zhaowei Zhang · XIAN HONG WU FUNG · Haifeng Zhang · Jun Wang · Yali Du · Yaodong Yang |
||
Poster
|
Tue 14:00 |
Enhanced Meta Reinforcement Learning via Demonstrations in Sparse Reward Environments Desik Rengarajan · Sapana Chaudhary · Jaewon Kim · Dileep Kalathil · Srinivas Shakkottai |
|
Poster
|
Tue 14:00 |
Learning State-Aware Visual Representations from Audible Interactions Himangi Mittal · Pedro Morgado · Unnat Jain · Abhinav Gupta |
|
Workshop
|
An Exploration of Methods for Zero-shot Transfer in Small Language Models Alon Albalak · Akshat Shrivastava · Chinnadhurai Sankar · Adithya Sagar · Mike Ross |
||
Poster
|
Wed 9:00 |
Off-Team Learning Brandon Cui · Hengyuan Hu · Andrei Lupu · Samuel Sokota · Jakob Foerster |
|
Poster
|
Wed 9:00 |
GriddlyJS: A Web IDE for Reinforcement Learning Christopher Bamford · Minqi Jiang · Mikayel Samvelyan · Tim Rocktäschel |
|
Workshop
|
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning Anton Bakhtin · David Wu · Adam Lerer · Jonathan Gray · Athul Jacob · Gabriele Farina · Alexander Miller · Noam Brown |
||
Poster
|
Thu 9:00 |
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting Tomasz Korbak · Hady Elsahar · Germán Kruszewski · Marc Dymetman |
|
Poster
|
Tue 9:00 |
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning Bo Liu · Xidong Feng · Jie Ren · Luo Mai · Rui Zhu · Haifeng Zhang · Jun Wang · Yaodong Yang |
|
Poster
|
Tue 9:00 |
FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning Xiao-Yang Liu · Ziyi Xia · Jingyang Rui · Jiechao Gao · Hongyang Yang · Ming Zhu · Christina Wang · Zhaoran Wang · Jian Guo |