136 Results
Workshop
|
First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-Offs Ben Norman · Jeff Clune |
||
Workshop
|
HSCL-RL: Mitigating Hallucinations in Multimodal Large Language Models Zichen Song · 思潭 黄 |
||
Poster
|
Thu 11:00 |
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang · Guy Tennenholtz · Chih-wei Hsu · Yinlam Chow · Erdem Bıyık · Craig Boutilier |
|
Workshop
|
Reward Copilot for RL-driven Systems Optimization Karan Tandon · Manav Mishra · Gagan Somashekar · Mayukh Das · Nagarajan Natarajan |
||
Workshop
|
Using adaptive intrinsic motivation in RL to model learning across development Kai Sandbrink · Brian Christian · Linas Nasvytis · Christian Schroeder de Witt · Patrick Butlin |
||
Oral
|
Wed 15:30 |
RL-GPT: Integrating Reinforcement Learning and Code-as-policy Shaoteng Liu · Haoqi Yuan · Minda Hu · Yanwei Li · Yukang Chen · Shu Liu · Zongqing Lu · Jiaya Jia |
|
Poster
|
Wed 16:30 |
RL-GPT: Integrating Reinforcement Learning and Code-as-policy Shaoteng Liu · Haoqi Yuan · Minda Hu · Yanwei Li · Yukang Chen · Shu Liu · Zongqing Lu · Jiaya Jia |
|
Poster
|
Fri 16:30 |
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement Zhi Wang · Li Zhang · Wenhao Wu · Yuanheng Zhu · Dongbin Zhao · Chunlin Chen |
|
Poster
|
Thu 11:00 |
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning Jiafei Lyu · Kang Xu · Jiacheng Xu · yan · Jing-Wen Yang · Zongzhang Zhang · Chenjia Bai · Zongqing Lu · Xiu Li |
|
Poster
|
Wed 16:30 |
Focus On What Matters: Separated Models For Visual-Based RL Generalization Di Zhang · Bowen Lv · Hai Zhang · Feifan Yang · Junqiao Zhao · Hang Yu · Chang Huang · Hongtu Zhou · Chen Ye · changjun jiang |
|
Poster
|
Wed 16:30 |
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX Alexander Rutherford · Benjamin Ellis · Matteo Gallici · Jonathan Cook · Andrei Lupu · Garðar Ingvarsson Juto · Timon Willi · Ravi Hammond · Akbir Khan · Christian Schroeder de Witt · Alexandra Souly · Saptarashmi Bandyopadhyay · Mikayel Samvelyan · Minqi Jiang · Robert Lange · Shimon Whiteson · Bruno Lacerda · Nick Hawes · Tim Rocktäschel · Chris Lu · Jakob Foerster |