Fri 8:25 a.m. - 8:30 a.m.
|
Opening Remarks
(
Opening Remarks
)
>
SlidesLive Video
|
|
Fri 8:30 a.m. - 9:00 a.m.
|
Tobias Gerstenberg
(
Invited Talk
)
>
SlidesLive Video
|
Tobias Gerstenberg
|
Fri 9:00 a.m. - 9:15 a.m.
|
ESCHER: ESCHEWING IMPORTANCE SAMPLING IN GAMES BY COMPUTING A HISTORY VALUE FUNCTION TO ESTIMATE REGRET
(
Poster
)
>
link
|
Stephen McAleer · Gabriele Farina · Marc Lanctot · Tuomas Sandholm
|
Fri 9:15 a.m. - 9:30 a.m.
|
Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
(
Poster
)
>
link
SlidesLive Video
|
Jason Yecheng Ma · Shagun Sodhani · Dinesh Jayaraman · Osbert Bastani · Vikash Kumar · Amy Zhang
|
Fri 9:30 a.m. - 9:45 a.m.
|
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
(
Poster
)
>
link
SlidesLive Video
|
Ruijie Zheng · Xiyao Wang · Huazhe Xu · Furong Huang
|
Fri 9:45 a.m. - 10:00 a.m.
|
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes
(
Poster
)
>
link
SlidesLive Video
|
Aviral Kumar · Rishabh Agarwal · XINYANG GENG · George Tucker · Sergey Levine
|
Fri 10:00 a.m. - 10:30 a.m.
|
Jakob Foerster
(
Invited Talk
)
>
SlidesLive Video
|
Jakob Foerster
|
Fri 11:00 a.m. - 11:30 a.m.
|
Scientific Experiments in Reinforcement Learning
(
Opinion Talk
)
>
SlidesLive Video
|
Scott Jordan
|
Fri 11:30 a.m. - 11:45 a.m.
|
Transformers are Sample-Efficient World Models
(
Poster
)
>
link
|
Vincent Micheli · Eloi Alonso · François Fleuret
|
Fri 11:45 a.m. - 12:00 p.m.
|
Scaling Laws for a Multi-Agent Reinforcement Learning Model
(
Poster
)
>
link
SlidesLive Video
|
Oren Neumann · Claudius Gros
|
Fri 12:00 p.m. - 12:30 p.m.
|
Natasha Jaques
(
Opinion Talk
)
>
SlidesLive Video
|
Natasha Jaques
|
Fri 1:30 p.m. - 2:00 p.m.
|
The World is not Uniformly Distributed; Important Implications for Deep RL
(
Opinion Talk
)
>
|
Stephanie Chan
|
Fri 2:00 p.m. - 2:30 p.m.
|
Amy Zhang
(
Invited Talk
)
>
|
Amy Zhang
|
Fri 3:00 p.m. - 3:30 p.m.
|
Igor Mordatch
(
Invited Talk
)
>
SlidesLive Video
|
Igor Mordatch
|
Fri 3:30 p.m. - 3:45 p.m.
|
John Schulman
(
Implementation Talk
)
>
SlidesLive Video
|
John Schulman
|
Fri 3:45 p.m. - 4:00 p.m.
|
Danijar Hafner
(
Implementation Talk
)
>
SlidesLive Video
|
Danijar Hafner
|
Fri 4:00 p.m. - 4:15 p.m.
|
Kristian Hartikainen
(
Implementation Talk
)
>
|
Kristian Hartikainen
|
Fri 4:15 p.m. - 4:30 p.m.
|
Ilya Kostrikov, Aviral Kumar
(
Implementation Talk
)
>
SlidesLive Video
|
Ilya Kostrikov · Aviral Kumar
|
Fri 4:30 p.m. - 5:30 p.m.
|
Panel Discussion
(
Panel Discussion
)
>
SlidesLive Video
|
|
Fri 5:30 p.m. - 5:35 p.m.
|
Closing Remarks
(
Closing Remarks
)
>
|
|
-
|
Compositional Task Generalization with Modular Successor Feature Approximators
(
Poster
)
>
link
|
Wilka Carvalho
|
-
|
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps
(
Poster
)
>
link
|
Sudeep Dasari · Vikash Kumar
|
-
|
Neural All-Pairs Shortest Path for Reinforcement Learning
(
Poster
)
>
link
|
Cristina Pinneri · Georg Martius · Andreas Krause
|
-
|
VI2N: A Network for Planning Under Uncertainty based on Value of Information
(
Poster
)
>
link
SlidesLive Video
|
Samantha Johnson · Michael Buice · Koosha Khalvati
|
-
|
Efficient Multi-Horizon Learning for Off-Policy Reinforcement Learning
(
Poster
)
>
link
|
Raja Farrukh Ali · Nasik Muhammad Nafi · Kevin Duong · William Hsu
|
-
|
Analyzing the Sensitivity to Policy-Value Decoupling in Deep Reinforcement Learning Generalization
(
Poster
)
>
link
SlidesLive Video
|
Nasik Muhammad Nafi · Raja Farrukh Ali · William Hsu
|
-
|
Lagrangian Model Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Adithya Ramesh · Balaraman Ravindran
|
-
|
Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines
(
Poster
)
>
link
SlidesLive Video
|
Andrew Li · Zizhao Chen · Pashootan Vaezipoor · Toryn Klassen · Rodrigo Toro Icarte · Sheila McIlraith
|
-
|
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
(
Poster
)
>
link
SlidesLive Video
|
Min Zhang · Hongyao Tang · Jianye Hao · YAN ZHENG
|
-
|
Informative rewards and generalization in curriculum learning
(
Poster
)
>
link
SlidesLive Video
|
Rahul Siripurapu · Vihang Patil · Kajetan Schweighofer · Marius-Constantin Dinu · Markus Holzleitner · Hamid Eghbalzadeh · Luis Ferro · Thomas Schmied · Michael Kopp · Sepp Hochreiter
|
-
|
Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation
(
Poster
)
>
link
|
Yuzhe Qin · Binghao Huang · Zhao-Heng Yin · Hao Su · Xiaolong Wang
|
-
|
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
(
Poster
)
>
link
SlidesLive Video
|
Abdus Salam Azad · Izzeddin Gur · Aleksandra Faust · Pieter Abbeel · Ion Stoica
|
-
|
The Emphatic Approach to Average-Reward Policy Evaluation
(
Poster
)
>
link
SlidesLive Video
|
Jiamin He · Yi Wan · Rupam Mahmood
|
-
|
Learning Exploration Policies with View-based Intrinsic Rewards
(
Poster
)
>
link
SlidesLive Video
|
Yijie Guo · Yao Fu · Run Peng · Honglak Lee
|
-
|
Scaling Covariance Matrix Adaptation MAP-Annealing to High-Dimensional Controllers
(
Poster
)
>
link
SlidesLive Video
|
Bryon Tjanaka · Matthew Fontaine · Aniruddha Kalkar · Stefanos Nikolaidis
|
-
|
Policy Aware Model Learning via Transition Occupancy Matching
(
Poster
)
>
link
SlidesLive Video
|
Jason Yecheng Ma · Kausik Sivakumar · Osbert Bastani · Dinesh Jayaraman
|
-
|
On The Fragility of Learned Reward Functions
(
Poster
)
>
link
SlidesLive Video
|
Lev McKinney · Yawen Duan · Adam Gleave · David Krueger
|
-
|
Temporary Goals for Exploration
(
Poster
)
>
link
SlidesLive Video
|
Haoyang Xu · Jimmy Ba · Silviu Pitis · Harris Chan
|
-
|
Revisiting Bellman Errors for Offline Model Selection
(
Poster
)
>
link
|
Joshua Zitovsky · Daniel de Marchi · Rishabh Agarwal · Michael Kosorok
|
-
|
Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Zhixuan Lin · Pierluca D'Oro · Evgenii Nikishin · Aaron Courville
|
-
|
What Makes Certain Pre-Trained Visual Representations Better for Robotic Learning?
(
Poster
)
>
link
|
Kyle Hsu · Tyler Lum · Ruohan Gao · Shixiang (Shane) Gu · Jiajun Wu · Chelsea Finn
|
-
|
Curiosity in Hindsight
(
Poster
)
>
link
SlidesLive Video
|
Daniel Jarrett · Corentin Tallec · Florent Altché · Thomas Mesnard · Remi Munos · Michal Valko
|
-
|
Train Offline, Test Online: A Real Robot Learning Benchmark
(
Poster
)
>
link
SlidesLive Video
|
Gaoyue Zhou · Victoria Dean · Mohan Kumar Srirama · Aravind Rajeswaran · Jyothish Pari · Kyle Hatch · Aryan Jain · Tianhe Yu · Pieter Abbeel · Lerrel Pinto · Chelsea Finn · Abhinav Gupta
|
-
|
A Framework for Predictable Actor-Critic Control
(
Poster
)
>
link
SlidesLive Video
|
Josiah Coad · James Ault · Jeff Hykin · Guni Sharon
|
-
|
Ensemble based uncertainty estimation with overlapping alternative predictions
(
Poster
)
>
link
SlidesLive Video
|
Dirk Eilers · Felippe Schmoeller Roza · Karsten Roscher
|
-
|
Offline Reinforcement Learning on Real Robot with Realistic Data Sources
(
Poster
)
>
link
SlidesLive Video
|
Gaoyue Zhou · Liyiming Ke · Siddhartha Srinivasa · Abhinav Gupta · Aravind Rajeswaran · Vikash Kumar
|
-
|
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
(
Poster
)
>
link
SlidesLive Video
|
JB Lanier · Stephen McAleer · Pierre Baldi · Roy Fox
|
-
|
Training Equilibria in Reinforcement Learning
(
Poster
)
>
link
|
Lauro Langosco · David Krueger · Adam Gleave
|
-
|
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
(
Poster
)
>
link
|
Samuel Sokota · Ryan D'Orazio · J. Zico Kolter · Nicolas Loizou · Marc Lanctot · Ioannis Mitliagkas · Noam Brown · Christian Kroer
|
-
|
Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Ali Rahimi-Kalahroudi · Janarthanan Rajendran · Ida Momennejad · Harm Van Seijen · Sarath Chandar
|
-
|
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
(
Poster
)
>
link
|
Joey Hong · Aviral Kumar · Sergey Levine
|
-
|
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
(
Poster
)
>
link
SlidesLive Video
|
Yanqiu Wu · Xinyue Chen · Che Wang · Yiming Zhang · Keith Ross
|
-
|
Integrating Episodic and Global Bonuses for Efficient Exploration
(
Poster
)
>
link
|
Mikael Henaff · Minqi Jiang · Roberta Raileanu
|
-
|
Deconfounded Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Risto Vuorio · Pim de Haan · Johann Brehmer · Hanno Ackermann · Daniel Dijkman · Taco Cohen
|
-
|
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Eddy Hudson · Ishan Durugkar · Garrett Warnell · Peter Stone
|
-
|
Human-AI Coordination via Human-Regularized Search and Learning
(
Poster
)
>
link
SlidesLive Video
|
Hengyuan Hu · David Wu · Adam Lerer · Jakob Foerster · Noam Brown
|
-
|
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
(
Poster
)
>
link
|
Jesse Farebrother · Joshua Greaves · Rishabh Agarwal · Charline Le Lan · Ross Goroshin · Pablo Samuel Castro · Marc Bellemare
|
-
|
Return Augmentation gives Supervised RL Temporal Compositionality
(
Poster
)
>
link
SlidesLive Video
|
Keiran Paster · Silviu Pitis · Sheila McIlraith · Jimmy Ba
|
-
|
Design Process is a Reinforcement Learning Problem
(
Poster
)
>
link
SlidesLive Video
|
Reza Kakooee · Benjamin Dillenburger
|
-
|
Bayesian Q-learning With Imperfect Expert Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Fengdi Che · Xiru Zhu · Doina Precup · David Meger · Gregory Dudek
|
-
|
Efficient Deep Reinforcement Learning Requires Regulating Statistical Overfitting
(
Poster
)
>
link
|
Qiyang Li · Aviral Kumar · Ilya Kostrikov · Sergey Levine
|
-
|
Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning
(
Poster
)
>
link
|
Anikait Singh · Aviral Kumar · Frederik Ebert · Yanlai Yang · Chelsea Finn · Sergey Levine
|
-
|
Offline Reinforcement Learning from Heteroskedastic Data Via Support Constraints
(
Poster
)
>
link
|
Anikait Singh · Aviral Kumar · Quan Vuong · Yevgen Chebotar · Sergey Levine
|
-
|
Variance Double-Down: The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Johan Obando Ceron · Marc Bellemare · Pablo Samuel Castro
|
-
|
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems
(
Poster
)
>
link
|
Yihao Feng · Shentao Yang · Shujian Zhang · Jianguo Zhang · Caiming Xiong · Mingyuan Zhou · Huan Wang
|
-
|
In the ZONE: Measuring difficulty and progression in curriculum generation
(
Poster
)
>
link
SlidesLive Video
|
Rose Wang · Jesse Mu · Dilip Arumugam · Natasha Jaques · Noah Goodman
|
-
|
Better state exploration using action sequence equivalence
(
Poster
)
>
link
SlidesLive Video
|
Nathan Grinsztajn · Toby Johnstone · Johan Ferret · philippe preux
|
-
|
Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment
(
Poster
)
>
link
SlidesLive Video
|
Louis Bagot · Kevin Mets · Tom De Schepper · Steven Latre
|
-
|
Guiding Exploration Towards Impactful Actions
(
Poster
)
>
link
SlidesLive Video
|
Vaibhav Saxena · Jimmy Ba · Danijar Hafner
|
-
|
Domain Invariant Q-Learning for model-free robust continuous control under visual distractions
(
Poster
)
>
link
SlidesLive Video
|
Tom Dupuis · Jaonary Rabarisoa · Quoc Cuong PHAM · David Filliat
|
-
|
Multi-Agent Policy Transfer via Task Relationship Modeling
(
Poster
)
>
link
SlidesLive Video
|
Rong-Jun Qin · Feng Chen · Tonghan Wang · Lei Yuan · Xiaoran Wu · Yipeng Kang · Zongzhang Zhang · Chongjie Zhang · Yang Yu
|
-
|
Foundation Models for History Compression in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Fabian Paischer · Thomas Adler · Andreas Radler · Markus Hofmarcher · Sepp Hochreiter
|
-
|
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Chang Yang · RUIYU WANG · Xinrun Wang · Zhen Wang
|
-
|
Imitating Human Behaviour with Diffusion Models
(
Poster
)
>
link
SlidesLive Video
|
Tim Pearce · Tabish Rashid · Anssi Kanervisto · David Bignell · Mingfei Sun · Raluca Georgescu · Sergio Valcarcel Macua · Shan Zheng Tan · Ida Momennejad · Katja Hofmann · Sam Devlin
|
-
|
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
(
Poster
)
>
link
SlidesLive Video
|
Yifu Yuan · Jianye Hao · Fei Ni · Yao Mu · YAN ZHENG · Yujing Hu · Jinyi Liu · Yingfeng Chen · Changjie Fan
|
-
|
ERL-Re: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
(
Poster
)
>
link
SlidesLive Video
|
Pengyi Li · Hongyao Tang · Jianye Hao · YAN ZHENG · Xian Fu · Zhaopeng Meng
|
-
|
Quantization-aware Policy Distillation (QPD)
(
Poster
)
>
link
SlidesLive Video
|
Thomas Avé · Kevin Mets · Tom De Schepper · Steven Latre
|
-
|
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
(
Poster
)
>
link
SlidesLive Video
|
Michał Zawalski · Michał Tyrolski · Konrad Czechowski · Damian Stachura · Piotr Piękos · Tomasz Odrzygóźdź · Yuhuai Wu · Łukasz Kuciński · Piotr Miłoś
|
-
|
Cyclophobic Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Stefan Wagner · Peter Arndt · Jan Robine · Stefan Harmeling
|
-
|
AsymQ: Asymmetric Q-loss to mitigate overestimation bias in off-policy reinforcement learning
(
Poster
)
>
link
|
Qinsheng Zhang · Arjun Krishna · Sehoon Ha · Yongxin Chen
|
-
|
Fine-tuning Offline Policies with Optimistic Action Selection
(
Poster
)
>
link
SlidesLive Video
|
Max Sobol Mark · Ali Ghadirzadeh · Xi Chen · Chelsea Finn
|
-
|
SEM2: Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
(
Poster
)
>
link
SlidesLive Video
|
Zeyu Gao · Yao Mu · Ruoyan Shen · Chen Chen · Yangang Ren · Jianyu Chen · Shengbo Li · Ping Luo · Yanfeng Lu
|
-
|
Policy Architectures for Compositional Generalization in Control
(
Poster
)
>
link
SlidesLive Video
|
Allan Zhou · Vikash Kumar · Chelsea Finn · Aravind Rajeswaran
|
-
|
Rethinking Learning Dynamics in RL using Adversarial Networks
(
Poster
)
>
link
SlidesLive Video
|
Ramnath Kumar · Tristan Deleu · Yoshua Bengio
|
-
|
Look Back When Surprised: Stabilizing Reverse Experience Replay for Neural Approximation
(
Poster
)
>
link
SlidesLive Video
|
Ramnath Kumar · Dheeraj Nagaraj
|
-
|
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction
(
Poster
)
>
link
SlidesLive Video
|
Jiachen Li · Shuo Cheng · Zhenyu Liao · Huayan Wang · William Yang Wang · Qinxun Bai
|
-
|
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
(
Poster
)
>
link
SlidesLive Video
|
Stone Tao · Xiaochen Li · Tongzhou Mu · Zhiao Huang · Yuzhe Qin · Hao Su
|
-
|
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
(
Poster
)
>
link
|
Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc Bellemare · Aaron Courville
|
-
|
Adversarial Policies Beat Professional-Level Go AIs
(
Poster
)
>
link
SlidesLive Video
|
Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart J Russell
|
-
|
VARIATIONAL REPARAMETRIZED POLICY LEARNING WITH DIFFERENTIABLE PHYSICS
(
Poster
)
>
link
SlidesLive Video
|
Zhiao Huang · Litian Liang · Zhan Ling · Xuanlin Li · Chuang Gan · Hao Su
|
-
|
Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing
(
Poster
)
>
link
SlidesLive Video
|
Grace Zhang · Ayush Jain · Injune Hwang · Shao-Hua Sun · Joseph Lim
|
-
|
Contrastive Example-Based Control
(
Poster
)
>
link
|
Kyle Hatch · Sarthak J Shetty · Benjamin Eysenbach · Tianhe Yu · Rafael Rafailov · Russ Salakhutdinov · Sergey Levine · Chelsea Finn
|
-
|
A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations
(
Poster
)
>
link
SlidesLive Video
|
Qisai Liu · Xian Yeow Lee · Soumik Sarkar
|
-
|
Multi-skill Mobile Manipulation for Object Rearrangement
(
Poster
)
>
link
SlidesLive Video
|
Jiayuan Gu · Devendra Singh Chaplot · Hao Su · Jitendra Malik
|
-
|
Visual Reinforcement Learning with Self-Supervised 3D Representations
(
Poster
)
>
link
SlidesLive Video
|
Yanjie Ze · Nicklas Hansen · Yinbo Chen · Mohit Jain · Xiaolong Wang
|
-
|
One-shot Visual Imitation via Attributed Waypoints and Demonstration Augmentation
(
Poster
)
>
link
SlidesLive Video
|
Matthew Chang · Saurabh Gupta
|
-
|
Building a Subspace of Policies for Scalable Continual Learning
(
Poster
)
>
link
SlidesLive Video
|
Jean-Baptiste Gaya · Thang Long Doan · Lucas Page-Caccia · Laure Soulier · Ludovic Denoyer · Roberta Raileanu
|
-
|
Skill Machines: Temporal Logic Composition in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Geraud Nangue Tasse · Devon Jarvis · Steven James · Benjamin Rosman
|
-
|
Learning Representations for Reinforcement Learning with Hierarchical Forward Models
(
Poster
)
>
link
SlidesLive Video
|
Trevor McInroe · Lukas Schäfer · Stefano Albrecht
|
-
|
In-context Reinforcement Learning with Algorithm Distillation
(
Poster
)
>
link
SlidesLive Video
|
Michael Laskin · Luyu Wang · Junhyuk Oh · Emilio Parisotto · Stephen Spencer · Richie Steigerwald · DJ Strouse · Steven Hansen · Angelos Filos · Ethan Brooks · Maxime Gazeau · Himanshu Sahni · Satinder Singh · Volodymyr Mnih
|
-
|
Time-Myopic Go-Explore: Learning A State Representation for the Go-Explore Paradigm
(
Poster
)
>
link
SlidesLive Video
|
Marc Höftmann · Jan Robine · Stefan Harmeling
|
-
|
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Nicklas Hansen · Yixin Lin · Hao Su · Xiaolong Wang · Vikash Kumar · Aravind Rajeswaran
|
-
|
Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation
(
Poster
)
>
link
SlidesLive Video
|
Linfeng Zhao · Huazhe Xu · Lawson Wong
|
-
|
Graph Inverse Reinforcement Learning from Diverse Videos
(
Poster
)
>
link
SlidesLive Video
|
Sateesh Kumar · Jonathan Zamora · Nicklas Hansen · Rishabh Jangir · Xiaolong Wang
|
-
|
Simple Emergent Action Representations from Multi-Task Policy Training
(
Poster
)
>
link
SlidesLive Video
|
Pu Hua · Yubei Chen · Huazhe Xu
|
-
|
Adversarial Cheap Talk
(
Poster
)
>
link
|
Chris Lu · Timon Willi · Alistair Letcher · Jakob Foerster
|
-
|
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
yifan xu · Nicklas Hansen · Zirui Wang · Yung-Chieh Chan · Hao Su · Zhuowen Tu
|
-
|
SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling
(
Poster
)
>
link
SlidesLive Video
|
Jesse Zhang · Karl Pertsch · Jiahui Zhang · Taewook Nam · Sung Ju Hwang · Xiang Ren · Joseph Lim
|
-
|
Towards True Lossless Sparse Communication in Multi-Agent Systems
(
Poster
)
>
link
SlidesLive Video
|
Seth Karten · Mycal Tucker · Siva Kailas · Katia Sycara
|
-
|
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
(
Poster
)
>
link
SlidesLive Video
|
Anton Bakhtin · David Wu · Adam Lerer · Jonathan Gray · Athul Jacob · Gabriele Farina · Alexander Miller · Noam Brown
|
-
|
PnP-Nav: Plug-and-Play Policies for Generalizable Visual Navigation Across Robots
(
Poster
)
>
link
SlidesLive Video
|
Dhruv Shah · Ajay Sridhar · Arjun Bhorkar · Noriaki Hirose · Sergey Levine
|
-
|
Offline Reinforcement Learning for Customizable Visual Navigation
(
Poster
)
>
link
|
Dhruv Shah · Arjun Bhorkar · Hrishit Leen · Ilya Kostrikov · Nicholas Rhinehart · Sergey Levine
|
-
|
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
(
Poster
)
>
link
|
Remo Sasso · Matthia Sabatelli · Marco Wiering
|
-
|
Hyperbolic Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Edoardo Cetin · Benjamin Chamberlain · Michael Bronstein · jonathan j hunt
|
-
|
Investigating Multi-task Pretraining and Generalization in Reinforcement Learning
(
Poster
)
>
link
|
Adrien Ali Taiga · Rishabh Agarwal · Jesse Farebrother · Aaron Courville · Marc Bellemare
|
-
|
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Zhendong Wang · jonathan j hunt · Mingyuan Zhou
|
-
|
Efficient Exploration using Model-Based Quality-Diversity with Gradients
(
Poster
)
>
link
SlidesLive Video
|
Bryan Lim · Manon Flageat · Antoine Cully
|
-
|
Choreographer: Learning and Adapting Skills in Imagination
(
Poster
)
>
link
SlidesLive Video
|
Pietro Mazzaglia · Tim Verbelen · Bart Dhoedt · Alexandre Lacoste · Sai Rajeswar Mudumba
|
-
|
Giving Robots a Hand: Broadening Generalization via Hand-Centric Human Video Demonstrations
(
Poster
)
>
link
|
Moo J Kim · Jiajun Wu · Chelsea Finn
|
-
|
Efficient Offline Policy Optimization with a Learned Model
(
Poster
)
>
link
SlidesLive Video
|
Zichen Liu · Siyi Li · Wee Sun Lee · Shuicheng Yan · Zhongwen Xu
|
-
|
Emergent collective intelligence from massive-agent cooperation and competition
(
Poster
)
>
link
SlidesLive Video
|
Hanmo Chen · Stone Tao · JIAXIN CHEN · Weihan Shen · Xihui Li · Chenghui Yu · Sikai Cheng · Xiaolong Zhu · Xiu Li
|
-
|
Distance-Sensitive Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Li Jianxiong · Xianyuan Zhan · Haoran Xu · Xiangyu Zhu · Jingjing Liu · Ya-Qin Zhang
|
-
|
Uncertainty-Driven Exploration for Generalization in Reinforcement Learning
(
Poster
)
>
link
|
Yiding Jiang · J. Zico Kolter · Roberta Raileanu
|
-
|
Language Models Can Teach Themselves to Program Better
(
Poster
)
>
link
SlidesLive Video
|
Patrick Haluptzok · Matthew Bowers · Adam Kalai
|
-
|
Graph Q-Learning for Combinatorial Optimization
(
Poster
)
>
link
SlidesLive Video
|
Victoria Magdalena Dax · Jiachen Li · Kevin Leahy · Mykel J Kochenderfer
|
-
|
Transformer-based World Models Are Happy With 100k Interactions
(
Poster
)
>
link
SlidesLive Video
|
Jan Robine · Marc Höftmann · Tobias Uelwer · Stefan Harmeling
|
-
|
Contrastive Value Learning: Implicit Models for Simple Offline RL
(
Poster
)
>
link
SlidesLive Video
|
Bogdan Mazoure · Benjamin Eysenbach · Ofir Nachum · Jonathan Tompson
|
-
|
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
(
Poster
)
>
link
SlidesLive Video
|
Changnan Xiao · Haosen Shi · Jiajun Fan · Shihong Deng · Haiyan Yin
|
-
|
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktäschel
|
-
|
Pink Noise Is All You Need: Colored Noise Exploration in Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Onno Eberhard · Jakob Hollenstein · Cristina Pinneri · Georg Martius
|
-
|
Evaluating Long-Term Memory in 3D Mazes
(
Poster
)
>
link
|
Jurgis Pašukonis · Timothy Lillicrap · Danijar Hafner
|
-
|
Visual Imitation Learning with Patch Rewards
(
Poster
)
>
link
|
Minghuan Liu · Tairan He · Weinan Zhang · Shuicheng Yan · Zhongwen Xu
|
-
|
Memory-Efficient Reinforcement Learning with Priority based on Surprise and On-policyness
(
Poster
)
>
link
SlidesLive Video
|
Ryosuke Unno · Yoshimasa Tsuruoka
|
-
|
Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer
(
Poster
)
>
link
SlidesLive Video
|
Hayato Watahiki · Ryo Iwase · Ryosuke Unno · Yoshimasa Tsuruoka
|
-
|
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mhairi Dunion · Trevor McInroe · Kevin Sebastian Luck · Josiah Hanna · Stefano Albrecht
|
-
|
Toward Effective Deep Reinforcement Learning for 3D Robotic Manipulation: End-to-End Learning from Multimodal Raw Sensory Data
(
Poster
)
>
link
SlidesLive Video
|
Samyeul Noh · Hyun Myung
|
-
|
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
(
Poster
)
>
link
SlidesLive Video
|
Dolton Fernandes · Pramod Kaushik · Harsh Shukla · Raju Bapi
|
-
|
A Ranking Game for Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Harshit Sushil Sikchi · Akanksha Saran · Wonjoon Goo · Scott Niekum
|
-
|
Implicit Offline Reinforcement Learning via Supervised Learning
(
Poster
)
>
link
|
Alexandre Piche · Rafael Pardinas · David Vazquez · Igor Mordatch · Chris Pal
|
-
|
Distributional deep Q-learning with CVaR regression
(
Poster
)
>
link
SlidesLive Video
|
Mastane Achab · REDA ALAMI · YASSER ABDELAZIZ DAHOU DJILALI · Kirill Fedyanin · Eric Moulines · Maxim Panov
|
-
|
The Surprising Effectiveness of Latent World Models for Continual Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Samuel Kessler · Piotr Miłoś · Jack Parker-Holder · S Roberts
|
-
|
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization
(
Poster
)
>
link
SlidesLive Video
|
Lunjun Zhang · Bradly Stadie
|
-
|
Perturbed Quantile Regression for Distributional Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Taehyun Cho · Seungyub Han · Heesoo Lee · Kyungjae Lee · Jungwoo Lee
|
-
|
Concept-based Understanding of Emergent Multi-Agent Behavior
(
Poster
)
>
link
SlidesLive Video
|
Niko Grupen · Shayegan Omidshafiei · Natasha Jaques · Been Kim
|
-
|
Constrained Imitation Q-learning with Earth Mover’s Distance reward
(
Poster
)
>
link
SlidesLive Video
|
WENYAN Yang · Nataliya Strokina · Joni Pajarinen · Joni-kristian Kamarainen
|
-
|
Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement
(
Poster
)
>
link
SlidesLive Video
|
Michael Chang · Alyssa L Dayan · Franziska Meier · Tom Griffiths · Sergey Levine · Amy Zhang
|
-
|
SoftTreeMax: Policy Gradient with Tree Search
(
Poster
)
>
link
SlidesLive Video
|
Gal Dalal · Assaf Hallak · Shie Mannor · Gal Chechik
|
-
|
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
(
Poster
)
>
link
SlidesLive Video
|
Philipp Siedler
|
-
|
Hypernetwork-PPO for Continual Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Philemon Schöpf · Sayantan Auddy · Jakob Hollenstein · Antonio Rodriguez-sanchez
|
-
|
DRL-EPANET: Deep reinforcement learning for optimal control at scale in Water Distribution Systems
(
Poster
)
>
link
SlidesLive Video
|
Anas Belfadil · David Modesto · Jose Martin H.
|
-
|
Actor Prioritized Experience Replay
(
Poster
)
>
link
SlidesLive Video
|
Baturay Saglam · Furkan Burak Mutlu · Doğan Can Çiçek · Suleyman Kozat
|
-
|
Model and Method: Training-Time Attack for Cooperative Multi-Agent Reinforcement Learning
(
Poster
)
>
link
|
Siyang Wu · Tonghan Wang · Xiaoran Wu · Jingfeng ZHANG · Yujing Hu · Changjie Fan · Chongjie Zhang
|
-
|
Converging to Unexploitable Policies in Continuous Control Adversarial Games
(
Poster
)
>
link
SlidesLive Video
|
Maxwell Goldstein · Noam Brown
|
-
|
Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Chaitanya Kharyal · Tanmay Sinha · Vijaya Sai Krishna Gottipati · Srijita Das · Matthew Taylor
|
-
|
On All-Action Policy Gradients
(
Poster
)
>
link
SlidesLive Video
|
Michal Nauman · Marek Cygan
|
-
|
A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Benjamin Eysenbach · Matthieu Geist · Russ Salakhutdinov · Sergey Levine
|
-
|
The Benefits of Model-Based Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Kenny Young · Aditya Ramesh · Louis Kirsch · Jürgen Schmidhuber
|
-
|
Training graph neural networks with policy gradients to perform tree search
(
Poster
)
>
link
SlidesLive Video
|
Matthew Macfarlane · Diederik Roijers · Herke van Hoof
|
-
|
Co-Imitation: Learning Design and Behaviour by Imitation
(
Poster
)
>
link
SlidesLive Video
|
Chang Rajani · Karol Arndt · David Blanco-Mulero · Kevin Sebastian Luck · Ville Kyrki
|
-
|
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mingqi Yuan · Bo Li · Xin Jin · Wenjun Zeng
|
-
|
BLaDE: Robust Exploration via Diffusion Models
(
Poster
)
>
link
SlidesLive Video
|
Bilal Piot · Zhaohan Guo · Shantanu Thakoor · Mohammad Gheshlaghi Azar
|
-
|
Learning Semantics-Aware Locomotion Skills from Human Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Yuxiang Yang · Xiangyun Meng · Wenhao Yu · Tingnan Zhang · Jie Tan · Byron Boots
|
-
|
Imitation from Observation With Bootstrapped Contrastive Learning
(
Poster
)
>
link
|
Medric Sonwa · Johanna Hansen · Eugene Belilovsky
|
-
|
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
(
Poster
)
>
link
SlidesLive Video
|
Toygun Basaklar · Suat Gumussoy · Umit Ogras
|
-
|
Improving Assistive Robotics with Deep Reinforcement Learning
(
Poster
)
>
link
|
Yash Jakhotiya · Iman Haque
|
-
|
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Matthias Gerstgrasser · Tom Danino · Sarah Keren
|
-
|
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Manuel Goulão · Arlindo L Oliveira
|
-
|
Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
(
Poster
)
>
link
SlidesLive Video
|
Payal Bawa · Rafael Oliveira · Fabio Ramos
|
-
|
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
(
Poster
)
>
link
|
Minghuan Liu · Zhengbang Zhu · Menghui Zhu · Yuzheng Zhuang · Weinan Zhang · Jianye Hao
|
-
|
Guided Skill Learning and Abstraction for Long-Horizon Manipulation
(
Poster
)
>
link
SlidesLive Video
|
Shuo Cheng · Danfei Xu
|
-
|
Locally Constrained Representations in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Somjit Nath · Samira Ebrahimi Kahou
|
-
|
Sample-efficient Adversarial Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Dahuin Jung · Hyungyu Lee · Sungroh Yoon
|
-
|
Prioritizing Samples in Reinforcement Learning with Reducible Loss
(
Poster
)
>
link
SlidesLive Video
|
Shivakanth Sujit · Somjit Nath · Pedro Braga · Samira Ebrahimi Kahou
|
-
|
PCRL: Priority Convention Reinforcement Learning for Microscopically Sequencable Multi-agent Problems
(
Poster
)
>
link
SlidesLive Video
|
Xing Zhou · Hao Gao · Xin Xu · Xinglong Zhang · Hongda Jia · Dongzi Wang
|
-
|
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Zixiang Chen · Chris Junchi Li · Angela Yuan · Quanquan Gu · Michael Jordan
|
-
|
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
(
Poster
)
>
link
SlidesLive Video
|
Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Ruslan Salakhutdinov
|
-
|
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
(
Poster
)
>
link
SlidesLive Video
|
Pascal Leroy · Jonathan Pisane · Damien Ernst
|
-
|
Reinforcement Learning in System Identification
(
Poster
)
>
link
SlidesLive Video
|
Jose Martin H. · Óscar Fernandez Vicente · Sergio Perez · Anas Belfadil · Cristina Ibanez-Llano · Freddy Perozo Rondón · Jose Valle · Javier Arechalde Pelaz
|
-
|
Robust Option Learning for Adversarial Generalization
(
Poster
)
>
link
SlidesLive Video
|
Kishor Jothimurugan · Steve Hsu · Osbert Bastani · Rajeev Alur
|
-
|
Biological Neurons vs Deep Reinforcement Learning: Sample efficiency in a simulated game-world
(
Poster
)
>
link
SlidesLive Video
|
Forough Habibollahi · Moein Khajehnejad · Amitesh Gaurav · Brett J. Kagan
|
-
|
Inducing Functions through Reinforcement Learning without Task Specification
(
Poster
)
>
link
SlidesLive Video
|
Junmo Cho · Donghwan Lee · Young-Gyu Yoon
|
-
|
Learning Successor Feature Representations to Train Robust Policies for Multi-task Learning
(
Poster
)
>
link
|
Melissa Mozifian · Dieter Fox · David Meger · Fabio Ramos · Animesh Garg
|
-
|
Automated Dynamics Curriculums for Deep Reinforcement Learning
(
Poster
)
>
link
|
Sean Metzger
|
-
|
Supervised Q-Learning for Continuous Control
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Ziping Xu · Taiyi Wang · Meng Fang · Bolei Zhou
|
-
|
MOPA: a Minimalist Off-Policy Approach to Safe-RL
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Ziping Xu · Zhenghao Peng · Meng Fang · Bo Dai · Bolei Zhou
|
-
|
Novel Policy Seeking with Constrained Optimization
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Zhenghao Peng · Bolei Zhou
|
-
|
Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Taiyi Wang
|