Fri Dec 09 08:30 AM -- 09:00 AM (PST)
Tobias Gerstenberg
Fri Dec 09 09:00 AM -- 09:15 AM (PST)
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Fri Dec 09 09:15 AM -- 09:30 AM (PST)
Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
Fri Dec 09 09:30 AM -- 09:45 AM (PST)
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Fri Dec 09 09:45 AM -- 10:00 AM (PST)
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes
Fri Dec 09 11:00 AM -- 11:30 AM (PST)
Scientific Experiments in Reinforcement Learning
Fri Dec 09 11:30 AM -- 11:45 AM (PST)
Transformers are Sample-Efficient World Models
Fri Dec 09 11:45 AM -- 12:00 PM (PST)
Scaling Laws for a Multi-Agent Reinforcement Learning Model
Fri Dec 09 01:30 PM -- 02:00 PM (PST)
The World is not Uniformly Distributed; Important Implications for Deep RL
Fri Dec 09 04:00 PM -- 04:15 PM (PST)
Kristian Hartikainen
Fri Dec 09 04:15 PM -- 04:30 PM (PST)
Ilya Kostrikov, Aviral Kumar
Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing
A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations
Learning Representations for Reinforcement Learning with Hierarchical Forward Models
Simple Emergent Action Representations from Multi-Task Policy Training
Towards True Lossless Sparse Communication in Multi-Agent Systems
Pink Noise Is All You Need: Colored Noise Exploration in Deep Reinforcement Learning
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Choreographer: Learning and Adapting Skills in Imagination
Efficient Offline Policy Optimization with a Learned Model
Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer
Language Models Can Teach Themselves to Program Better
Graph Q-Learning for Combinatorial Optimization
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement
A Ranking Game for Imitation Learning
Distributional deep Q-learning with CVaR regression
Concept-based Understanding of Emergent Multi-Agent Behavior
Constrained Imitation Q-learning with Earth Mover’s Distance reward
SoftTreeMax: Policy Gradient with Tree Search
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
Hypernetwork-PPO for Continual Reinforcement Learning
DRL-EPANET: Deep reinforcement learning for optimal control at scale in Water Distribution Systems
Actor Prioritized Experience Replay
Converging to Unexploitable Policies in Continuous Control Adversarial Games
A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning
Training graph neural networks with policy gradients to perform tree search
Co-Imitation: Learning Design and Behaviour by Imitation
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Guided Skill Learning and Abstraction for Long-Horizon Manipulation
Sample-efficient Adversarial Imitation Learning
PCRL: Priority Convention Reinforcement Learning for Microscopically Sequencable Multi-agent Problems
Robust Option Learning for Adversarial Generalization
Biological Neurons vs Deep Reinforcement Learning: Sample efficiency in a simulated game-world
Inducing Functions through Reinforcement Learning without Task Specification
Informative rewards and generalization in curriculum learning
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning
Visual Reinforcement Learning with Self-Supervised 3D Representations
One-shot Visual Imitation via Attributed Waypoints and Demonstration Augmentation
Skill Machines: Temporal Logic Composition in Reinforcement Learning
Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment
Policy Architectures for Compositional Generalization in Control
Reinforcement Learning in System Identification
SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling
Building a Subspace of Policies for Scalable Continual Learning
Graph Inverse Reinforcement Learning from Diverse Videos
PnP-Nav: Plug-and-Play Policies for Generalizable Visual Navigation Across Robots
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Contrastive Value Learning: Implicit Models for Simple Offline RL
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Toward Effective Deep Reinforcement Learning for 3D Robotic Manipulation: End-to-End Learning from Multimodal Raw Sensory Data
Perturbed Quantile Regression for Distributional Reinforcement Learning
On All-Action Policy Gradients
Locally Constrained Representations in Reinforcement Learning
VI2N: A Network for Planning Under Uncertainty based on Value of Information
Analyzing the Sensitivity to Policy-Value Decoupling in Deep Reinforcement Learning Generalization
Lagrangian Model Based Reinforcement Learning
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
Temporary Goals for Exploration
Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
In the ZONE: Measuring difficulty and progression in curriculum generation
Better state exploration using action sequence equivalence
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
Fine-tuning Offline Policies with Optimistic Action Selection
Adversarial Policies Beat Professional-Level Go AIs
The Emphatic Approach to Average-Reward Policy Evaluation
Train Offline, Test Online: A Real Robot Learning Benchmark
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
Ensemble based uncertainty estimation with overlapping alternative predictions
Investigating Multi-task Pretraining and Generalization in Reinforcement Learning
Evaluating Long-Term Memory in 3D Mazes
Memory-Efficient Reinforcement Learning with Priority based on Surprise and On-policyness
Return Augmentation gives Supervised RL Temporal Compositionality
BLaDE: Robust Exploration via Diffusion Models
Guiding Exploration Towards Impactful Actions
Multi-Agent Policy Transfer via Task Relationship Modeling
Offline Reinforcement Learning on Real Robot with Realistic Data Sources
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning
Training Equilibria in Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Transformer-based World Models Are Happy With 100k Interactions
The Benefits of Model-Based Generalization in Reinforcement Learning
Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
Prioritizing Samples in Reinforcement Learning with Reducible Loss
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Bayesian Q-learning With Imperfect Expert Demonstrations
Variance Double-Down: The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning
Foundation Models for History Compression in Reinforcement Learning
Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation
AsymQ: Asymmetric Q-loss to mitigate overestimation bias in off-policy reinforcement learning
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Integrating Episodic and Global Bonuses for Efficient Exploration
Design Process is a Reinforcement Learning Problem
Efficient Deep Reinforcement Learning Requires Regulating Statistical Overfitting
Uncertainty-Driven Exploration for Generalization in Reinforcement Learning
The Surprising Effectiveness of Latent World Models for Continual Reinforcement Learning
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization
Domain Invariant Q-Learning for model-free robust continuous control under visual distractions
Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning
Offline Reinforcement Learning from Heteroskedastic Data Via Support Constraints
Imitation from Observation With Bootstrapped Contrastive Learning
Improving Assistive Robotics with Deep Reinforcement Learning
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps
Efficient Multi-Horizon Learning for Off-Policy Reinforcement Learning
Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
Revisiting Bellman Errors for Offline Model Selection
What Makes Certain Pre-Trained Visual Representations Better for Robotic Learning?
Contrastive Example-Based Control
Time-Myopic Go-Explore: Learning A State Representation for the Go-Explore Paradigm
Adversarial Cheap Talk
Giving Robots a Hand: Broadening Generalization via Hand-Centric Human Video Demonstrations
On The Fragility of Learned Reward Functions
Model and Method: Training-Time Attack for Cooperative Multi-Agent Reinforcement Learning
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Learning Successor Feature Representations to Train Robust Policies for Multi-task Learning
Automated Dynamics Curriculums for Deep Reinforcement Learning
Implicit Offline Reinforcement Learning via Supervised Learning
Deconfounded Imitation Learning
Compositional Task Generalization with Modular Successor Feature Approximators
Neural All-Pairs Shortest Path for Reinforcement Learning
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction
Scaling Covariance Matrix Adaptation MAP-Annealing to High-Dimensional Controllers
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Emergent collective intelligence from massive-agent cooperation and competition
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
Rethinking Learning Dynamics in RL using Adversarial Networks
In-context Reinforcement Learning with Algorithm Distillation
SEM2: Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines
Imitating Human Behaviour with Diffusion Models
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
Quantization-aware Policy Distillation (QPD)
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Cyclophobic Reinforcement Learning
Look Back When Surprised: Stabilizing Reverse Experience Replay for Neural Approximation
Variational Reparametrized Policy Learning with Differentiable Physics