firstbacksecondback
99 Results
Workshop
|
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning Meng Feng · Viraj Parimi · Brian Williams |
||
Workshop
|
Hamiltonian Matching for Symplectic Neural Integrators Priscilla Canizares · Davide Murari · Carola-Bibiane Schönlieb · Ferdia Sherry · Zakhar Shumaylov |
||
Poster
|
Fri 11:00 |
Can an AI Agent Safely Run a Government? Existence of Probably Approximately Aligned Policies Frédéric Berdoz · Roger Wattenhofer |
|
Poster
|
Fri 11:00 |
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts Rachel S.Y. Teo · Tan Nguyen |
|
Poster
|
Wed 16:30 |
Belief-State Query Policies for User-Aligned POMDPs Daniel Bramblett · Siddharth Srivastava |
|
Poster
|
Fri 16:30 |
Truncated Variance Reduced Value Iteration Yujia Jin · Ishani Karmarkar · Aaron Sidford · Jiayi Wang |
|
Poster
|
Thu 11:00 |
Risk-sensitive control as inference with Rényi divergence Kaito Ito · Kenji Kashima |
|
Poster
|
Wed 16:30 |
Cloud Object Detector Adaptation by Integrating Different Source Knowledge Shuaifeng Li · Mao Ye · Lihua Zhou · Nianxin Li · Siying Xiao · Song Tang · Xiatian Zhu |
|
Poster
|
Wed 16:30 |
SeeA*: Efficient Exploration-Enhanced A* Search by Selective Sampling Dengwei Zhao · Shikui Tu · Lei Xu |
|
Poster
|
Thu 11:00 |
Predicting Future Actions of Reinforcement Learning Agents Stephen Chung · Scott Niekum · David Krueger |
|
Poster
|
Fri 16:30 |
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning Andreas Schlaginhaufen · Maryam Kamgarpour |
|
Poster
|
Thu 11:00 |
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates Jincheng Mei · Bo Dai · Alekh Agarwal · Sharan Vaswani · Anant Raj · Csaba Szepesvari · Dale Schuurmans |