26 Results
Poster
|
Tue 15:15 |
Weakly Coupled Deep Q-Networks Ibrahim El Shar · Daniel Jiang |
|
Tutorial
|
Mon 11:45 |
Machine Learning for Theorem Proving Zhangir Azerbayev · Emily First · Albert Q. Jiang · Kaiyu Yang · Anima Anandkumar · Noah Goodman · Alex Sanchez-Stern · Dawn Song · Sean Welleck |
|
Poster
|
Thu 8:45 |
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL Yang Yue · Rui Lu · Bingyi Kang · Shiji Song · Gao Huang |
|
Poster
|
Wed 15:00 |
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions Tongxin Li · Yiheng Lin · Shaolei Ren · Adam Wierman |
|
Poster
|
Thu 8:45 |
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning Dohyeok Lee · Seungyub Han · Taehyun Cho · Jungwoo Lee |
|
Poster
|
Tue 15:15 |
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning Jianzhun Shao · Yun Qu · Chen Chen · Hongchang Zhang · Xiangyang Ji |
|
Poster
|
Tue 15:15 |
Bayesian Learning via Q-Exponential Process Shuyi Li · Michael O'Connor · Shiwei Lan |
|
Poster
|
Tue 8:45 |
Online RL in Linearly qπ-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore Gellert Weisz · András György · Csaba Szepesvari |
|
Oral
|
Tue 8:30 |
Online RL in Linearly qπ-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore Gellert Weisz · András György · Csaba Szepesvari |
|
Poster
|
Wed 8:45 |
TaskMet: Task-driven Metric Learning for Model Learning Dishank Bansal · Ricky T. Q. Chen · Mustafa Mukadam · Brandon Amos |
|
Workshop
|
Magnushammer: A Transformer-Based Approach to Premise Selection Maciej Mikuła · Szymon Antoniak · Szymon Tworkowski · Bartosz Piotrowski · Albert Q. Jiang · Jin Zhou · Christian Szegedy · Łukasz Kuciński · Piotr Miłoś · Yuhuai Wu |
|
Poster
|
Tue 8:45 |
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ϵ-Greedy Exploration Shuai Zhang · Hongkang Li · Meng Wang · Miao Liu · Pin-Yu Chen · Songtao Lu · Sijia Liu · Keerthiram Murugesan · Subhajit Chaudhury