firstbacksecondback
26 Results
Workshop
|
Robust Q-Learning against State Perturbations: a Belief-Enriched Pessimistic Approach Xiaolin Sun · Zizhan Zheng |
||
Workshop
|
Sat 14:10 |
Robust Q-Learning against State Perturbations: a Belief-Enriched Pessimistic Approach Xiaolin Sun · Zizhan Zheng |
|
Workshop
|
Scaling Offline Q-Learning with Vision Transformers Yingjie Miao · Jordi Orbay · Rishabh Agarwal · Aviral Kumar · George Tucker · Aleksandra Faust |
||
Workshop
|
Scaling Offline Q-Learning with Vision Transformers Yingjie Miao · Jordi Orbay · Rishabh Agarwal · Aviral Kumar · George Tucker · Aleksandra Faust |
||
Poster
|
Wed 15:00 |
Bayesian Risk-Averse Q-Learning with Streaming Observations Yuhao Wang · Enlu Zhou |
|
Poster
|
Wed 15:00 |
Double Gumbel Q-Learning David Yu-Tung Hui · Aaron Courville · Pierre-Luc Bacon |
|
Poster
|
Tue 8:45 |
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation GUOJUN XIONG · Jian Li |
|
Poster
|
Wed 8:45 |
Residual Q-Learning: Offline and Online Policy Customization without Value Chenran Li · Chen Tang · Haruki Nishimura · Jean Mercat · Masayoshi TOMIZUKA · Wei Zhan |
|
Workshop
|
DGFN: Double Generative Flow Networks Elaine Lau · Nikhil Murali Vemgal · Doina Precup · Emmanuel Bengio |
||
Workshop
|
DGFN: Double Generative Flow Networks Elaine Lau · Nikhil Murali Vemgal · Doina Precup · Emmanuel Bengio |
||
Poster
|
Thu 8:45 |
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage Masatoshi Uehara · Nathan Kallus · Jason Lee · Wen Sun |
|
Workshop
|
Improving dispersive readout of a superconducting qubit by machine learning on path signature Shuxiang Cao · Zhen Shao · Jian-Qing Zheng · Mustafa Bakr · Peter Leek · Terry Lyons |