Poster
|
Thu 11:00
|
Periodic agent-state based Q-learning for POMDPs
Amit Sinha · Matthieu Geist · Aditya Mahajan
|
|
Poster
|
Wed 16:30
|
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
Liang-Hsuan Tseng · En-Pei Hu · Cheng-Han Chiang · Yuan Tseng · Hung-yi Lee · Lin-shan Lee · Shao-Hua Sun
|
|
Workshop
|
|
Information-Theoretic Generalization Bounds for Batch Reinforcement Learning
Xingtu Liu
|
|
Poster
|
Fri 16:30
|
Truncated Variance Reduced Value Iteration
Yujia Jin · Ishani Karmarkar · Aaron Sidford · Jiayi Wang
|
|
Poster
|
Thu 11:00
|
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang · Nan Jiang
|
|
Poster
|
Thu 11:00
|
Risk-sensitive control as inference with Rényi divergence
Kaito Ito · Kenji Kashima
|
|
Poster
|
Wed 16:30
|
Reinforcement Learning with Lookahead Information
Nadav Merlis
|
|
Poster
|
Wed 16:30
|
Deterministic Policies for Constrained Reinforcement Learning in Polynomial Time
Jeremy McMahan
|
|
Poster
|
Fri 11:00
|
Can an AI Agent Safely Run a Government? Existence of Probably Approximately Aligned Policies
Frédéric Berdoz · Roger Wattenhofer
|
|
Poster
|
Wed 16:30
|
Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality
Audrey Huang · Nan Jiang
|
|
Oral
|
Wed 15:50
|
The Sample-Communication Complexity Trade-off in Federated Q-Learning
Sudeep Salgia · Yuejie Chi
|
|
Poster
|
Fri 11:00
|
Regularized Q-Learning
Han-Dong Lim · Donghwan Lee
|
|