firstbacksecondback
24 Results
Poster
|
Tue 9:00 |
The Pitfalls of Regularization in Off-Policy TD Learning Gaurav Manek · J. Zico Kolter |
|
Workshop
|
Balanced Off-Policy Evaluation for Personalized Pricing Adam N. Elmachtoub · Vishal Gupta · YUNFAN ZHAO |
||
Workshop
|
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction Jiachen Li · Shuo Cheng · Zhenyu Liao · Huayan Wang · William Yang Wang · Qinxun Bai |
||
Poster
|
Thu 14:00 |
Off-Policy Evaluation with Policy-Dependent Optimization Response Wenshuo Guo · Michael Jordan · Angela Zhou |
|
Poster
|
Wed 9:00 |
Off-Policy Evaluation with Deficient Support Using Side Information Nicolò Felicioni · Maurizio Ferrari Dacrema · Marcello Restelli · Paolo Cremonesi |
|
Poster
|
Wed 9:00 |
Off-Policy Evaluation for Action-Dependent Non-stationary Environments Yash Chandak · Shiv Shankar · Nathaniel Bastian · Bruno da Silva · Emma Brunskill · Philip Thomas |
|
Poster
|
Thu 9:00 |
MoCoDA: Model-based Counterfactual Data Augmentation Silviu Pitis · Elliot Creager · Ajay Mandlekar · Animesh Garg |
|
Poster
|
On the role of overparameterization in off-policy Temporal Difference learning with linear function approximation Valentin Thomas |
||
Poster
|
Thu 9:00 |
Conformal Off-Policy Prediction in Contextual Bandits Muhammad Faaiz Taufiq · Jean-Francois Ton · Rob Cornish · Yee Whye Teh · Arnaud Doucet |
|
Poster
|
Thu 9:00 |
A Unifying Framework of Off-Policy General Value Function Evaluation Tengyu Xu · Zhuoran Yang · Zhaoran Wang · Yingbin Liang |
|
Poster
|
Tue 14:00 |
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models Rui Miao · Zhengling Qi · Xiaoke Zhang |
|
Poster
|
Wed 14:00 |
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP Fan Chen · Junyu Zhang · Zaiwen Wen |