Workshop
|
SoftTreeMax: Policy Gradient with Tree Search Gal Dalal · Assaf Hallak · Shie Mannor · Gal Chechik |
||
Workshop
|
On All-Action Policy Gradients Michal Nauman · Marek Cygan |
||
Poster
|
Tue 14:00 |
Truly Deterministic Policy Optimization Ehsan Saleh · Saba Ghaffari · Tim Bretl · Matthew West |
|
Poster
|
Wed 14:00 |
Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient Yewen Li · Chaojie Wang · Zhibin Duan · Dongsheng Wang · Bo Chen · Bo An · Mingyuan Zhou |
|
Poster
|
Wed 14:00 |
The Role of Baselines in Policy Gradient Optimization Jincheng Mei · Wesley Chung · Valentin Thomas · Bo Dai · Csaba Szepesvari · Dale Schuurmans |
|
Workshop
|
Policy gradient finds global optimum of nearly linear-quadratic control systems Yinbin Han · Meisam Razaviyayn · Renyuan Xu |
||
Workshop
|
Training graph neural networks with policy gradients to perform tree search Matthew Macfarlane · Diederik Roijers · Herke van Hoof |
||
Poster
|
Wed 9:00 |
Policy Gradient With Serial Markov Chain Reasoning Edoardo Cetin · Oya Celiktutan |
|
Poster
|
Wed 14:00 |
DNA: Proximal Policy Optimization with a Dual Network Architecture Matthew Aitchison · Penny Sweetser |
|
Poster
|
Thu 14:00 |
Batch size-invariance for policy optimization Jacob Hilton · Karl Cobbe · John Schulman |
|
Poster
|
Thu 9:00 |
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential Xingang Guo · Bin Hu |
|
Poster
|
Wed 14:00 |
On the convergence of policy gradient methods to Nash equilibria in general stochastic games Angeliki Giannou · Kyriakos Lotidis · Panayotis Mertikopoulos · Emmanouil-Vasileios Vlatakis-Gkaragkounis |