Workshop
|
|
SoftTreeMax: Policy Gradient with Tree Search
Gal Dalal · Assaf Hallak · Shie Mannor · Gal Chechik
|
|
Workshop
|
|
On All-Action Policy Gradients
Michal Nauman · Marek Cygan
|
|
Poster
|
Tue 14:00
|
Truly Deterministic Policy Optimization
Ehsan Saleh · Saba Ghaffari · Tim Bretl · Matthew West
|
|
Poster
|
Wed 14:00
|
Alleviating "Posterior Collapse'' in Deep Topic Models via Policy Gradient
Yewen Li · Chaojie Wang · Zhibin Duan · Dongsheng Wang · Bo Chen · Bo An · Mingyuan Zhou
|
|
Poster
|
Wed 14:00
|
The Role of Baselines in Policy Gradient Optimization
Jincheng Mei · Wesley Chung · Valentin Thomas · Bo Dai · Csaba Szepesvari · Dale Schuurmans
|
|
Workshop
|
|
Policy gradient finds global optimum of nearly linear-quadratic control systems
Yinbin Han · Meisam Razaviyayn · Renyuan Xu
|
|
Workshop
|
|
Training graph neural networks with policy gradients to perform tree search
Matthew Macfarlane · Diederik Roijers · Herke van Hoof
|
|
Poster
|
Wed 9:00
|
Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin · Oya Celiktutan
|
|
Poster
|
Wed 14:00
|
DNA: Proximal Policy Optimization with a Dual Network Architecture
Matthew Aitchison · Penny Sweetser
|
|
Poster
|
Thu 14:00
|
Batch size-invariance for policy optimization
Jacob Hilton · Karl Cobbe · John Schulman
|
|
Poster
|
Thu 9:00
|
Global Convergence of Direct Policy Search for State-Feedback H∞ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xingang Guo · Bin Hu
|
|
Poster
|
Tue 9:00
|
On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games
Runyu Zhang · Jincheng Mei · Bo Dai · Dale Schuurmans · Na Li
|
|