NeurIPS 2022

Workshop

SoftTreeMax: Policy Gradient with Tree Search
Gal Dalal · Assaf Hallak · Shie Mannor · Gal Chechik

Workshop

On All-Action Policy Gradients
Michal Nauman · Marek Cygan

Poster

Tue 14:00

Truly Deterministic Policy Optimization
Ehsan Saleh · Saba Ghaffari · Tim Bretl · Matthew West

Poster

Wed 14:00

Alleviating "Posterior Collapse'' in Deep Topic Models via Policy Gradient
Yewen Li · Chaojie Wang · Zhibin Duan · Dongsheng Wang · Bo Chen · Bo An · Mingyuan Zhou

Poster

Wed 14:00

The Role of Baselines in Policy Gradient Optimization
Jincheng Mei · Wesley Chung · Valentin Thomas · Bo Dai · Csaba Szepesvari · Dale Schuurmans

Workshop

Policy gradient finds global optimum of nearly linear-quadratic control systems
Yinbin Han · Meisam Razaviyayn · Renyuan Xu

Workshop

Training graph neural networks with policy gradients to perform tree search
Matthew Macfarlane · Diederik Roijers · Herke van Hoof

Poster

Wed 9:00

Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin · Oya Celiktutan

Poster

Wed 14:00

DNA: Proximal Policy Optimization with a Dual Network Architecture
Matthew Aitchison · Penny Sweetser

Poster

Thu 14:00

Batch size-invariance for policy optimization
Jacob Hilton · Karl Cobbe · John Schulman

Poster

Thu 9:00

Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xingang Guo · Bin Hu

Poster

Tue 9:00

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games
Runyu Zhang · Jincheng Mei · Bo Dai · Dale Schuurmans · Na Li

Main Navigation

16 Results