Policy Mirror Descent for Regularized RL: A Generalized Framework with Linear Convergence

Wenhao Zhan ⋅ Shicong Cen ⋅ Baihe Huang ⋅ Yuxin Chen ⋅ Jason Lee ⋅ Yuejie Chi

Abstract
