Skip to yearly menu bar Skip to main content


(3 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Tue Dec 12 08:00 AM -- 08:15 AM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
Ordering-based Conditions for Global Convergence of Policy Gradient Methods
Jincheng Mei · Bo Dai · Alekh Agarwal · Mohammad Ghavamzadeh · Csaba Szepesvari · Dale Schuurmans
[ Slides [ OpenReview
Oral
Tue Dec 12 08:15 AM -- 08:30 AM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
When Demonstrations meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng · Chenliang Li · Alfredo Garcia · Mingyi Hong
[ OpenReview
Oral
Tue Dec 12 08:30 AM -- 08:45 AM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore
Gellert Weisz · András György · Csaba Szepesvari
[ OpenReview