NeurIPS 2023 Schedule

(3 events)

Oral

Tue Dec 12 08:00 AM -- 08:15 AM (PST) @ Hall C2 (level 1 gate 9 south of food court)

Ordering-based Conditions for Global Convergence of Policy Gradient Methods

In Oral 1A RL

Jincheng Mei · Bo Dai · Alekh Agarwal · Mohammad Ghavamzadeh · Csaba Szepesvari · Dale Schuurmans

[Slides] [OpenReview]

Oral

Tue Dec 12 08:15 AM -- 08:30 AM (PST) @ Hall C2 (level 1 gate 9 south of food court)

When Demonstrations meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning

In Oral 1A RL

Siliang Zeng · Chenliang Li · Alfredo Garcia · Mingyi Hong

[OpenReview]

Oral

Tue Dec 12 08:30 AM -- 08:45 AM (PST) @ Hall C2 (level 1 gate 9 south of food court)

Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore

In Oral 1A RL

Gellert Weisz · András György · Csaba Szepesvari

[OpenReview]