

Poster

A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation

Richard Sutton · Csaba Szepesvari · Hamid R Maei
2008 Poster

