Skip to yearly menu bar Skip to main content


Oral

Non-delusional Q-learning and value-iteration

Tyler Lu ⋅ Dale Schuurmans ⋅ Craig Boutilier
2018 Oral
[ Video

Abstract

Chat is not available.