Skip to yearly menu bar Skip to main content


Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting

Michael Hu ⋅ Ben Van Durme ⋅ Jacob Andreas ⋅ Harsh Jhamtani

Abstract

Chat is not available.