Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting

Michael Hu · Ben Van Durme · Jacob Andreas · Harsh Jhamtani

Abstract
