Skip to yearly menu bar Skip to main content


Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Sean McLeish · Leon Li · John Kirchenbauer · Dayal Singh Kalra · Brian Bartoldson · Bhavya Kailkhura · Avi Schwarzschild · Jonas Geiping · Micah Goldblum · Tom Goldstein

Abstract

Chat is not available.