Loss-to-Loss Prediction: Language model scaling laws across datasets

David Brandfonbrener · Nikhil Anand · Nikhil Vyas · Eran Malach · Sham Kakade

Abstract
