Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Optimization for ML Workshop

Understanding Critical Batch Sizes: Scheduling and Batch-Size Invariance in Data-constrained Pre-training

Hanlin Zhang ⋅ Depen Morwani ⋅ Nikhil Vyas ⋅ Jingfeng Wu ⋅ Difan Zou ⋅ Udaya Ghai ⋅ Dean Foster ⋅ Sham Kakade

Abstract

Chat is not available.