Poster | Thu 14:00 | Instability and Local Minima in GAN Training with Kernel Discriminators | Evan Becker · Parthe Pandit · Sundeep Rangan · Alyson Fletcher
Poster | Thu 9:00 | The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models | Conglong Li · Minjia Zhang · Yuxiong He
Poster | Tue 14:00 | Surprising Instabilities in Training Deep Networks and a Theoretical Analysis | Yuxin Sun · DONG LAO · Ganesh Sundaramoorthi · Anthony Yezzi