`

Timezone: »

 
Contributed talks in Session 4 (Zoom)
Quanquan Gu · Agnieszka Słowik · Jacques Chen · Neha Wadia · Difan Zou

Mon Dec 13 02:00 PM -- 02:30 PM (PST) @ None

Oral (10 min)

  • On the Relation between Distributionally Robust Optimization and Data Curation, Agnieszka Slowik

Spotlights (5 min)

  • Heavy-tailed noise does not explain the gap between SGD and Adam on Transformers, Jacques Chen
  • Optimization with Adaptive Step Size Selection from a Dynamical Systems Perspective, Neha Wadia
  • Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization, Difan Zou

There will be a Q&A in the last 5 minutes for all speakers. Abstracts for the talks are below the schedule.

Author Information

Quanquan Gu (UCLA)
Agnieszka Słowik (Department of Computer Science and Technology University of Cambridge)
Jacques Chen (University of British Columbia)
Neha Wadia (University of California, Berkeley)
Difan Zou (University of California, Los Angeles)

More from the Same Authors