Workshop
|
|
RGP: Achieving Memory-Efficient Model Fine-tuning Via Randomized Gradient Projection
Ali Saheb Pasand · Pouya Bashivan
|
|
Workshop
|
|
Memory-Efficient Large Language Model (LLM) Training and Fine-Tuning via Gradient Subspace Tracking
Sahar Rajabi · Sirisha Rambhatla
|
|
Workshop
|
|
Grow to Compress? Efficient Training of Robust Networks on the Edge
Vignesh Sundaresha · Naresh Shanbhag
|
|
Poster
|
Wed 16:30
|
4-bit Shampoo for Memory-Efficient Network Training
Sike Wang · Pan Zhou · Jia Li · Hua Huang
|
|
Poster
|
|
Efficient Sketches for Training Data Attribution and Studying the Loss Landscape
Andrea Schioppa
|
|
Workshop
|
|
FlashDP: Memory-Efficient and High-Throughput DP-SGD Training for Large Language Models
Liangyu Wang · Junxiao Wang · Jie Ren · Zihang Xiang · David Keyes · Di Wang
|
|
Poster
|
Fri 16:30
|
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Yuezhou Hu · Jun Zhu · Jianfei Chen
|
|
Workshop
|
|
Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Robert Joseph George · David Pitt · Jiawei Zhao · Jean Kossaifi · cheng Luo · Yuandong Tian · Animashree Anandkumar
|
|
Workshop
|
|
FastDraft: How to Train Your Draft
Ofir Zafrir · Igor Margulis · Dorin Shteyman · Guy Boudoukh
|
|
Workshop
|
|
Less is Enough: Adapting Pre-trained Vision Transformers for Audio-Visual Speaker Verification
Gnana Praveen Rajasekhar · MD JAHANGIR ALAM
|
|
Workshop
|
Sat 13:36
|
Post-Training Statistical Calibration for Higher Activation Sparsity
Vui Seng Chua · Yujie Pan · Nilesh Jain · Vui Seng Chua
|
|
Workshop
|
|
Scaling laws for post-training quantized large language models
Zifei Xu · Alexander Lan · Wanzin Yazar · Tristan Webb · Sayeh Sharify · Xin Wang
|
|