Workshop
Fine-Tuning in Modern Machine Learning: Principles and Scalability
Fanghui Liu 路 Grigorios Chrysos 路 Beidi Chen 路 Rebekka Burkholz 路 Saleh Soltan 路 Angeliki Giannou 路 Masashi Sugiyama 路 Volkan Cevher
East Exhibition Hall A
Sat 14 Dec, 8:50 a.m. PST
This workshop aims to contribute to the recent radical paradigm shift for fine-tuning in modern machine learning, both theoretically, computationally, and systematically. It encourages researchers to push forward the frontiers of theoretical understanding of fine-tuning, devising expeditious and resource-efficient inference and fine-tuning methods in machine learning systems, enabling their deployment within constrained computational resources.
Chat is not available.
Timezone: America/Los_Angeles
Schedule
Sat 8:50 a.m. - 9:00 a.m.
|
Opening remarks
(
Opening session
)
>
SlidesLive Video |
Fanghui Liu 馃敆 |
Sat 9:00 a.m. - 9:40 a.m.
|
Invited talk 1 - Azalia Mirhoseini
(
Invited talk
)
>
SlidesLive Video |
Azalia Mirhoseini 馃敆 |
Sat 9:40 a.m. - 10:20 a.m.
|
Invited talk 2 - Jason Lee
(
Invited talk
)
>
SlidesLive Video |
Jason Lee 馃敆 |
Sat 10:29 a.m. - 11:30 a.m.
|
Oral presentation
(
Oral session
)
>
|
馃敆 |
Sat 10:30 a.m. - 10:42 a.m.
|
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
(
Oral
)
>
link
SlidesLive Video |
Jonas H眉botter 路 Sascha Bongni 路 Ido Hakimi 路 Andreas Krause 馃敆 |
Sat 10:42 a.m. - 10:54 a.m.
|
Parameter-Efficient Fine-Tuning of State Space Models
(
Oral
)
>
link
SlidesLive Video |
Kevin Galim 路 Jungtaek Kim 路 Wonjun Kang 路 Yuchen Zeng 路 HYUNG IL KOO 路 Kangwook Lee 馃敆 |
Sat 10:54 a.m. - 11:06 a.m.
|
Entropic Distribution Matching for Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity
(
Oral
)
>
link
SlidesLive Video |
Ziniu Li 路 Congliang Chen 路 Tian Xu 路 Zeyu Qin 路 Jiancong Xiao 路 Ruoyu Sun 路 Zhiquan Luo 馃敆 |
Sat 11:06 a.m. - 11:18 a.m.
|
RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates
(
Oral
)
>
link
SlidesLive Video |
Md Kowsher 路 Tara Esmaeilbeig 路 Chun Nam Yu 路 Mojtaba Soltanalian 路 Niloofar Yousefi 馃敆 |
Sat 11:18 a.m. - 11:30 a.m.
|
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences
(
Oral
)
>
link
SlidesLive Video |
Yixin Liu 路 Argyris Oikonomou 路 Weiqiang Zheng 路 Yang Cai 路 Arman Cohan 馃敆 |
Sat 11:30 a.m. - 12:30 p.m.
|
Poster session I
(
Poster
)
>
|
馃敆 |
Sat 2:00 p.m. - 2:40 p.m.
|
Invited talk 3 - Yuandong Tian (Invited talk)
(
Invited talk
)
>
SlidesLive Video |
Yuandong Tian 馃敆 |
Sat 2:40 p.m. - 3:20 p.m.
|
Invited talk 4 - Quanquan Gu
(
Invited talk
)
>
SlidesLive Video |
Quanquan Gu 馃敆 |
Sat 3:30 p.m. - 4:30 p.m.
|
Panel discussion
(
Discussion panel
)
>
SlidesLive Video |
Danqi Chen 路 Tri Dao 路 Taiji Suzuki 路 Yuandong Tian 路 Quanquan Gu 路 Leena Chennuru Vankadara 馃敆 |
Sat 4:30 p.m. - 4:40 p.m.
|
Closing remarks
(
Closing remarks
)
>
SlidesLive Video |
Grigorios Chrysos 馃敆 |
Sat 4:40 p.m. - 5:30 p.m.
|
Poster session II
(
Poster
)
>
|
馃敆 |
-
|
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent ( Poster ) > link | Taiyi Wang 路 Zhihao Wu 路 Jianheng Liu 路 Derek Yuen 路 Jianye Hao 路 Jun Wang 路 Kun Shao 馃敆 |
-
|
REACT: Residual-Adaptive Contextual Tuning for Fast Model Adaptation in Cybersecurity ( Poster ) > link | Jiayun Zhang 路 Junshen Xu 路 Yi Fan 馃敆 |
-
|
Efficient Fine-Tuning of Behavior Cloned Policies with Reinforcement Learning from Limited Demonstrations ( Poster ) > link | Samyeul Noh 路 Seonghyun Kim 路 Ingook Jang 馃敆 |
-
|
Semi-Supervised Fine-Tuning of Vision Foundation Models with Content-Style Decomposition ( Poster ) > link | Mariia Drozdova 路 Vitaliy Kinakh 路 Yury Belousov 路 Erica Lastufka 路 Slava Voloshynovskiy 馃敆 |
-
|
Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates ( Poster ) > link | Mohamed Elsayed 路 Gautham Vasan 路 Rupam Mahmood 馃敆 |
-
|
FourierKAN outperforms MLP on Text Classification Head Fine-tuning ( Poster ) > link | Abdullah Al Imran 路 Md Farhan Ishmam 馃敆 |
-
|
Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation ( Poster ) > link | Manish Bhattarai 路 Javier E. Santos 路 Ismael Boureima 路 Daniel O'Malley 馃敆 |
-
|
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation ( Poster ) > link | Ingo Ziegler 路 Abdullatif K枚ksal 路 Desmond Elliott 路 Hinrich Schuetze 馃敆 |
-
|
Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective ( Poster ) > link | Ethan Harvey 路 Mikhail Petrov 路 Michael Hughes 馃敆 |
-
|
ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning ( Poster ) > link | Jaedong Hwang 路 Brian Cheung 路 Zhang-Wei Hong 路 Akhilan Boopathy 路 Pulkit Agrawal 路 Ila Fiete 馃敆 |
-
|
Parasite Networks: Transfer Learning in Resource-Constrained Domains ( Poster ) > link | Andrew Alini 路 Douglas E Sturim 路 Kevin Brady 路 Pooya Khorrami 馃敆 |
-
|
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences ( Poster ) > link | Daiwei Chen 路 Yi Chen 路 Aniket Rege 路 Ramya Korlakai Vinayak 馃敆 |
-
|
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ( Poster ) > link | Noam Razin 路 Sadhika Malladi 路 Adithya Bhaskar 路 Danqi Chen 路 Sanjeev Arora 路 Boris Hanin 馃敆 |
-
|
Adapting Language Models via Token Translation ( Poster ) > link | Zhili Feng 路 Tanya Marwah 路 Lester Mackey 路 David Alvarez-Melis 路 Nicolo Fusi 馃敆 |
-
|
Self-Stitching: Widely Applicable and Efficient Transfer Learning Using Stitching Layer ( Poster ) > link | Tanachai Anakewat 路 Yusuke Mukuta 路 Thomas Westfechtel 路 Tatsuya Harada 馃敆 |
-
|
FedEx-LoRA: Exact Aggregation for Federated Parameter-Efficient Fine-Tuning of Foundation Models ( Poster ) > link | Raghav Singhal 路 Kaustubh Ponkshe 路 Praneeth Vepakomma 馃敆 |
-
|
Mastering Task Arithmetic: Jp as a Key Indicator for Weight Disentanglement ( Poster ) > link | Kotaro Yoshida 路 Yuji Naraki 路 Takafumi Horie 路 Ryosuke Yamaki 路 Ryotaro Shimizu 路 Yuki Saito 路 Julian Mcauley 路 Hiroki Naganuma 馃敆 |
-
|
Uncertainty-Penalized Direct Preference Optimization ( Poster ) > link | Sam Houliston 路 Aliz茅e Pace 路 Alexander Immer 路 Gunnar R盲tsch 馃敆 |
-
|
Learning Robust Representations for Transfer in Reinforcement Learning ( Poster ) > link | Faisal Ahmed Abdelrahman Mohamed 路 Roger Creus Castanyer 路 Hongyao Tang 路 Zahra Sheikhbahaee 路 Glen Berseth 馃敆 |
-
|
Effective Text-to-Image Alignment with Quality Aware Pair Ranking ( Poster ) > link | Kunal Singh 路 Mukund Khanna 路 Pradeep Moturi 馃敆 |
-
|
PLMFit: Benchmarking Transfer Learning with Protein Language Models for Protein Engineering ( Poster ) > link | Thomas Bikias 路 Evangelos Stamkopoulos 路 Sai Reddy 馃敆 |
-
|
Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through -divergence Minimization ( Poster ) > link | Haoyuan Sun 路 Bo Xia 路 Yongzhe Chang 路 Xueqian Wang 馃敆 |
-
|
On the Transferability of Parameter-Efficient Continual Learning for Vision Transformers ( Poster ) > link | Leon Ackermann 路 Van-Linh Nguyen 馃敆 |
-
|
Improving Fine-Tuning with Latent Cluster Correction ( Poster ) > link | C茅dric Thanh 馃敆 |
-
|
Ensembling Finetuned Language Models for Text Classification ( Poster ) > link | Sebastian Pineda Arango 路 Maciej Janowski 路 Lennart Purucker 路 Arber Zela 路 Frank Hutter 路 Josif Grabocka 馃敆 |
-
|
Variational Low-Rank Adaptation Using IVON ( Poster ) > link | Bai Cong 路 Nico Daheim 路 Yuesong Shen 路 Daniel Cremers 路 Rio Yokota 路 Mohammad Emtiyaz Khan 路 Thomas M枚llenhoff 馃敆 |
-
|
UnoLoRA: Single Low-Rank Adaptation for Efficient Multitask Fine-tuning ( Poster ) > link | Akash Kamalesh 路 Anirudh Lakhotia 路 Nischal S 路 Prerana Sanjay Kulkarni 路 Gowri Srinivasa 馃敆 |
-
|
Online Fine-Tuning with Uncertainty Quantification for Offline Pre-Trained Agents ( Poster ) > link | Ingook Jang 路 Seonghyun Kim 路 Samyeul Noh 馃敆 |
-
|
Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape ( Poster ) > link | Tao Li 路 Zhengbao He 路 Yujun Li 路 Yasheng Wang 路 Lifeng Shang 路 Xiaolin Huang 馃敆 |
-
|
TOU: Truncated-factorized reduction for an efficient-parameter model fine-tuning ( Poster ) > link | Phuong Thi-Mai Nguyen 路 Minh-Son Dao 路 Koji Zettsu 馃敆 |
-
|
MPLoRA: Orthogonal Multi-Path Low-Rank Adaptation for Parameter Efficient Fine-Tuning ( Poster ) > link | Junhan Shi 路 Fulin Wang 路 Qing Li 路 Yong Jiang 馃敆 |
-
|
Towards Natural Machine Unlearning ( Poster ) > link | Zhengbao He 路 Tao Li 路 Xinwen Cheng 路 Zhehao Huang 路 Xiaolin Huang 馃敆 |
-
|
Navigating Parameter Space with Geodesic Interpolation: A New Approach to Efficient Fine-Tuning ( Poster ) > link | Sophia Abraham 馃敆 |
-
|
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs ( Poster ) > link | Yung-Chieh Chan 路 George Pu 路 Apaar Shanker 路 Parth Suresh 路 Penn Jenks 路 John Heyer 路 Sam Denton 馃敆 |
-
|
Investigating the Role of Fine-Tuning in Addressing the Gap Between Synthetic and Real Data in Generative Foundation Models ( Poster ) > link | Leonhard Hennicke 路 Christian Medeiros Adriano 路 Holger Giese 路 Lukas Schott 路 Jan Koehler 馃敆 |
-
|
Skip Transformers: Efficient Inference through Skip-Routing ( Poster ) > link | Matthew Peroni 路 Dimitris Bertsimas 馃敆 |
-
|
Evaluating Fine-Tuning Efficiency of Human-Inspired Learning Strategies in Medical Question Answering ( Poster ) > link | Yushi Yang 路 Andrew M. Bean 路 Robert McCraith 路 Adam Mahdi 馃敆 |
-
|
Optimizing Small Language Models for In-Vehicle Function-Calling ( Poster ) > link | Yahya SOWTI KHIABANI 路 Farris Atif 路 Chieh Hsu 路 Sven Stahlmann 路 Tobias Michels 路 Sebastian Kramer 路 Benedikt Heidrich 路 M. Saquib Sarfraz 路 Julian Merten 路 Faezeh Tafazzoli 馃敆 |
-
|
Entropic Distribution Matching for Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity ( Poster ) > link | Ziniu Li 路 Congliang Chen 路 Tian Xu 路 Zeyu Qin 路 Jiancong Xiao 路 Ruoyu Sun 路 Zhiquan Luo 馃敆 |
-
|
Accelerating Direct Preference Optimization with Prefix Sharing ( Poster ) > link | Franklin Wang 路 Sumanth Hegde 馃敆 |
-
|
E-Tamba: Efficient Transformer-Mamba Layer Transplantation ( Poster ) > link | DAZHI PENG 路 Hangrui Cao 馃敆 |
-
|
Characterizing the Training Dynamics of Private Fine-tuning with Langevin diffusion ( Poster ) > link | Shuqi Ke 路 Charlie Hou 路 Sewoong Oh 路 Giulia Fanti 馃敆 |
-
|
Understanding Visual Concepts Across Models ( Poster ) > link | Brandon Trabucco 路 Max Gurinas 路 Kyle Doherty 路 Ruslan Salakhutdinov 馃敆 |
-
|
Towards Long-Context Time Series Foundation Models With A Handful Of Additional Parameters ( Poster ) > link | Nina 呕ukowska 路 Mononito Goswami 路 Michal Wilinski 路 Willa Potosnak 路 Artur Dubrawski 馃敆 |
-
|
Fine tuning language models to align fidelity and efficiency of generative retrieval in multi-turn dialogues ( Poster ) > link | Jeremy Curuksu 馃敆 |
-
|
A Meta-Algorithm for Aligning LLMs with General Preferences ( Poster ) > link | Yixin Liu 路 Argyris Oikonomou 路 Weiqiang Zheng 路 Yang Cai 路 Arman Cohan 馃敆 |
-
|
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs ( Poster ) > link | Jonas H眉botter 路 Sascha Bongni 路 Ido Hakimi 路 Andreas Krause 馃敆 |
-
|
Early Exiting in Deep Neural Networks via Dirichlet-based Uncertainty Quantification ( Poster ) > link | Feng Xia 路 Jake Snell 路 Tom Griffiths 馃敆 |
-
|
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF ( Poster ) > link | Heyang Zhao 路 Chenlu Ye 路 Quanquan Gu 路 Tong Zhang 馃敆 |
-
|
Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning ( Poster ) > link | Simran Kaur 路 Simon Park 路 Anirudh Goyal 路 Sanjeev Arora 馃敆 |
-
|
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ( Poster ) > link | Fabian Paischer 路 Lukas Hauzenberger 路 Thomas Schmied 路 Benedikt Alkin 路 Marc Deisenroth 路 Sepp Hochreiter 馃敆 |
-
|
A Layer Selection Approach to Test Time Adaptation ( Poster ) > link | Sabyasachi Sahoo 路 Mostafa ElAraby 路 JONAS NGNAWE 路 Yann Pequignot 路 Frederic Precioso 路 Christian Gagn茅 馃敆 |
-
|
Scalability of memorization-based machine unlearning ( Poster ) > link | KAIRAN ZHAO 路 Peter Triantafillou 馃敆 |
-
|
Noise Stability Optimization for Finding Flat Minima: A Hessian-based Regularization Approach ( Poster ) > link | Hongyang Zhang 路 Dongyue Li 路 Zhenshuo Zhang 馃敆 |
-
|
Efficient Fine-Tuning of CNN-based Foundation Models for Segmentation in 3D Medical Images ( Poster ) > link | Mees Hudepohl 路 Suraj Pai 路 Heysem Kaya 路 Hugo Aerts 馃敆 |
-
|
TreeTop: Topology-Aware Fine-Tuning for LLM Conversation Tree Understanding ( Poster ) > link | Jashn Arora 路 Rahul Madhavan 路 Karthikeyan Shanmugam 路 John Palowitch 路 Manish Jain 馃敆 |
-
|
Hierarchical Unlearning Framework for Multi-Class Classification ( Poster ) > link | Abraham Chan 路 Arpan Gujarati 路 Karthik Pattabiraman 路 Sathish Gopalakrishnan 馃敆 |
-
|
Model Soup for Better RLHF: Weight Space Averaging to Improve Alignment in LLMs ( Poster ) > link | Atoosa Chegini 路 Hamid Kazemi 路 Iman Mirzadeh 路 Dong Yin 路 Maxwell Horton 路 Moin Nabi 路 Mehrdad Farajtabar 路 Keivan Alizadeh vahid 馃敆 |
-
|
Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization ( Poster ) > link | Hao Sun 路 Thomas Pouplin 路 Nicol谩s Astorga 路 Tennison Liu 路 Mihaela van der Schaar 馃敆 |
-
|
RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates ( Poster ) > link | Md Kowsher 路 Tara Esmaeilbeig 路 Chun Nam Yu 路 Mojtaba Soltanalian 路 Niloofar Yousefi 馃敆 |
-
|
An empirical study of CLIP fine-tuning with similarity clusters ( Poster ) > link | Shixuan Liu 路 Yiwei Lyu 路 Honglak Lee 路 Todd Hollon 馃敆 |
-
|
ActNAS : Generating Efficient YOLO Models using Activation NAS ( Poster ) > link | Sudhakar Sah 路 Ravish Kumar 路 Darshan Ganji 路 Ehsan Saboori 馃敆 |
-
|
Memory retaining finetuning via distillation ( Poster ) > link | Zitong Yang 路 Aonan Zhang 路 Sam Wiseman 路 Xiang Kong 路 Ke Ye 路 Dong Yin 馃敆 |
-
|
Faster, More Efficient RLHF through Off-Policy Asynchronous Learning ( Poster ) > link | Michael Noukhovitch 路 Shengyi Huang 路 Sophie Xhonneux 路 Arian Hosseini 路 Rishabh Agarwal 路 Aaron Courville 馃敆 |
-
|
Instant Transformer Adaption via HyperLoRA ( Poster ) > link | Rujikorn Charakorn 路 Edoardo Cetin 路 Yujin Tang 路 Robert Lange 馃敆 |
-
|
Estimating Effects of Tokens in Preference Learning ( Poster ) > link | Hsiao-Ru Pan 路 Maximilian Mordig 路 Bernhard Sch枚lkopf 馃敆 |
-
|
HyperDPO: Hypernetwork-based Multi-Objective Fine-Tuning Framework ( Poster ) > link | Yinuo Ren 路 Tesi Xiao 路 Michael Shavlovsky 路 Lexing Ying 路 Holakou Rahmanian 馃敆 |
-
|
Parameter-Efficient Fine-Tuning of State Space Models ( Poster ) > link | Kevin Galim 路 Wonjun Kang 路 Yuchen Zeng 路 HYUNG IL KOO 路 Kangwook Lee 馃敆 |
-
|
A Tensor-based Convolutional Neural Network for Small Dataset Classification ( Poster ) > link | Zhenhua Chen 路 David Crandall 馃敆 |
-
|
Flexora: Flexible Low-Rank Adaptation for Large Language Models ( Poster ) > link | Chenxing Wei 路 Yao Shu 路 Ying He 路 Fei Yu 馃敆 |
-
|
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors ( Poster ) > link | Vijay Chandra Lingam 路 Atula Neerkaje 路 Aditya Vavre 路 Aneesh Shetty 路 Gautham Krishna Gudur 路 Joydeep Ghosh 路 Alex Dimakis 路 Eunsol Choi 路 Aleksandar Bojchevski 路 Sujay Sanghavi 馃敆 |
-
|
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization ( Poster ) > link | Hritik Bansal 路 Ashima Suvarna 路 Gantavya Bhatt 路 Nanyun Peng 路 Kai-Wei Chang 路 Aditya Grover 馃敆 |
-
|
FRACTAL: Fine-Grained Scoring from Aggregate Text Labels ( Poster ) > link | Yukti Makhija 路 Priyanka Agrawal 路 Rishi Saket 路 Aravindan Raghuveer 馃敆 |
-
|
Teaching LLMs How To Learn with Contextual Fine-Tuning ( Poster ) > link | Younwoo Choi 路 Muhammad Adil Asif 路 Ziwen Han 路 John Willes 路 Rahul Krishnan 馃敆 |
-
|
Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment ( Poster ) > link | Chenliang Li 路 Siliang Zeng 路 Zeyi Liao 路 Jiaxiang Li 路 Dongyeop Kang 路 Alfredo Garcia 路 Mingyi Hong 馃敆 |
-
|
On Efficient Distillation from LLMs to SLMs ( Poster ) > link | Metod Jazbec 路 Menglin Xia 路 Ankur Mallick 路 Daniel Madrigal 路 Dongge Han 路 Samuel Kessler 路 Victor Ruehle 馃敆 |
-
|
What Causes a Disparate Impact in a Quantized Model? ( Poster ) > link | Abhimanyu Bellam 路 Jung-Eun Kim 馃敆 |
-
|
Fitness Aware Human Motion Generation with Fine-Tuning ( Poster ) > link | Kiril Bikov 路 Shiye Su 路 Deepro Choudhury 路 Zhilin Guo 路 Weihao Xia 路 Mehmet 脟elikteny谋ld谋z 路 Chenliang Zhou 路 Param Hanji 路 Cengiz Oztireli 馃敆 |
-
|
Best Unpacking DPO and PPO: Disentangling Practices for Learning from Preference Feedback ( Poster ) > link | Hamish Ivison 路 Yizhong Wang 路 Jiacheng Liu 路 Zeqiu Wu 路 Valentina Pyatkin 路 Nathan Lambert 路 Noah Smith 路 Yejin Choi 路 Hannaneh Hajishirzi 馃敆 |
-
|
LLM Alignment Through Successive Policy Re-weighting (SPR) ( Poster ) > link | Xinnan Zhang 路 Siliang Zeng 路 Jiaxiang Li 路 Kaixiang Lin 路 Mingyi Hong 馃敆 |
-
|
Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks ( Poster ) > link | David A. Danhofer 馃敆 |
-
|
Token Pruning using a Lightweight Background Aware Vision Transformer ( Poster ) > link | Sudhakar Sah 路 Ravish Kumar 路 Honnesh Rohmetra 路 Ehsan Saboori 馃敆 |
-
|
Simultaneous Weight and Architecture Optimization for Neural Networks ( Poster ) > link | Zitong Huang 路 Mansooreh Montazerin 路 Ajitesh Srivastava 馃敆 |
-
|
Addax: Resource-Efficient Fine-Tuning of Language Models with a Combination of Forward-Backward and Forward-Only Passes ( Poster ) > link | Zeman Li 路 Xinwei Zhang 路 Peilin Zhong 路 Yuan Deng 路 Vahab Mirrokni 路 Meisam Razaviyayn 馃敆 |
-
|
XoRA: Expander Adapted LoRA Finetuning ( Poster ) > link | Amaljith EV 路 Arindam Biswas 路 Suryam Arnav Kalra 路 Pabitra Mitra 路 Biswajit Basu 馃敆 |
-
|
GaLore-mini: Low Rank Gradient Learning with Fewer Learning Rates ( Poster ) > link | WH Huang 路 Zhenyu Zhang 路 Yushun Zhang 路 Zhiquan Luo 路 Ruoyu Sun 路 Zhangyang "Atlas" Wang 馃敆 |
-
|
Variational Best-of-N Alignment ( Poster ) > link | Afra Amini 路 Tim Vieira 路 Elliott Ash 路 Ryan Cotterell 馃敆 |
-
|
Fine-tuning Vision Classifiers On A Budget ( Poster ) > link | Sunil Kumar 路 Ted Sandler 路 Paulina Varshavskaya 馃敆 |
-
|
Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples ( Poster ) > link | No毛l Vouitsis 路 Rasa Hosseinzadeh 路 Brendan Ross 路 Valentin Villecroze 路 Satya Krishna Gorti 路 Jesse Cresswell 路 Gabriel Loaiza-Ganem 馃敆 |
-
|
Towards Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model ( Poster ) > link | Divyanshu Aggarwal 路 Sankarshan Damle 路 Navin Goyal 路 Satya Lokam 路 Sunayana Sitaram 馃敆 |
-
|
Discrepancy-Guided Parameter Suppression for Robust Fine-tuning ( Poster ) > link | Chang Liu 路 Jingyu Ma 馃敆 |
-
|
Analysing Softmax Entropy Minimization for Adaptating Multitask Models at Test-time ( Poster ) > link | Soumyajit Chatterjee 路 Abhirup Ghosh 路 Fahim Kawsar 路 Mohammad Malekzadeh 馃敆 |