Competition
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
Mark Saroufim · Weiwei Yang · Christian Puhrsch · Luca Antiga · Greg Bowyer · Driss Guessous · Artidoro Pagnoni · Supriya Rao · Joseph Isaacson · Vicki Boykis · Geeta Chauhan · Aaron Gonzales · Davide Eynard
Room 356
Large Language Models (LLMs) have been pivotal in the recent Cambrian explosion of generative AI applications. However, existing efforts to democratize access to fine-tuning and querying LLMs have been largely limited by the growing hardware costs required to adapt and serve these models. Enabling low-cost and efficient LLM fine-tuning and inference can have a significant impact on industrial and scientific applications. Here, we present a single-GPU fine-tuning and inference competition. Our goal is to accelerate the development of practical software methods that reduce the costs of utilizing LLMs. Furthermore, by advocating for goal-oriented and infrastructure-focused evaluation frameworks that stress reproducibility, we aim to democratize access to these methods and make them accessible to the wider public.
Schedule
Fri 11:30 a.m. - 11:45 a.m.
Kick-Off to Efficiency: Welcoming statement from the organizers
Welcome statement and introduction to the competition -- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day.
Mark Saroufim · Weiwei Yang
Fri 11:45 a.m. - 12:00 p.m.
Invited Speaker: Jeremy Howard - Lessons from 25 years of machine learning competitions (In-person presentation)
Before Kaggle, before the $1M Netflix Prize, there was the KDD Cup, the first big annual machine learning competition. Starting in 1997, it brought together each year the world's best predictive modelers. We now have over 25 years of experience of ML competitions to draw on, with tens of millions of dollars in prizes awarded. I will summarize what we've learned from this experience, and explain where, how, and why ML competitions can help advance machine learning research and practice.
Fri 12:00 p.m. - 12:15 p.m.
Invited Speaker: Sebastian Raschka (lightning.ai) - LoRA in Action: Insights from Finetuning LLMs with Low-Rank Adaptation (In-person presentation)
Low-rank adaptation (LoRA) stands as one of the most popular and effective methods for efficiently training custom Large Language Models (LLMs). As practitioners of open-source LLMs, we regard LoRA as a crucial technique in our toolkit. In this talk, I will delve into some practical insights gained from running hundreds of experiments with LoRA, addressing questions such as: How much can I save with quantized LoRA? Are Adam optimizers memory-intensive? Should we train for multiple epochs? How do we choose the LoRA rank? Moreover, the talk will include ideas for future experiments and talking points to stimulate discussions in the workshop, such as mechanisms to avoid overfitting in LoRA and strategies for combining LoRA weights from multiple experiments.
Sebastian Raschka
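For readers new to the technique discussed in this talk, the sketch below illustrates the core low-rank adaptation idea: freeze a pretrained linear layer and learn only a small additive update W x + (alpha/r) * B A x. The layer size, rank, and scaling are illustrative assumptions, not the speaker's exact recipe.

# Minimal LoRA sketch in PyTorch (assumed hyperparameters, not the speaker's setup).
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen nn.Linear with a trainable low-rank update."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # freeze the pretrained weight
            p.requires_grad = False
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling

# Example: adapt one 4096x4096 projection; only the A/B factors are trainable.
layer = LoRALinear(nn.Linear(4096, 4096), r=8, alpha=16)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(f"trainable params: {trainable}")  # 2 * 8 * 4096 vs. 4096 * 4096 frozen in the base layer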
Fri 12:15 p.m. - 12:30 p.m.
Unveiling Success: A100 Track Team percent_bdf's Winning Strategies (Zoom presentation)
In this session, the A100 track's winning team, percent_bdf, will reveal the strategies and work that led them to triumph, offering insights and lessons from their experience.
Ao Liu
Fri 12:30 p.m. - 12:45 p.m.
Invited Speaker: Tim Dettmers - QLoRA (In-person presentation)
Tim Dettmers
Fri 12:45 p.m. - 1:00 p.m.
Invited Speaker: Sourab Mangrulkar -- Generative AI for All: 🤗 PEFT: Finetuning made simple, efficient and extendable (Zoom presentation)
Generative AI is now becoming part and parcel of everyone's daily life. Large Language Models such as ChatGPT/GPT-4, PaLM, Claude, Llama, Mistral, Falcon and StarCoder are at the core of this owing to their state-of-the-art performance at various Natural Language Processing (NLP) tasks, conversational skills and logical reasoning/coding. The conventional paradigm is to pretrain the model on web-scale data and then fine-tune it on downstream tasks to get the best performance. The fine-tuning step becomes infeasible as models get larger due to insufficient access to dedicated hardware, preventing widespread availability and usage of these models. Parameter-Efficient Fine-Tuning (PEFT) methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all of the model's parameters while maintaining performance. 🤗 PEFT is an open-source project with the vision of democratizing access to fine-tuning large AI models on consumer hardware and low-resource setups while being simple, efficient and adaptable at scale. Here, I will present the development and design considerations that went into building 🤗 PEFT and how it fits in the Generative AI landscape.
Sourab Mangrulkar
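For context, attaching LoRA adapters to a causal language model with 🤗 PEFT looks roughly like the sketch below; the checkpoint name and hyperparameters are placeholder assumptions rather than recommendations from the talk.

# Minimal 🤗 PEFT sketch: attach LoRA adapters to a causal LM.
# Checkpoint name and hyperparameters below are placeholder assumptions.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                   # rank of the low-rank update
    lora_alpha=16,                         # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # which projections to adapt (model-dependent)
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of the base model's parameters
# `model` can now be passed to a standard training loop or a transformers Trainer.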
Fri 1:00 p.m. - 1:15 p.m.
Coffee break (break)
Fri 1:15 p.m. - 1:30 p.m.
Unveiling Success: 4090 Track Winning Team's Strategies (In-person presentation)
In this session, the winning team of the 4090 track will reveal the innovative strategies and teamwork that led them to triumph, offering insights and lessons from their experience.
Fri 1:30 p.m. - 1:45 p.m.
Invited Speaker: Keming Lu (Alibaba Research) - Qwen: Towards a Generalist Model (In-person presentation)
We introduce Qwen, the large language and multimodal model series published and open-sourced by Alibaba Group. The Qwen models have achieved competitive performance against both open-source and proprietary LLMs and LMMs in benchmarks and human evaluation. This talk provides a brief overview of the model series and delves into the details of building the LLMs, including pretraining, alignment, and open-sourcing. Additionally, it points out limitations and discusses future work for both the research community and industry in this field.
Keming Lu
Fri 1:45 p.m. - 2:00 p.m.
Invited Speaker: Mojan Javaheripi (Microsoft Research) - Unleashing the power of Small Language Models (Zoom presentation)
Over the past few months, we have released a suite of small language models (SLMs) called "Phi" that achieve unprecedented performance on a variety of benchmarks. Our first model, the 1.3 billion parameter Phi-1, achieved state-of-the-art performance on Python coding among SLMs. We then extended our focus to common sense reasoning and language understanding, and created a new 1.3 billion parameter model named Phi-1.5, with performance comparable to models 5x larger. Our latest model, the 2.7 billion parameter Phi-2, surpasses Phi-1.5 performance on all benchmarks, thanks to new innovations in model scaling and training data curation. In this talk, I will introduce the Phi SLMs and discuss two key insights driving their performance: 1) generation and utilization of data with "textbook quality" to elevate the learning process in contrast to conventional web data, and 2) incorporation of best practices for scaling up to enhance overall performance.
Mojan Javaheripi
Fri 2:00 p.m. - 2:15 p.m.
Invited Speaker: Leshem Choshen (IBM Research) - Efficient Evaluation for Efficient Training (In-person presentation)
Two competing forces are often ignored in evaluation: reliability and efficiency. The talk will explain the basics of the open evaluation framework HELM and the analysis done to turn an awfully slow evaluation (4K GPU hours for a single model) into one hundreds of times faster whose scores you can still rely on. In short: maximize the variability of the data (more datasets and prompts, fewer examples and repetitions) and give more resources to the cases you care about (e.g., remove the bottom models after a fast evaluation). The analysis and more details on how to make and check smart evaluation decisions appear in "Efficient benchmarking (of language models)": https://arxiv.org/abs/2308.11696v3
Leshem Choshen
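A minimal sketch of the screen-then-spend idea summarized above: score every model on a small, diverse sample drawn from all datasets, drop the bottom of the ranking, and spend the full evaluation budget only on the survivors. The function names and thresholds here are hypothetical placeholders, not an API from HELM or the paper.

# Hypothetical "fast screen, then full evaluation" sketch; score_model() stands in for
# whatever benchmark harness is used and is not an API from HELM or the linked paper.
import random

def efficient_eval(models, datasets, score_model, fast_n=16, keep_fraction=0.5, full_n=1000):
    # Stage 1: cheap screen -- a few examples from every dataset maximizes data variability.
    fast_scores = {}
    for m in models:
        samples = [ex for ds in datasets for ex in random.sample(ds, min(fast_n, len(ds)))]
        fast_scores[m] = score_model(m, samples)

    # Stage 2: drop the bottom of the ranking; give the full budget to the models you care about.
    ranked = sorted(models, key=lambda m: fast_scores[m], reverse=True)
    survivors = ranked[: max(1, int(len(ranked) * keep_fraction))]

    full_scores = {}
    for m in survivors:
        samples = [ex for ds in datasets for ex in random.sample(ds, min(full_n, len(ds)))]
        full_scores[m] = score_model(m, samples)
    return full_scores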
Fri 2:15 p.m. - 2:30 p.m.
Award ceremony and open floor discussion (panel discussion)
Please join us to celebrate the community's achievements and the successes of this competition. The event will feature an open floor discussion on the future of the competition. We look forward to celebrating together and exploring what lies ahead.
Weiwei Yang · Mark Saroufim · Christian Puhrsch · Joseph Isaacson · Vicki Boykis