firstbacksecondback
144 Results
Poster
|
Tue 9:00 |
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training Geng Yuan · Yanyu Li · Sheng Li · Zhenglun Kong · Sergey Tulyakov · Xulong Tang · Yanzhi Wang · Jian Ren |
|
Poster
|
Tue 14:00 |
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models Boxin Wang · Wei Ping · Chaowei Xiao · Peng Xu · Mostofa Patwary · Mohammad Shoeybi · Bo Li · Anima Anandkumar · Bryan Catanzaro |
|
Poster
|
Tue 14:00 |
Toward Efficient Robust Training against Union of ℓp Threat Models Gaurang Sriramanan · Maharshi Gor · Soheil Feizi |
|
Poster
|
Thu 14:00 |
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies Guocheng Qian · Yuchen Li · Houwen Peng · Jinjie Mai · Hasan Hammoud · Mohamed Elhoseiny · Bernard Ghanem |
|
Poster
|
Wed 14:00 |
An empirical analysis of compute-optimal large language model training Jordan Hoffmann · Sebastian Borgeaud · Arthur Mensch · Elena Buchatskaya · Trevor Cai · Eliza Rutherford · Diego de Las Casas · Lisa Anne Hendricks · Johannes Welbl · Aidan Clark · Thomas Hennigan · Eric Noland · Katherine Millican · George van den Driessche · Bogdan Damoc · Aurelia Guy · Simon Osindero · Karén Simonyan · Erich Elsen · Oriol Vinyals · Jack Rae · Laurent Sifre |
|
Poster
|
Tue 14:00 |
Maximum Likelihood Training of Implicit Nonlinear Diffusion Model Dongjun Kim · Byeonghu Na · Se Jung Kwon · Dongsoo Lee · Wanmo Kang · Il-chul Moon |
|
Poster
|
Wed 14:00 |
DreamShard: Generalizable Embedding Table Placement for Recommender Systems Daochen Zha · Louis Feng · Qiaoyu Tan · Zirui Liu · Kwei-Herng Lai · Bhargav Bhushanam · Yuandong Tian · Arun Kejariwal · Xia Hu |
|
Workshop
|
Neural Network Online Training with Sensitivity to Multiscale Temporal Structure Matt Jones · Tyler Scott · Gamaleldin Elsayed · Mengye Ren · Katherine Hermann · David Mayo · Michael Mozer |
||
Poster
|
Tue 14:00 |
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models Mojan Javaheripi · Gustavo de Rosa · Subhabrata Mukherjee · Shital Shah · Tomasz Religa · Caio Cesar Teodoro Mendes · Sebastien Bubeck · Farinaz Koushanfar · Debadeepta Dey |
|
Poster
|
Wed 14:00 |
Training language models to follow instructions with human feedback Long Ouyang · Jeffrey Wu · Xu Jiang · Diogo Almeida · Carroll Wainwright · Pamela Mishkin · Chong Zhang · Sandhini Agarwal · Katarina Slama · Alex Ray · John Schulman · Jacob Hilton · Fraser Kelton · Luke Miller · Maddie Simens · Amanda Askell · Peter Welinder · Paul Christiano · Jan Leike · Ryan Lowe |
|
Poster
|
Tue 9:00 |
Adversarial training for high-stakes reliability Daniel Ziegler · Seraphina Nix · Lawrence Chan · Tim Bauman · Peter Schmidt-Nielsen · Tao Lin · Adam Scherlis · Noa Nabeshima · Benjamin Weinstein-Raun · Daniel de Haas · Buck Shlegeris · Nate Thomas |
|
Poster
|
Wed 14:00 |
LAION-5B: An open large-scale dataset for training next generation image-text models Christoph Schuhmann · Romain Beaumont · Richard Vencu · Cade Gordon · Ross Wightman · Mehdi Cherti · Theo Coombes · Aarush Katta · Clayton Mullis · Mitchell Wortsman · Patrick Schramowski · Srivatsa Kundurthy · Katherine Crowson · Ludwig Schmidt · Robert Kaczmarczyk · Jenia Jitsev |