Skip to yearly menu bar Skip to main content


(119 events)   Timezone:  
Toggle Poster Visibility
Expo Workshop
@ Room 291 None
DGL: Impactful graph neural networks: A Tale of Research and Productionization
Poster
None
Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval
Liang Zhang · Anwen Hu · Qin Jin
[ Poster [ OpenReview
Poster
None
On the Representation Collapse of Sparse Mixture of Experts
Zewen Chi · Li Dong · Shaohan Huang · Damai Dai · Shuming Ma · Barun Patra · Saksham Singhal · Payal Bajaj · XIA SONG · Xian-Ling Mao · Heyan Huang · Furu Wei
[ OpenReview
Poster
None
A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models
Yuanxin Liu · Fandong Meng · Zheng Lin · Jiangnan Li · Peng Fu · Yanan Cao · Weiping Wang · Jie Zhou
[ Poster [ OpenReview
Poster
None
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning
Yuchong Sun · Hongwei Xue · Ruihua Song · Bei Liu · Huan Yang · Jianlong Fu
[ Poster [ OpenReview
Poster
None
Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models
Biru Zhu · Yujia Qin · Ganqu Cui · Yangyi Chen · Weilin Zhao · Chong Fu · Yangdong Deng · Zhiyuan Liu · Jingang Wang · Wei Wu · Maosong Sun · Ming Gu
[ Poster [ OpenReview
Poster
None
Relation-Constrained Decoding for Text Generation
Xiang Chen · Zhixian Yang · Xiaojun Wan
[ Slides [ Poster [ OpenReview
Poster
None
OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression
Wanhua Li · Xiaoke Huang · Zheng Zhu · Yansong Tang · Xiu Li · Jie Zhou · Jiwen Lu
[ OpenReview
Poster
None
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models
Xiuying Wei · Yunchen Zhang · Xiangguo Zhang · Ruihao Gong · Shanghang Zhang · Qi Zhang · Fengwei Yu · Xianglong Liu
[ Poster [ OpenReview
Poster
None
P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting
Ziyi Wang · Xumin Yu · Yongming Rao · Jie Zhou · Jiwen Lu
[ Poster [ OpenReview
Poster
None
Divert More Attention to Vision-Language Tracking
Mingzhe Guo · Zhipeng Zhang · Heng Fan · Liping Jing
[ Poster [ OpenReview
Poster
None
Addressing Resource Scarcity across Sign Languages with Multilingual Pretraining and Unified-Vocabulary Datasets
Gokul NC · Manideep Ladi · Sumit Negi · Prem Selvaraj · Pratyush Kumar · Mitesh Khapra
[ OpenReview
Poster
None
TGEA 2.0: A Large-Scale Diagnostically Annotated Dataset with Benchmark Tasks for Text Generation of Pretrained Language Models
Huibin Ge · Xiaohu Zhao · Chuang Liu · Yulong Zeng · Qun Liu · Deyi Xiong
[ Poster [ OpenReview
Poster
None
Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Peter Henderson · Mark Krass · Lucia Zheng · Neel Guha · Christopher D Manning · Dan Jurafsky · Daniel Ho
[ OpenReview
Poster
None
TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training
Yulong Liu · Guibo Zhu · Bin Zhu · Qi Song · Guojing Ge · Haoran Chen · GuanHui Qiao · Ru Peng · Lingxiang Wu · Jinqiao Wang
[ Slides [ Poster [ OpenReview
Poster
None
InterpretDL: Explaining Deep Models in PaddlePaddle
Xuhong Li · Haoyi Xiong · Xingjian Li · Xuanyu Wu · Zeyu Chen · Dejing Dou
Workshop
Mon Nov 28 07:00 AM -- 04:00 PM (PST) @ Room 283 None
Queer in AI
Sarthak Arora · Jaidev Shriram · Evan Dong · Divija Nagaraju · Kruno Lehman · Yanan Long · Nenad Tomasev · Ashwin S · Hang Yuan · Ruchira Ray · Claas Voelcker
Expo Workshop
Mon Nov 28 07:30 AM -- 10:30 AM (PST) @ Room 291 None
PyTorch: New advances for large-scale training and performance optimizations
Geeta Chauhan · Rohan Varma · Ke Wen · Taylor Robie · Andrew Gu · Anupam Bhatnagar · Bin Bao · Natalia Gimelshein · Animesh Jain · Sherlock Huang
Expo Talk Panel
Mon Nov 28 08:30 AM -- 09:30 AM (PST) @ Theater B None
Machine Learning and Optimization for Automated Trading at HRT
Miles Lubin · Marc Khoury
Expo Talk Panel
Mon Nov 28 12:00 PM -- 01:00 PM (PST) @ Theater B None
Towards learning agents for solving complex real-world tasks
Honglak Lee
Invited Talk
Mon Nov 28 03:15 PM -- 04:00 PM (PST) @ Hall H None
Could a Large Language Model be Conscious?
David Chalmers
[ Slides
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #107
Differentially Private Model Compression
FatemehSadat Mireshghallah · Arturs Backurs · Huseyin A. Inan · Lukas Wutschitz · Janardhan Kulkarni
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #117
Predictive Querying for Autoregressive Neural Sequence Models
Alex Boyd · Samuel Showalter · Stephan Mandt · Padhraic Smyth
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #138
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le · Yue Wang · Akhilesh Deepak Gotmare · Silvio Savarese · Steven Chu Hong Hoi
[ Slides [ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #241
UniCLIP: Unified Framework for Contrastive Language-Image Pre-training
Janghyeon Lee · Jongsuk Kim · Hyounguk Shon · Bumsoo Kim · Seung Hwan Kim · Honglak Lee · Junmo Kim
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #232
Factuality Enhanced Language Models for Open-Ended Text Generation
Nayeon Lee · Wei Ping · Peng Xu · Mostofa Patwary · Pascale N Fung · Mohammad Shoeybi · Bryan Catanzaro
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #231
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu · Swaroop Mishra · Tanglin Xia · Liang Qiu · Kai-Wei Chang · Song-Chun Zhu · Oyvind Tafjord · Peter Clark · Ashwin Kalyan
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #424
On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning
Shiro Takagi
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #513
NaturalProver: Grounded Mathematical Proof Generation with Language Models
Sean Welleck · Jiacheng Liu · Ximing Lu · Hannaneh Hajishirzi · Yejin Choi
[ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #525
Fine-tuning language models to find agreement among humans with diverse preferences
Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #640
Matryoshka Representation Learning
Aditya Kusupati · Gantavya Bhatt · Aniket Rege · Matthew Wallingford · Aditya Sinha · Vivek Ramanujan · William Howard-Snyder · Kaifeng Chen · Sham Kakade · Prateek Jain · Ali Farhadi
[ Slides [ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #611
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP
Thao Nguyen · Gabriel Ilharco · Mitchell Wortsman · Sewoong Oh · Ludwig Schmidt
[ Poster [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #921
QUARK: Controllable Text Generation with Reinforced Unlearning
Ximing Lu · Sean Welleck · Jack Hessel · Liwei Jiang · Lianhui Qin · Peter West · Prithviraj Ammanabrolu · Yejin Choi
[ Slides [ OpenReview
Poster
Tue Nov 29 09:00 AM -- 11:00 AM (PST) @ Hall J #1037
Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective
Changyou Chen · Jianyi Zhang · Yi Xu · Liqun Chen · Jiali Duan · Yiran Chen · Son Tran · Belinda Zeng · Trishul Chilimbi
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #107
Exploring Length Generalization in Large Language Models
Cem Anil · Yuhuai Wu · Anders Andreassen · Aitor Lewkowycz · Vedant Misra · Vinay Ramasesh · Ambrose Slone · Guy Gur-Ari · Ethan Dyer · Behnam Neyshabur
[ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #231
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
Boxin Wang · Wei Ping · Chaowei Xiao · Peng Xu · Mostofa Patwary · Mohammad Shoeybi · Bo Li · Anima Anandkumar · Bryan Catanzaro
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #439
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts
Basil Mustafa · Carlos Riquelme · Joan Puigcerver · Rodolphe Jenatton · Neil Houlsby
[ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #640
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models
Mojan Javaheripi · Gustavo de Rosa · Subhabrata Mukherjee · Shital Shah · Tomasz Religa · Caio Cesar Teodoro Mendes · Sebastien Bubeck · Farinaz Koushanfar · Debadeepta Dey
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #628
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Xiwen Liang · Yangxin Wu · Jianhua Han · Hang Xu · Chunjing XU · Xiaodan Liang
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #621
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Yi-Lin Sung · Jaemin Cho · Mohit Bansal
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #605
Revisiting Neural Scaling Laws in Language and Vision
Ibrahim Alabdulmohsin · Behnam Neyshabur · Xiaohua Zhai
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #705
First is Better Than Last for Language Data Influence
Chih-Kuan Yeh · Ankur Taly · Mukund Sundararajan · Frederick Liu · Pradeep Ravikumar
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #706
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment
Zhijing Jin · Sydney Levine · Fernando Gonzalez Adauto · Ojasv Kamal · Maarten Sap · Mrinmaya Sachan · Rada Mihalcea · Josh Tenenbaum · Bernhard Schölkopf
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #707
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
Jin Xu · Xiaojiang Liu · Jianhao Yan · Deng Cai · Huayang Li · Jian Li
[ Slides [ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #920
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz · Anders Andreassen · David Dohan · Ethan Dyer · Henryk Michalewski · Vinay Ramasesh · Ambrose Slone · Cem Anil · Imanol Schlag · Theo Gutman-Solo · Yuhuai Wu · Behnam Neyshabur · Guy Gur-Ari · Vedant Misra
[ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #1027
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan · Guanzhi Wang · Yunfan Jiang · Ajay Mandlekar · Yuncong Yang · Haoyi Zhu · Andrew Tang · De-An Huang · Yuke Zhu · Anima Anandkumar
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #1019
Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities
Zejiang Shen · Kyle Lo · Lauren Yu · Nathan Dahlberg · Margo Schlanger · Doug Downey
[ Slides [ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #1012
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Hugo Laurençon · Lucile Saulnier · Thomas Wang · Christopher Akiki · Albert Villanova del Moral · Teven Le Scao · Leandro Von Werra · Chenghao Mou · Eduardo González Ponferrada · Huu Nguyen · Jörg Frohberg · Mario Šaško · Quentin Lhoest · Angelina McMillan-Major · Gerard Dupont · Stella Biderman · Anna Rogers · Loubna Ben allal · Francesco De Toni · Giada Pistilli · Olivier Nguyen · Somaieh Nikpoor · Maraim Masoud · Pierre Colombo · Javier de la Rosa · Paulo Villegas · Tristan Thrush · Shayne Longpre · Sebastian Nagel · Leon Weber · Manuel Muñoz · Jian Zhu · Daniel Van Strien · Zaid Alyafeai · Khalid Almubarak · Minh Chien Vu · Itziar Gonzalez-Dios · Aitor Soroa · Kyle Lo · Manan Dey · Pedro Ortiz Suarez · Aaron Gokaslan · Shamik Bose · David Adelani · Long Phan · Hieu Tran · Ian Yu · Suhas Pai · Jenny Chim · Violette Lepercq · Suzana Ilic · Margaret Mitchell · Sasha Alexandra Luccioni · Yacine Jernite
[ Poster [ OpenReview
Poster
Tue Nov 29 02:00 PM -- 04:00 PM (PST) @ Hall J #1011
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Lukasz Augustyniak · Kamil Tagowski · Albert Sawczyn · Denis Janiak · Roman Bartusiak · Adrian Szymczak · Arkadiusz Janz · Piotr Szymański · Marcin Wątroba · Mikołaj Morzy · Tomasz Kajdanowicz · Maciej Piasecki
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #242
Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively
Haojie Zhang · Ge Li · Jia Li · Zhongjin Zhang · YUQI ZHU · Zhi Jin
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #215
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac · Jeff Donahue · Pauline Luc · Antoine Miech · Iain Barr · Yana Hasson · Karel Lenc · Arthur Mensch · Katherine Millican · Malcolm Reynolds · Roman Ring · Eliza Rutherford · Serkan Cabi · Tengda Han · Zhitao Gong · Sina Samangooei · Marianne Monteiro · Jacob L Menick · Sebastian Borgeaud · Andy Brock · Aida Nematzadeh · Sahand Sharifzadeh · Mikołaj Bińkowski · Ricardo Barreira · Oriol Vinyals · Andrew Zisserman · Karén Simonyan
[ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #506
Fault-Aware Neural Code Rankers
Jeevana Priya Inala · Chenglong Wang · Mei Yang · Andres Codas · Mark Encarnación · Shuvendu Lahiri · Madanlal Musuvathi · Jianfeng Gao
[ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #513
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason Wei · Xuezhi Wang · Dale Schuurmans · Maarten Bosma · brian ichter · Fei Xia · Ed Chi · Quoc V Le · Denny Zhou
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #520
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Muning Wen · Jakub Kuba · Runji Lin · Weinan Zhang · Ying Wen · Jun Wang · Yaodong Yang
[ Slides [ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #527
LAMP: Extracting Text from Gradients with Language Model Priors
Mislav Balunovic · Dimitar Dimitrov · Nikola Jovanović · Martin Vechev
[ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #640
Confident Adaptive Language Modeling
Tal Schuster · Adam Fisch · Jai Gupta · Mostafa Dehghani · Dara Bahri · Vinh Tran · Yi Tay · Donald Metzler
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #630
Deep Compression of Pre-trained Transformer Models
Naigang Wang · Chi-Chun (Charlie) Liu · Swagath Venkataramani · Sanchari Sen · Chia-Yu Chen · Kaoutar El Maghraoui · Vijayalakshmi (Viji) Srinivasan · Leland Chang
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #738
Capturing Failures of Large Language Models via Human Cognitive Biases
Erik Jones · Jacob Steinhardt
[ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #900
LogiGAN: Learning Logical Reasoning via Adversarial Pre-training
Xinyu Pi · Wanjun Zhong · Yan Gao · Nan Duan · Jian-Guang Lou
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #904
Sparse Probabilistic Circuits via Pruning and Growing
Meihua Dang · Anji Liu · Guy Van den Broeck
[ Slides [ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #926
🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke · Eli VanderBilt · Alvaro Herrasti · Luca Weihs · Kiana Ehsani · Jordi Salvador · Winson Han · Eric Kolve · Aniruddha Kembhavi · Roozbeh Mottaghi
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #1033
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations
Madeline Chantry · Shruti Vyas · Hamid Palangi · Yogesh Rawat · Vibhav Vineet
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #1022
Forecasting Future World Events With Neural Networks
Andy Zou · Tristan Xiao · Ryan Jia · Joe Kwon · Mantas Mazeika · Richard Li · Dawn Song · Jacob Steinhardt · Owain Evans · Dan Hendrycks
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #1014
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models
Maribeth Rauh · John Mellor · Jonathan Uesato · Po-Sen Huang · Johannes Welbl · Laura Weidinger · Sumanth Dathathri · Amelia Glaese · Geoffrey Irving · Iason Gabriel · William Isaac · Lisa Anne Hendricks
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #1013
A Large Scale Search Dataset for Unbiased Learning to Rank
Lixin Zou · Haitao Mao · Xiaokai Chu · Jiliang Tang · Wenwen Ye · Shuaiqiang Wang · Dawei Yin
[ Poster [ OpenReview
Poster
Wed Nov 30 09:00 AM -- 11:00 AM (PST) @ Hall J #1003
[Re] Graph Edit Networks
Vid Stropnik · Maruša Oražem
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #115
Predictive Coding beyond Gaussian Distributions
Luca Pinchetti · Tommaso Salvatori · Yordan Yordanov · Beren Millidge · Yuhang Song · Thomas Lukasiewicz
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #125
HUMUS-Net: Hybrid Unrolled Multi-scale Network Architecture for Accelerated MRI Reconstruction
Zalan Fabian · Berk Tinaz · Mahdi Soltanolkotabi
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #234
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
Yonggan Fu · Yang Zhang · Kaizhi Qian · Zhifan Ye · Zhongzhi Yu · Cheng-I Jeff Lai · Celine Lin
[ Slides [ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #410
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu · Hao Liu · Aditya Grover · Pieter Abbeel
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #524
Data Distributional Properties Drive Emergent In-Context Learning in Transformers
Stephanie Chan · Adam Santoro · Andrew Lampinen · Jane Wang · Aaditya Singh · Pierre Richemond · James McClelland · Felix Hill
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #527
Do Current Multi-Task Optimization Methods in Deep Learning Even Help?
Derrick Xin · Behrooz Ghorbani · Justin Gilmer · Ankush Garg · Orhan Firat
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #639
An empirical analysis of compute-optimal large language model training
Jordan Hoffmann · Sebastian Borgeaud · Arthur Mensch · Elena Buchatskaya · Trevor Cai · Eliza Rutherford · Diego de Las Casas · Lisa Anne Hendricks · Johannes Welbl · Aidan Clark · Thomas Hennigan · Eric Noland · Katherine Millican · George van den Driessche · Bogdan Damoc · Aurelia Guy · Simon Osindero · Karén Simonyan · Erich Elsen · Oriol Vinyals · Jack Rae · Laurent Sifre
[ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #706
Exploring evolution-aware & -free protein language models as protein function predictors
Mingyang Hu · Fajie Yuan · Kevin Yang · Fusong Ju · Jin Su · Hui Wang · Fei Yang · Qiuyang Ding
[ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #801
Decoupled Context Processing for Context Augmented Language Modeling
Zonglin Li · Ruiqi Guo · Sanjiv Kumar
[ Slides [ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #920
Training language models to follow instructions with human feedback
Long Ouyang · Jeffrey Wu · Xu Jiang · Diogo Almeida · Carroll Wainwright · Pamela Mishkin · Chong Zhang · Sandhini Agarwal · Katarina Slama · Alex Ray · John Schulman · Jacob Hilton · Fraser Kelton · Luke Miller · Maddie Simens · Amanda Askell · Peter Welinder · Paul Christiano · Jan Leike · Ryan Lowe
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #925
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Yujia Xie · Luowei Zhou · Xiyang Dai · Lu Yuan · Nguyen Bach · Ce Liu · Michael Zeng
[ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #1041
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Zhenhailong Wang · Manling Li · Ruochen Xu · Luowei Zhou · Jie Lei · Xudong Lin · Shuohang Wang · Ziyi Yang · Chenguang Zhu · Derek Hoiem · Shih-Fu Chang · Mohit Bansal · Heng Ji
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #1025
MOMA-LRG: Language-Refined Graphs for Multi-Object Multi-Actor Activity Parsing
Zelun Luo · Zane Durante · Linden Li · Wanze Xie · Ruochen Liu · Emily Jin · Zhuoyi Huang · Lun Yu Li · Jiajun Wu · Juan Carlos Niebles · Ehsan Adeli · Fei-Fei Li
[ Poster [ OpenReview
Poster
Wed Nov 30 02:00 PM -- 04:00 PM (PST) @ Hall J #1012
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann · Romain Beaumont · Richard Vencu · Cade Gordon · Ross Wightman · Mehdi Cherti · Theo Coombes · Aarush Katta · Clayton Mullis · Mitchell Wortsman · Patrick Schramowski · Srivatsa Kundurthy · Katherine Crowson · Ludwig Schmidt · Robert Kaczmarczyk · Jenia Jitsev
[ Slides [ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #107
Multi-Game Decision Transformers
Kuang-Huei Lee · Ofir Nachum · Mengjiao (Sherry) Yang · Lisa Lee · Daniel Freeman · Sergio Guadarrama · Ian Fischer · Winnie Xu · Eric Jang · Henryk Michalewski · Igor Mordatch
[ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #139
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai · Lu Hou · Lifeng Shang · Xin Jiang · Irwin King · Michael R Lyu
[ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #231
Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models
Dongkuan (DK) Xu · Subhabrata Mukherjee · Xiaodong Liu · Debadeepta Dey · Wenhui Wang · Xiang Zhang · Ahmed Awadallah · Jianfeng Gao
[ Slides [ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #332
When Does Differentially Private Learning Not Suffer in High Dimensions?
Xuechen Li · Daogao Liu · Tatsunori Hashimoto · Huseyin A. Inan · Janardhan Kulkarni · Yin-Tat Lee · Abhradeep Guha Thakurta
[ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #334
Are GANs overkill for NLP?
David Alvarez-Melis · Vikas Garg · Adam Kalai
[ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #440
Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation
Ziyu Jiang · Xuxi Chen · Xueqin Huang · Xianzhi Du · Denny Zhou · Zhangyang Wang
[ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #410
Neural Attentive Circuits
Martin Weiss · Nasim Rahaman · Francesco Locatello · Chris Pal · Yoshua Bengio · Bernhard Schölkopf · Erran Li Li · Nicolas Ballas
[ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #519
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Tomasz Korbak · Hady Elsahar · Germán Kruszewski · Marc Dymetman
[ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #639
Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers
Albert Qiaochu Jiang · Wenda Li · Szymon Tworkowski · Konrad Czechowski · Tomasz Odrzygóźdź · Piotr Miłoś · Yuhuai Wu · Mateja Jamnik
[ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #621
The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models
Conglong Li · Minjia Zhang · Yuxiong He
[ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #716
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Kushal Tirumala · Aram Markosyan · Luke Zettlemoyer · Armen Aghajanyan
[ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #902
AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning
Tao Yang · JInghao Deng · Xiaojun Quan · Qifan Wang · Shaoliang Nie
[ Slides [ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #912
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia · William Chan · Saurabh Saxena · Lala Li · Jay Whang · Emily Denton · Kamyar Ghasemipour · Raphael Gontijo Lopes · Burcu Karagol Ayan · Tim Salimans · Jonathan Ho · David Fleet · Mohammad Norouzi
[ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #926
ReCo: Retrieve and Co-segment for Zero-shot Transfer
Gyungin Shin · Weidi Xie · Samuel Albanie
[ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #1032
PEER: A Comprehensive and Multi-Task Benchmark for Protein Sequence Understanding
Minghao Xu · Zuobai Zhang · Jiarui Lu · Zhaocheng Zhu · Yangtian Zhang · Ma Chang · Runcheng Liu · Jian Tang
[ Slides [ Poster [ OpenReview
Poster
Thu Dec 01 09:00 AM -- 11:00 AM (PST) @ Hall J #1013
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark
Jiaxi Gu · Xiaojun Meng · Guansong Lu · Lu Hou · Niu Minzhe · Xiaodan Liang · Lewei Yao · Runhui Huang · Wei Zhang · Xin Jiang · Chunjing XU · Hang Xu
[ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #234
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Zhewei Yao · Reza Yazdani Aminabadi · Minjia Zhang · Xiaoxia Wu · Conglong Li · Yuxiong He
[ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #230
K-LITE: Learning Transferable Visual Models with External Knowledge
Sheng Shen · Chunyuan Li · Xiaowei Hu · Yujia Xie · Jianwei Yang · Pengchuan Zhang · Zhe Gan · Lijuan Wang · Lu Yuan · Ce Liu · Kurt Keutzer · Trevor Darrell · Anna Rohrbach · Jianfeng Gao
[ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #205
Recovering Private Text in Federated Learning of Language Models
Samyak Gupta · Yangsibo Huang · Zexuan Zhong · Tianyu Gao · Kai Li · Danqi Chen
[ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #512
Autoformalization with Large Language Models
Yuhuai Wu · Albert Qiaochu Jiang · Wenda Li · Markus Rabe · Charles Staats · Mateja Jamnik · Christian Szegedy
[ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #524
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning
Xi Ye · Greg Durrett
[ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #639
GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers · Mike Lewis · Younes Belkada · Luke Zettlemoyer
[ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #632
Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision Tasks
Yen-Cheng Liu · CHIH-YAO MA · Junjiao Tian · Zijian He · Zsolt Kira
[ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #706
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima · Shixiang (Shane) Gu · Machel Reid · Yutaka Matsuo · Yusuke Iwasawa
[ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #913
MorphTE: Injecting Morphology in Tensorized Embeddings
Guobing Gan · Peng Zhang · Sunzhu Li · Xiuqing Lu · Benyou Wang
[ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #928
What Can Transformers Learn In-Context? A Case Study of Simple Function Classes
Shivam Garg · Dimitris Tsipras · Percy Liang · Gregory Valiant
[ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #1017
Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment
Daniel Vera Nieto · Luigi Celona · Clara Fernandez Labrador
[ Slides [ Poster [ OpenReview
Poster
Thu Dec 01 02:00 PM -- 04:00 PM (PST) @ Hall J #1012
BigBio: A Framework for Data-Centric Biomedical Natural Language Processing
Jason Fries · Leon Weber · Natasha Seelam · Gabriel Altay · Debajyoti Datta · Samuele Garda · Sunny Kang · Rosaline Su · Wojciech Kusa · Samuel Cahyawijaya · Fabio Barth · Simon Ott · Matthias Samwald · Stephen Bach · Stella Biderman · Mario Sänger · Bo Wang · Alison Callahan · Daniel León Periñán · Théo Gigant · Patrick Haller · Jenny Chim · Jose Posada · John Giorgi · Karthik Rangasai Sivaraman · Marc Pàmies · Marianna Nezhurina · Robert Martin · Michael Cullan · Moritz Freidank · Nathan Dahlberg · Shubhanshu Mishra · Shamik Bose · Nicholas Broad · Yanis Labrak · Shlok Deshmukh · Sid Kiblawi · Ayush Singh · Minh Chien Vu · Trishala Neeraj · Jonas Golde · Albert Villanova del Moral · Benjamin Beilharz
[ Poster [ OpenReview
Workshop
Fri Dec 02 06:00 AM -- 03:00 PM (PST) @ Room 288 - 289 None
Synthetic Data for Empowering ML Research
Mihaela van der Schaar · Zhaozhi Qian · Sergul Aydore · Dimitris Vlitas · Dino Oglic · Tucker Balch
Workshop
Fri Dec 02 06:30 AM -- 03:45 PM (PST) @ Room 398 None
Table Representation Learning
Madelon Hulsebos · Bojan Karlaš · Pengcheng Yin · haoyu dong
Workshop
Fri Dec 02 06:40 AM -- 03:00 PM (PST) @ Theater A None
New Frontiers in Graph Learning
Jiaxuan You · Marinka Zitnik · Rex Ying · Yizhou Sun · Hanjun Dai · Stefanie Jegelka
Workshop
Sat Dec 03 06:30 AM -- 03:00 PM (PST) @ Room 281 - 282 None
Information-Theoretic Principles in Cognitive Systems
Noga Zaslavsky · Mycal Tucker · Sarah Marzen · Irina Higgins · Stephanie Palmer · Samuel J Gershman
Workshop
Sat Dec 03 06:30 AM -- 03:00 PM (PST) @ Room 288 - 289 None
Machine Learning in Structural Biology Workshop
Roshan Rao · Jonas Adler · Namrata Anand · John Ingraham · Sergey Ovchinnikov · Ellen Zhong
[ Contact: workshopmlsb@gmail.com ]
Workshop
Sat Dec 03 06:50 AM -- 02:30 PM (PST) @ Room 291 - 292 None
Foundation Models for Decision Making
Mengjiao (Sherry) Yang · Yilun Du · Jack Parker-Holder · Siddharth Karamcheti · Igor Mordatch · Shixiang (Shane) Gu · Ofir Nachum
Workshop
Sat Dec 03 06:50 AM -- 03:00 PM (PST) @ Theater C None
Transfer Learning for Natural Language Processing
Alon Albalak · Colin Raffel · Chunting Zhou · Deepak Ramachandran · Xuezhe Ma · Sebastian Ruder
Workshop
Sat Dec 03 07:00 AM -- 02:55 PM (PST) @ Room 397 None
InterNLP: Workshop on Interactive Learning for Natural Language Processing
Kianté Brantley · Soham Dan · Ji Ung Lee · Khanh Nguyen · Edwin Simpson · Alane Suhr · Yoav Artzi
Panel
Tue Dec 06 06:45 PM -- 07:00 PM (PST) @ Virtual None
Panel 2B-4: Extreme Compression for… & Exploring Length Generalization…
Cem Anil · Minjia Zhang
Expo Workshop
Wed Dec 07 09:30 AM -- 11:30 AM (PST) @ Virtual None
PyTorch: New advances for large-scale training and performance optimizations
Geeta Chauhan · Rohan Varma · Ke Wen · Taylor Robie · Andrew Gu · Anupam Bhatnagar · Bin Bao · Natalia Gimelshein · Animesh Jain · Sherlock Huang
Lightning Talk
Thu Dec 08 06:00 PM -- 06:15 PM (PST) None
Lightning Talks 6B-3
Lingfeng Yang · Yao Lai · Zizheng Pan · Zhenyu Wang · WEICONG LIANG · Chuanyang Zheng · Jian-Wei Zhang · Peng Jin · Jing Liu · Xiuying Wei · Yao Mu · Xiang Li · YUHUI YUAN · Zizheng Pan · Yifan Sun · Yunchen Zhang · Jianfei Cai · Hao Luo · zheyang li · Jinfa Huang · Haoyu He · Yi Yang · Ping Luo · Fenglin Liu · Henghui Ding · Borui Zhao · Xiangguo Zhang · Kai Zhang · Pichao WANG · Bohan Zhuang · Wei Chen · Ruihao Gong · Zhi Yang · Xian Wu · Feng Ding · Jianfei Cai · Xiao Luo · Renjie Song · Weihong Lin · Jian Yang · Wenming Tan · Bohan Zhuang · Shanghang Zhang · Shen Ge · Fan Wang · Qi Zhang · Guoli Song · Jun Xiao · Hao Li · Ding Jia · David Clifton · Ye Ren · Fengwei Yu · Zheng Zhang · Jie Chen · Shiliang Pu · Xianglong Liu · Chao Zhang · Han Hu