Search All 2024 Events
 

16 Results

Poster
Wed 11:00 Graph Neural Networks and Arithmetic Circuits
Timon Barlag · Vivian Holzapfel · Laura Strieker · Jonni Virtema · Heribert Vollmer
Poster
Fri 16:30 Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
Hanseul Cho · Jaeyoung Cha · Pranjal Awasthi · Srinadh Bhojanapalli · Anupam Gupta · Chulhee Yun
Poster
Fri 16:30 Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration
Mahdi Morafah · Vyacheslav Kungurtsev · Hojin Chang · Chen Chen · Bill Lin
Poster
Fri 11:00 Transformers Can Do Arithmetic with the Right Embeddings
Sean McLeish · Arpit Bansal · Alex Stein · Neel Jain · John Kirchenbauer · Brian Bartoldson · Bhavya Kailkhura · Abhinav Bhatele · Jonas Geiping · Avi Schwarzschild · Tom Goldstein
Poster
Wed 16:30 OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step
Owen Dugan · Donato Jiménez-Benetó · Charlotte Loh · Zhuo Chen · Rumen Dangovski · Marin Soljacic
Poster
Wed 11:00 A Careful Examination of Large Language Model Performance on Grade School Arithmetic
Hugh Zhang · Jeff Da · Dean Lee · Vaughn Robinson · Catherine Wu · William Song · Tiffany Zhao · Pranav Raja · Charlotte Zhuang · Dylan Slack · Qin Lyu · Sean Hendryx · Russell Kaplan · Michele Lunati · Summer Yue
Poster
Fri 11:00 Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier Designs
Yao Lai · Jinxin Liu · David Z. Pan · Ping Luo
Oral
Wed 10:00 Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks
Tianyu He · Darshil Doshi · Aritra Das · Andrey Gromov
Poster
Wed 16:30 Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks
Tianyu He · Darshil Doshi · Aritra Das · Andrey Gromov
Poster
Fri 16:30 A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetics
Lennert De Smet · Pedro Zuidberg Dos Martires
Workshop
Attention Bias as an Inductive Bias: How to Teach Transformers Simple Arithmetic
Shaoxiong Duan · Yining Shi · Wei Xu
Workshop
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product
Neil Mallinar · Daniel Beaglehole · Libin Zhu · Adityanarayanan Radhakrishnan · Parthe Pandit · Misha Belkin