firstbacksecondback
16 Results
Poster
|
Wed 11:00 |
Graph Neural Networks and Arithmetic Circuits Timon Barlag · Vivian Holzapfel · Laura Strieker · Jonni Virtema · Heribert Vollmer |
|
Poster
|
Fri 16:30 |
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure Hanseul Cho · Jaeyoung Cha · Pranjal Awasthi · Srinadh Bhojanapalli · Anupam Gupta · Chulhee Yun |
|
Poster
|
Fri 16:30 |
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration Mahdi Morafah · Vyacheslav Kungurtsev · Hojin Chang · Chen Chen · Bill Lin |
|
Poster
|
Fri 11:00 |
Transformers Can Do Arithmetic with the Right Embeddings Sean McLeish · Arpit Bansal · Alex Stein · Neel Jain · John Kirchenbauer · Brian Bartoldson · Bhavya Kailkhura · Abhinav Bhatele · Jonas Geiping · Avi Schwarzschild · Tom Goldstein |
|
Poster
|
Wed 16:30 |
OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step Owen Dugan · Donato Jiménez-Benetó · Charlotte Loh · Zhuo Chen · Rumen Dangovski · Marin Soljacic |
|
Poster
|
Wed 11:00 |
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Hugh Zhang · Jeff Da · Dean Lee · Vaughn Robinson · Catherine Wu · William Song · Tiffany Zhao · Pranav Raja · Charlotte Zhuang · Dylan Slack · Qin Lyu · Sean Hendryx · Russell Kaplan · Michele Lunati · Summer Yue |
|
Poster
|
Fri 11:00 |
Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier Designs Yao Lai · Jinxin Liu · David Z. Pan · Ping Luo |
|
Oral
|
Wed 10:00 |
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks Tianyu He · Darshil Doshi · Aritra Das · Andrey Gromov |
|
Poster
|
Wed 16:30 |
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks Tianyu He · Darshil Doshi · Aritra Das · Andrey Gromov |
|
Poster
|
Fri 16:30 |
A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetics Lennert De Smet · Pedro Zuidberg Dos Martires |
|
Workshop
|
Attention Bias as an Inductive Bias: How to Teach Transformers Simple Arithmetic Shaoxiong Duan · Yining Shi · Wei Xu |
||
Workshop
|
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product Neil Mallinar · Daniel Beaglehole · Libin Zhu · Adityanarayanan Radhakrishnan · Parthe Pandit · Misha Belkin |