Skip to yearly menu bar Skip to main content


(3 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Dec 04 03:30 PM -- 03:50 PM (PST) @ Exhibit Hall F,G,H None
A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders
David Chanin ⋅ James Wilken-Smith ⋅ Tomáš Dulka ⋅ Hardik Bhatnagar ⋅ Satvik Golechha ⋅ Joseph Bloom
[ OpenReview
Oral
Thu Dec 04 03:50 PM -- 04:10 PM (PST) @ Exhibit Hall F,G,H None
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Zihan Qiu ⋅ Zekun Wang ⋅ Bo Zheng ⋅ Zeyu Huang ⋅ Kaiyue Wen ⋅ Songlin Yang ⋅ Rui Men ⋅ Le Yu ⋅ Fei Huang ⋅ Suozhi Huang ⋅ Dayiheng Liu ⋅ Jingren Zhou ⋅ Junyang Lin
[ OpenReview
Oral
Thu Dec 04 04:10 PM -- 04:30 PM (PST) @ Exhibit Hall F,G,H None
Superposition Yields Robust Neural Scaling
Yizhou Liu ⋅ Ziming Liu ⋅ Jeff Gore
[ OpenReview