

Poster

DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging

Matteo Pagliardini ⋅ Amirkeivan Mohtashami ⋅ François Fleuret ⋅ Martin Jaggi
2024 Poster
