

Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers

Yingyu Liang · Zhenmei Shi · Zhao Song · Yufa Zhou

Abstract
