Skip to yearly menu bar Skip to main content


SpecTr++: Improved transport plans for speculative decoding of large language models

Kwangjun Ahn ⋅ Ahmad Beirami ⋅ Ziteng Sun ⋅ Ananda Theertha Suresh

Abstract

Chat is not available.