Skip to yearly menu bar Skip to main content


Poster Fri, Dec 5, 2025 • 4:30 PM – 7:30 PM PST

Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding

Pei-Shuo Wang ⋅ Jian-Jia Chen ⋅ Chun-Che Yang ⋅ Chi-Chih Chang ⋅ Ning-Chi Huang ⋅ Mohamed Abdelfattah ⋅ Kai-Chiang Wu

Abstract

Video

Chat is not available.