

San Diego Poster Fri, Dec 5, 2025 • 4:30 PM – 7:30 PM PST Exhibit Hall C,D,E #715

Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding

Pei-Shuo Wang · Jian-Jia Chen · Chun-Che Yang · Chi-Chih Chang · Ning-Chi Huang · Mohamed Abdelfattah · Kai-Chiang Wu

Abstract
