Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Machine Learning for Systems

Exploring CXL-based KV Cache Storage for LLM Serving

Yupeng Tang ⋅ Runxiang Cheng ⋅ Ping Zhou ⋅ Tongping Liu ⋅ Fei Liu ⋅ Wei Tang ⋅ Kyoungryun Bae ⋅ Jianjun Chen ⋅ Wu Xiang ⋅ Rui Shi

Abstract

Chat is not available.