Fast and Accurate Language Model Decoding via Parallel Token Processing
Zhepei Wei · Wei-Lin Chen · Xinyu Zhu · Yu Meng
2024 Oral
in
Workshop: Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning
in
Workshop: Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning
Video
Chat is not available.
Successful Page Load