Skip to yearly menu bar Skip to main content


CTR-BERT: Cost-effective knowledge distillation for billion-parameter teacher models

Aashiq Muhamed · Iman Keivanloo · Sujan Perera · James Mracek · Yi Xu · Qingjun Cui · Santosh Rajagopalan · Belinda Zeng · Trishul Chilimbi

Abstract

Video

Chat is not available.