Realizing Personal and Enterprise AI Twins - Lenovo
Oguz Elibol
Abstract
In this talk, we outline our progress toward realizing Personal and Enterprise AI Twins using a Hybrid Compute paradigm. We will discuss specific advancements in our on-device and cloud agents, focusing on improved tool calling, information retrieval, and data-driven optimization. Furthermore, we will present results on enhancing speculative decoding methodologies to accelerate inference, along with model routing strategies designed to balance cost, latency, and performance.
Successful Page Load