We introduce a learning-based framework to optimize tensor programs for deep learning workloads. Efficient implementations of tensor operators, such as matrix multiplication and high-dimensional convolution, are key enablers of effective deep learning systems. However, existing systems rely on manually optimized libraries such as cuDNN, where only a narrow range of server-class GPUs is well supported. The reliance on hardware-specific operator libraries limits the applicability of high-level graph optimizations and incurs significant engineering costs when deploying to new hardware targets. We use learning to remove this engineering burden. We learn domain-specific statistical cost models to guide the search of tensor operator implementations over billions of possible program variants. We further accelerate the search by effective model transfer across workloads. Experimental results show that our framework delivers performance competitive with state-of-the-art hand-tuned libraries for low-power CPU, mobile GPU, and server-class GPU.
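As a rough illustration of what "a cost model guiding the search" can mean, the sketch below runs a small model-guided search loop in Python. Everything in it is an assumption made for illustration and is not taken from the paper: the three-parameter tile-size space, the synthetic `measure` function that stands in for running a program variant on hardware, and the k-nearest-neighbor predictor that stands in for a learned statistical cost model.

```python
import itertools
import math
import random

# Hypothetical search space: tile sizes for a three-level loop nest.
TILE_CHOICES = [1, 2, 4, 8, 16, 32, 64]
SPACE = list(itertools.product(TILE_CHOICES, repeat=3))


def measure(config):
    """Stand-in for timing a program variant on hardware (synthetic cost)."""
    x, y, z = (math.log2(t) for t in config)
    return (x - 4) ** 2 + (y - 3) ** 2 + (z - 2) ** 2 + 1.0


def features(config):
    """Map a configuration to a feature vector (log2 of each tile size)."""
    return tuple(math.log2(t) for t in config)


def predict(config, history, k=3):
    """Toy cost model: mean cost of the k nearest measured configurations."""
    f = features(config)
    nearest = sorted(history, key=lambda item: math.dist(f, features(item[0])))[:k]
    return sum(cost for _, cost in nearest) / len(nearest)


def guided_search(rounds=8, batch=8, seed=0):
    """Alternate between ranking unmeasured variants with the model
    and measuring the most promising ones."""
    rng = random.Random(seed)
    # Bootstrap the model with a few randomly chosen measurements.
    history = [(c, measure(c)) for c in rng.sample(SPACE, batch)]
    for _ in range(rounds):
        seen = {c for c, _ in history}
        candidates = [c for c in SPACE if c not in seen]
        # Rank by predicted cost and measure only the top of the ranking.
        candidates.sort(key=lambda c: predict(c, history))
        history += [(c, measure(c)) for c in candidates[:batch]]
    # Return the best (config, cost) pair actually measured.
    return min(history, key=lambda item: item[1])


if __name__ == "__main__":
    best_config, best_cost = guided_search()
    print(best_config, best_cost)
```

The point of the loop is that only a fraction of the space is ever measured (72 of 343 configurations with the defaults here); the model decides which variants are worth the cost of a real measurement.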
Author Information
Tianqi Chen (University of Washington)
Lianmin Zheng (Shanghai Jiao Tong University)
Eddie Yan (University of Washington)
Ziheng Jiang (Fudan University)
Thierry Moreau (University of Washington)
Luis Ceze (University of Washington)
Carlos Guestrin (University of Washington)
Arvind Krishnamurthy (University of Washington)
Related Events (a corresponding poster, oral, or spotlight)
-
2018 Poster: Learning to Optimize Tensor Programs »
Tue. Dec 4th through Wed. Dec 5th, Room 517 AB #104
More from the Same Authors
-
2021 : TenSet: A Large-scale Program Performance Dataset for Learned Tensor Compilers »
Lianmin Zheng · Ruochen Liu · Junru Shao · Tianqi Chen · Joseph Gonzalez · Ion Stoica · Ameer Haj-Ali -
2021 : Learned Compiler Optimizations »
Luis Ceze -
2018 : Poster Session (All Posters) »
Stephen Macke · Hongzi Mao · Caroline Lemieux · Saim Salman · Rishikesh Jha · Hanrui Wang · Shoumik P Palkar · Tianqi Chen · Thomas Pumir · Vaishnav Janardhan · adit bhardwaj · Ed Chi -
2018 : Lunch provided and Open Source ML Systems Showcase (TensorFlow, PyTorch 1.0, MxNET, Keras, CoreML, Ray, Chainer) »
Rajat Monga · Soumith Chintala · Thierry Moreau · Francois Chollet · Daniel Crankshaw · Robert Nishihara · Seiya Tokui -
2018 Poster: Training Deep Models Faster with Robust, Approximate Importance Sampling »
Tyler Johnson · Carlos Guestrin -
2017 : Updates from Current ML Systems (TensorFlow, PyTorch, Caffe2, CNTK, MXNet, TVM, Clipper, MacroBase, ModelDB) »
Rajat Monga · Soumith Chintala · Cha Zhang · Tianqi Chen · Daniel Crankshaw · Kai Sheng Tai · Andrew Tulloch · Manasi Vartak -
2017 Poster: Clustering Billions of Reads for DNA Data Storage »
Cyrus Rashtchian · Konstantin Makarychev · Miklos Racz · Siena Ang · Djordje Jevdjic · Sergey Yekhanin · Luis Ceze · Karin Strauss -
2017 Spotlight: Clustering Billions of Reads for DNA Data Storage »
Cyrus Rashtchian · Konstantin Makarychev · Miklos Racz · Siena Ang · Djordje Jevdjic · Sergey Yekhanin · Luis Ceze · Karin Strauss -
2016 : Invited talk, Carlos Guestrin »
Carlos Guestrin -
2016 Poster: Unified Methods for Exploiting Piecewise Linear Structure in Convex Optimization »
Tyler Johnson · Carlos Guestrin -
2015 Workshop: Machine Learning Systems »
Alex Beutel · Tianqi Chen · Sameer Singh · Elaine Angelino · Markus Weimer · Joseph Gonzalez -
2015 Poster: A Complete Recipe for Stochastic Gradient MCMC »
Yi-An Ma · Tianqi Chen · Emily Fox -
2014 Workshop: High-energy particle physics, machine learning, and the HiggsML data challenge (HEPML) »
Glen Cowan · Balázs Kégl · Kyle Cranmer · Gábor Melis · Tim Salimans · Vladimir Vava Gligorov · Daniel Whiteson · Lester Mackey · Wojciech Kotlowski · Roberto Díaz Morales · Pierre Baldi · Cecile Germain · David Rousseau · Isabelle Guyon · Tianqi Chen -
2014 Poster: Divide-and-Conquer Learning by Anchoring a Conical Hull »
Tianyi Zhou · Jeffrey A Bilmes · Carlos Guestrin -
2012 Demonstration: GraphLab: A Framework For Machine Learning in the Cloud »
Yucheng Low · Haijie Gu · Carlos Guestrin -
2011 Workshop: Big Learning: Algorithms, Systems, and Tools for Learning at Scale »
Joseph E Gonzalez · Sameer Singh · Graham Taylor · James Bergstra · Alice Zheng · Misha Bilenko · Yucheng Low · Yoshua Bengio · Michael Franklin · Carlos Guestrin · Andrew McCallum · Alexander Smola · Michael Jordan · Sugato Basu -
2011 Poster: Linear Submodular Bandits and their Application to Diversified Retrieval »
Yisong Yue · Carlos Guestrin -
2010 Poster: Evidence-Specific Structures for Rich Tractable CRFs »
Anton Chechetka · Carlos Guestrin -
2010 Poster: Inference with Multivariate Heavy-Tails in Linear Models »
Danny Bickson · Carlos Guestrin -
2009 Workshop: Learning with Orderings »
Tiberio Caetano · Carlos Guestrin · Jonathan Huang · Risi Kondor · Guy Lebanon · Marina Meila -
2009 Workshop: Large-Scale Machine Learning: Parallelism and Massive Datasets »
Alexander Gray · Arthur Gretton · Alexander Smola · Joseph E Gonzalez · Carlos Guestrin -
2009 Poster: Riffled Independence for Ranked Data »
Jonathan Huang · Carlos Guestrin -
2009 Spotlight: Riffled Independence for Ranked Data »
Jonathan Huang · Carlos Guestrin -
2007 Oral: Efficient Inference for Distributions on Permutations »
Jonathan Huang · Carlos Guestrin · Leonidas Guibas -
2007 Poster: Efficient Inference for Distributions on Permutations »
Jonathan Huang · Carlos Guestrin · Leonidas Guibas -
2007 Spotlight: Selecting Observations against Adversarial Objectives »
Andreas Krause · H. Brendan McMahan · Carlos Guestrin · Anupam Gupta -
2007 Poster: Selecting Observations against Adversarial Objectives »
Andreas Krause · H. Brendan McMahan · Carlos Guestrin · Anupam Gupta -
2007 Poster: Efficient Principled Learning of Thin Junction Trees »
Anton Chechetka · Carlos Guestrin -
2006 Poster: Distributed Inference in Dynamical Systems »
Stanislav Funiak · Carlos Guestrin · Mark A Paskin · Rahul Sukthankar