A new area is emerging at the intersection of machine learning (ML) and systems design. Its emergence is driven by the explosive growth of diverse applications of ML in production, the continued growth in data volume, and the complexity of large-scale learning systems. Addressing the challenges at this intersection demands a combination of the right abstractions -- for algorithms, data structures, and interfaces -- as well as scalable systems capable of addressing real-world learning problems.
Designing systems for machine learning presents new challenges and opportunities over the design of traditional data processing systems. For example, what is the right abstraction for data consistency in the context of parallel, stochastic learning algorithms? What guarantees of fault tolerance are needed during distributed learning? The statistical nature of machine learning offers an opportunity for more efficient systems but requires revisiting many of the challenges addressed by the systems and database communities over the past few decades. Machine-learning-focused developments in distributed learning platforms, programming languages, data structures, general-purpose GPU programming, and a wide variety of other domains have had and will continue to have a large impact in both academia and industry.
As the relationship between the machine learning and systems communities has grown stronger, research on using machine learning tools to solve classic systems challenges has also grown. Specifically, as we develop larger and more complex systems and networks for storing, analyzing, serving, and interacting with data, machine learning offers promise for modeling system dynamics, detecting issues, and making intelligent, data-driven decisions within our systems. Machine learning techniques have begun to play critical roles in scheduling, system tuning, and network analysis. By working with systems and database researchers to solve systems challenges, machine learning researchers can both improve their own learning systems and have an impact on the systems community and infrastructure at large.
The goal of this workshop is to bring together experts working at the crossroads of ML, system design and software engineering to explore the challenges faced when building practical large-scale machine learning systems. In particular, we aim to elicit new connections among these diverse fields and to identify tools, best practices and design principles. The workshop will cover ML and AI platforms and algorithm toolkits (Caffe, Torch, TensorFlow, MXNet and parameter server, Theano, etc.), as well as dive into the reality of applying ML and AI in industry with challenges of data and organization scale (with invited speakers from companies like Google, Microsoft, Facebook, Amazon, Netflix, Uber and Twitter).
The workshop will have a mix of invited speakers and reviewed papers with talks, posters and panel discussions to facilitate the flow of new ideas as well as best practices which can benefit those looking to implement large ML systems in academia or industry.
Focal points for discussions and solicited submissions include but are not limited to:
- Systems for online and batch learning algorithms
- Systems for out-of-core machine learning
- Implementation studies of large-scale distributed learning algorithms --- challenges faced and lessons learned
- Database systems for Big Learning --- models and algorithms implemented, properties (fault tolerance, consistency, scalability, etc.), strengths and limitations
- Programming languages for machine learning
- Data driven systems --- learning for job scheduling, configuration tuning, straggler mitigation, network configuration, and security
- Systems for interactive machine learning
- Systems for serving machine learning models at scale
Fri 11:45 p.m. - 12:00 a.m. | Opening Remarks (Talk)
Sat 12:00 a.m. - 12:20 a.m. | Invited Talk: You've been using asynchrony wrong your whole life! (Chris Re, Stanford) (Invited Talk) | Christopher Ré
Sat 12:20 a.m. - 12:40 a.m. | Contributed Talk: Hemingway: Modeling Distributed Optimization Algorithms (Contributed Talk)
Sat 12:40 a.m. - 1:00 a.m. | Invited Talk: Paleo: A Performance Model for Deep Neural Networks (Ameet Talwalkar, UCLA) (Invited Talk) | Ameet S Talwalkar
Sat 1:00 a.m. - 1:40 a.m. | Poster Previews (Lightning Talks)
Sat 2:30 a.m. - 2:50 a.m. | Invited Talk: Scaling Machine Learning Using TensorFlow (Jeff Dean, Google Brain) (Invited Talk) | Jeff Dean
Sat 2:50 a.m. - 3:10 a.m. | Contributed Talk: Demitasse: SPMD Programing Implementation of Deep Neural Network Library for Mobile Devices (Invited Talk)
Sat 3:10 a.m. - 4:30 a.m. | Lunch
Sat 4:30 a.m. - 5:50 a.m. | ML System Updates from Caffe (Andrew Tulloch), Clipper (Daniel Crankshaw), Decision Service (Siddhartha Sen), MXNet (Tianqi Chen), Torch (Soumith Chintala), and VW (John Langford) (Invited Talks)
Sat 5:50 a.m. - 6:10 a.m. | Invited Talk: Optimizing Large-Scale Machine Learning Pipelines with KeystoneML (Tomer Kaftan, UW) (Invited Talk) | Tomer Kaftan
Sat 6:10 a.m. - 6:30 a.m. | Invited Talk: Optimizing Machine Learning and Deep Learning (John Canny, UC Berkeley & Google Research) (Invited Talk) | John Canny
Sat 6:30 a.m. - 7:30 a.m. | Posters & Coffee (Poster Session)
Sat 7:30 a.m. - 7:50 a.m. | Contributed Talk: Yggdrasil: An Optimized System for Training Deep Decision Trees at Scale (Contributed Talk)
Sat 7:50 a.m. - 8:10 a.m. | Contributed Talk: TensorForest: Scalable Random Forests on TensorFlow (Contributed Talk)
Sat 8:10 a.m. - 8:15 a.m. | Closing Remarks (Talk)
Author Information
Aparna Lakshmiratan (Facebook)
I am the PM lead for the AI Platform in Facebook AI (PyTorch 1.0, Data Tools and Developer Ecosystem). Before Facebook, I worked at Microsoft, building and shipping several products, including a new Click Prediction system for Bing Ads, several enhancements to the Speller and Query Alterations engine in Bing, and most recently an interactive machine learning platform for non-experts at Microsoft Research. I have a PhD in Computer Science from MIT.
Li Erran Li (Pony.ai)
Li Erran Li is the head of machine learning at Scale and an adjunct professor at Columbia University. Previously, he was chief scientist at Pony.ai. Before that, he was with the perception team at Uber ATG and machine learning platform team at Uber where he worked on deep learning for autonomous driving, led the machine learning platform team technically, and drove strategy for company-wide artificial intelligence initiatives. He started his career at Bell Labs. Li’s current research interests are machine learning, computer vision, learning-based robotics, and their application to autonomous driving. He has a PhD from the computer science department at Cornell University. He’s an ACM Fellow and IEEE Fellow.
Siddhartha Sen (Microsoft Research)
Sarah Bird (Facebook)
Sarah’s work focuses on research and emerging technology strategy for AI products in Azure. Sarah works to accelerate the adoption and positive impact of AI by bringing together the latest innovations in research with the best of open source and product expertise to create new tools and technologies. Sarah is currently leading Responsible AI for the Azure Cognitive Services. Prior to joining the Cognitive Services, Sarah led the development of responsible AI tools in Azure Machine Learning. She is an active member of the Microsoft AETHER committee, where she works to develop and drive company-wide adoption of responsible AI principles, best practices, and technologies. Sarah was one of the founding researchers in the Microsoft FATE research group and, prior to joining Microsoft, worked on AI fairness at Facebook. Sarah is an active contributor to the open source ecosystem: she co-founded ONNX, Fairlearn, and OpenDP’s SmartNoise, and was a leader in the PyTorch 1.0 and InterpretML projects. She was an early member of the machine learning systems research community and has been active in growing and forming the community. She co-founded the MLSys research conference and the Learning Systems workshops. She has a Ph.D. in computer science from UC Berkeley, advised by Dave Patterson, Krste Asanovic, and Burton Smith.
Hussein Mehanna (Facebook)
More from the Same Authors
-
2022 : The Impact of Symbolic Representations on In-context Learning for Few-shot Reasoning »
Hanlin Zhang · yifan zhang · Li Erran Li · Eric Xing -
2022 : Improving the Strength of Human-Like Models in Chess »
Saumik Narayanan · Kassa Korley · Chien-Ju Ho · Siddhartha Sen -
2022 : Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation »
yifan zhang · Hanlin Zhang · Zachary Lipton · Li Erran Li · Eric Xing -
2021 Workshop: Distribution shifts: connecting methods and applications (DistShift) »
Shiori Sagawa · Pang Wei Koh · Fanny Yang · Hongseok Namkoong · Jiashi Feng · Kate Saenko · Percy Liang · Sarah Bird · Sergey Levine -
2021 : Learning to perceive objects by prediction »
Tushar Arora · Li Erran Li · Mingbo Cai -
2021 Poster: Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess »
Reid McIlroy-Young · Russell Wang · Siddhartha Sen · Jon Kleinberg · Ashton Anderson -
2021 Poster: A Causal Lens for Controllable Text Generation »
Zhiting Hu · Li Erran Li -
2019 : Panel - The Role of Communication at Large: Aparna Lakshmiratan, Jason Yosinski, Been Kim, Surya Ganguli, Finale Doshi-Velez »
Aparna Lakshmiratan · Finale Doshi-Velez · Surya Ganguli · Zachary Lipton · Michela Paganini · Anima Anandkumar · Jason Yosinski -
2019 : Welcome »
Rowan McAllister · Nicholas Rhinehart · Li Erran Li -
2019 Workshop: Machine Learning for Autonomous Driving »
Rowan McAllister · Nicholas Rhinehart · Fisher Yu · Li Erran Li · Anca Dragan -
2019 Workshop: MLSys: Workshop on Systems for ML »
Aparna Lakshmiratan · Siddhartha Sen · Joseph Gonzalez · Dan Crankshaw · Sarah Bird -
2018 : Opening Remark »
Li Erran Li · Anca Dragan -
2018 Workshop: NIPS Workshop on Machine Learning for Intelligent Transportation Systems 2018 »
Li Erran Li · Anca Dragan · Juan Carlos Niebles · Silvio Savarese -
2018 : Closing Remarks »
Aparna Lakshmiratan -
2018 : Welcome »
Sarah Bird -
2018 : Welcome and organisers comments »
Chloé Bakalar · Finnian Lattimore · Sarah Bird · Sendhil Mullainathan -
2018 Workshop: MLSys: Workshop on Systems for ML and Open Source Software »
Aparna Lakshmiratan · Sarah Bird · Siddhartha Sen · Joseph Gonzalez · Daniel Crankshaw -
2018 Workshop: Workshop on Ethical, Social and Governance Issues in AI »
Chloe Bakalar · Sarah Bird · Tiberio Caetano · Edward W Felten · Dario Garcia · Isabel Kloumann · Finnian Lattimore · Sendhil Mullainathan · D. Sculley -
2017 Workshop: 2017 NIPS Workshop on Machine Learning for Intelligent Transportation Systems »
Li Erran Li · Anca Dragan · Juan Carlos Niebles · Silvio Savarese -
2017 : Invited Talk: Creating an Open and Flexible ecosystem for AI models with ONNX, Sarah Bird, Dmytro Dzhulgakov, Facebook Research »
Sarah Bird -
2017 Workshop: ML Systems Workshop @ NIPS 2017 »
Aparna Lakshmiratan · Sarah Bird · Siddhartha Sen · Christopher Ré · Li Erran Li · Joseph Gonzalez · Daniel Crankshaw -
2016 Workshop: Machine Learning for Intelligent Transportation Systems »
Li Erran Li · Trevor Darrell -
2015 : Multiworld Testing »
Sarah Bird