Taking Machine Learning Research to Production: Solving Real Problems
This video is also available in the GOTO Play video app! Download it to enjoy offline access to our conference videos while on the move.
Most of the focus in the ML community is on research, which is exciting and important. Equally important however is bringing that research to production applications to solve real-world problems, but the issues and approaches for doing that are often poorly understood.
An ML application in production must address all of the issues of modern software development methodology, as well as issues unique to ML and data science. Often ML applications are developed and trained using tools like notebooks and suffer from inherent limitations in testability, scalability across clusters, training/serving skew, and the modularity and reusability of components. In addition, ML application measurement often emphasizes top level metrics, leading to issues in model fairness as well as predictive performance across user segments. The user experience of any ML application is unique to the model’s performance on that user’s input data, so if the model doesn’t perform well on that particular data segment then the user has a poor experience.
We discuss the use of ML pipeline architectures for implementing production ML applications, and in particular we review Google’s experience with TensorFlow Extended (TFX). Google uses TFX for large scale ML applications, and offers an open-source version to the community. TFX scales to very large training sets and very high request volumes, and enables strong software methodology including testability, hot versioning, and deep performance analysis. Robert Crowe is a data scientist and TFX Developer Advocate at Google and will discuss how developers can move their ML applications to TFX or similar platforms for both training and inference.
What will the audience learn from this talk? The audience will learn about issues and approaches for developing ML applications which are intended for commercial deployments in the real world. Creating production ML applications and the infrastructure to support them is very different from doing ML research, or coding up an ML model to try to achieve a target level of performance. A developer needs to think much more in terms of modern software methodology, with the additional ML aspects to consider as well.
Does it feature code examples and/or live coding? No live coding, but there will be code examples in slides.
Prerequisite attendee experience level: 250-300
-
Space ShuttleStephen CarverMonday Nov 18 @ 08:45
-
Quantum ComputingJessica PointingTuesday Nov 19 @ 09:00
-
Composing Bach Chorales Using Deep LearningFeynman LiangMonday Nov 18 @ 13:20
-
Design For The Utopia You Want, Not The Dystopia You're InChris AthertonMonday Nov 18 @ 17:30
-
Is Business The Key To Making The World A Happier Place?Evan SutterTuesday Nov 19 @ 13:20
-
Party KeynoteSteve WozniakTuesday Nov 19 @ 18:10
-
Extreme Digitalization in ChinaChristina BoutrupWednesday Nov 20 @ 09:00
-
The Promise and Limitations of AIDoug LenatWednesday Nov 20 @ 13:20
-
How to Be Human in the Age of The MachineHannah FryWednesday Nov 20 @ 17:00