0


TFX:一个基于TensorFlow的生产规模机器学习平台

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform
课程网址: https://videolectures.net/videos/kdd2017_cheng_machine_learning  
主讲教师: Heng-­Tze Cheng
开课单位: KDD 2017研讨会
开课时间: 2017-10-09
课程语种: 英语
中文简介:
创建和维护一个可靠地生成和部署机器学习模型的平台需要仔细编排许多组件——一个基于训练数据生成模型的学习器,一个用于分析和验证数据和模型的模块,最后是用于在生产中服务模型的基础设施。当数据随时间变化并且需要不断生成新模型时,这变得尤为具有挑战性。不幸的是,这种编排通常是使用单个团队为特定用例开发的粘合代码和自定义脚本临时完成的,导致重复工作和技术债务高的脆弱系统。我们介绍了一个通用机器学习平台的解剖结构,以及谷歌的一个实现。通过将上述组件集成到一个平台中,我们能够标准化组件,简化平台配置,并将生产时间从几个月缩短到几周,同时提供平台稳定性,最大限度地减少服务中断。我们展示了在Google Play应用商店中部署该平台的一个案例研究,其中机器学习模型随着新数据的到来而不断刷新。部署该平台减少了自定义代码,加快了实验周期,并且由于改进了数据和模型分析,应用程序安装量增加了2%。
课程简介: Creating and maintaining a platform for reliably producing and deploying machine learning models requires careful orchestration of many components—-a learner for generating models based on training data, modules for analyzing and validating both data as well as models, and finally infrastructure for serving models in production. This becomes particularly challenging when data changes over time and fresh models need to be produced continuously. Unfortunately, such orchestration is often done ad hoc using glue code and custom scripts developed by individual teams for specific use cases, leading to duplicated effort and fragile systems with high technical debt. We present the anatomy of a general-purpose machine learning platform and one implementation of such a platform at Google. By integrating the aforementioned components into one platform, we were able to standardize the components, simplify the platform configuration, and reduce the time to production from the order of months to weeks, while providing platform stability that minimizes service disruptions. We present the case study of one deployment of the platform in the Google Play app store, where the machine learning models are refreshed continuously as new data arrive. Deploying the platform led to reduced custom code, faster experiment cycles, and a 2% increase in app installs resulting from improved data and model analysis.
关 键 词: 机器学习平台; 训练数据; 模型分析
课程来源: 视频讲座网
数据采集: 2024-12-25:liyq
最后编审: 2024-12-25:liyq
阅读次数: 13