0


Rosetta:大规模图像文本检测和识别系统

Rosetta: Large Scale System for Text Detection and Recognition in Images
课程网址: http://videolectures.net/kdd2018_borisyuk_rosetta/  
主讲教师: Fedor Borisyuk
开课单位: Facebook
开课时间: 2018-11-23
课程语种: 英语
中文简介:
在本文中,我们提出了一种部署的、可扩展的光学字符识别(OCR)系统,我们称之为Rosetta,旨在处理每天以Facebook规模上传的图像。共享图像内容已成为社交网络(如Facebook)内互联网用户之间交流信息的主要方式之一,对此类媒体(包括其文本信息)的理解对于促进搜索和推荐应用程序至关重要。我们提出了有效检测和识别图像中文本的建模技术,并描述了Rosetta的系统架构。我们对所展示的技术进行了广泛的评估,解释了大规模构建OCR系统的实用方法,并根据系统开发和部署过程中的经验教训,提供了有关某些组件为什么以及如何工作的深刻直觉。
课程简介: In this paper we present a deployed, scalable optical character recognition (OCR) system, which we call Rosetta , designed to process images uploaded daily at Facebook scale. Sharing of image content has become one of the primary ways to communicate information among internet users within social networks such as Facebook, and the understanding of such media, including its textual information, is of paramount importance to facilitate search and recommendation applications. We present modeling techniques for efficient detection and recognition of text in images and describe Rosetta ‘s system architecture. We perform extensive evaluation of presented technologies, explain useful practical approaches to build an OCR system at scale, and provide insightful intuitions as to why and how certain components work based on the lessons learnt during the development and deployment of the system.
关 键 词: 可扩展的光学字符识别; Rosetta; 文本的建模技术; 大规模构建OCR系统
课程来源: 视频讲座网
数据采集: 2023-01-29:cyh
最后编审: 2023-01-30:cyh
阅读次数: 37