A Large-Scale System for Annotating and Querying Quotations in News Feeds
Course URL: http://videolectures.net/www2010_liang_lss/
Lecturer: Jisheng Liang
Organization: Evri
Date: 2010-05-17
Language: English
Course description: In this paper, we describe a system that automatically extracts quotations from news feeds and supports efficient retrieval of the semantically annotated quotes. APIs for real-time querying of over 10 million quotes extracted from recent news feeds are publicly available. In addition, each day we add around 60,000 new quotes extracted from around 50,000 news articles or blogs. We apply computational linguistic techniques such as co-reference resolution, entity recognition, and disambiguation to improve both the precision and the recall of quote detection. We support faceted search on both the speakers and the entities mentioned in the quotes.
Keywords: computer science; semantic search; systems
Source: VideoLectures.NET
Last reviewed: 2020-06-12 (yumf)
Views: 35