Europe Media Monitor (EMM) System
Course URL: http://videolectures.net/wapa2010_goot_emms/
Lecturer: Erik van der Goot
Institution: European Commission
Date: 2010-09-20
Language: English
Course description: The Europe Media Monitor (EMM) is the text gathering and analysis engine underlying a number of European media monitoring and other information analysis applications (e.g. EMM, MediSys) that serve EU policies, especially those concerned with crisis management. The EMM engine consists of a growing number of text analysis and information processing modules that currently perform the following tasks: language detection, known entity extraction, geo-tagging, sentiment analysis, categorization, duplicate detection, clustering, event detection and indexing. The system has a number of information aggregation modules that present the analysis results per category, per country, as a story, etc. In the case of the EMM NewsBrief, the system harvests and analyses around 100,000 news articles per day in 40 languages from around 5,500 RSS feeds and HTML pages, and sorts them into approximately 1,000 categories defined by 35,000 keywords and keyword combinations. The system is developed and operated by the European Commission's Joint Research Centre (JRC). The presentation will focus on the history, development and architecture of the system.
Keywords: computer science; text mining; text gathering
Source: VideoLectures.NET
Last reviewed: 2021-12-21:liyy
Views: 86