首页情报学
   首页医药科学
0


结合信息检索和信息提取的医学情报

Combining Information Retrieval and Information Extraction for Medical Intelligence
课程网址: http://videolectures.net/mmdss07_yangarber_cir/  
主讲教师: Roman Yangarber
开课单位: 纽约大学
开课时间: 2007-12-03
课程语种: 英语
中文简介:
全球流行病和医疗监测是公共卫生机构的基本职能,其主要目的是保护公众免受重大健康威胁。为了有效地执行此功能,需要来自各种来源的及时且准确的医疗信息。在这项工作中,我们提出了一个系统,旨在通过分析网上提供的文本报告(主要是新闻形式)来监测疾病流行病。该系统基于两个主要组件 - 基于信息检索(IR)技术的MedISys和信息提取(IE)系统PULS。医疗信息系统MedISys是一种自动工具,可以从全球数以千计的互联网资源中收集32种语言的公共卫生报告,根据数百个类别对其进行分类,检测不同类别和语言的趋势,并通知用户.Medisys编译关于各种疾病,生物恐怖主义,毒素,细菌,出血热,病毒,药物,水污染,动物疾病,公共卫生组织等的最新报告的定量摘要.3系统根据约200种健康威胁对所有文件进行分类,使用预定义的加权布尔查询或警报。它使用统计程序来检测任何类别中文章数量的突然增加。 MedISys是ECMediaMonitor(EMM)产品系列的一部分[2],由EC的联合研究中心(JRC)开发,其中还包括NewsBrief,4个实时新闻聚合系统和NewsExplorer,5个新闻摘要和分析系统[1]。 MedISys已被证明是一种有用且有效的工具,每天吸引成千上万的用户。 IE技术是进一步增强MedISys功能的自然方向。其中一个原因是IE能够提供有关疾病特定事件的信息,而IR则返回完整匹配的文件(指示发出的警报)。另一个原因是IE可以提高精度,因为基于关键字的查询可能会触发偏离主题的文档但恰好提及不相关上下文中的警报,而IE中的模式匹配确保关键字仅出现在相关上下文中。
课程简介: Global epidemic and medical surveillance is an essential function of Public Health agencies, whose primary aim is to protect the public from major health threats. To perform this function effectively one requires timely and accurate medical information from a wide range of sources. In this work we present a system designed to monitor the disease epidemics by analyzing textual reports, mostly in the form of news, available on the Web. The system rests on two major components—MedISys, based on Information Retrieval (IR) technology, and PULS, an Information Extraction (IE) system. The Medical Information System, MedISys, is an automatic tool that gathers reports concerning Public Health from thousands of Internet sources world-wide in 32 languages, classifies them according to hundreds of categories, detects trends across categories and languages, and notifies users.MedISys compiles quantitative summaries of latest reports on a variety of diseases, bioterrorism, toxins, bacteria, hemorrhagic fevers, viruses, medicines, water contaminations, animal diseases, Public Health organisations, etc.3 The system categorises all documents according to about 200 classes of health threats, using pre-defined weighted boolean queries, or alerts. It uses statistical procedures to detect a sudden increase in the volume of articles in any of the classes. MedISys is part of the EuropeMediaMonitor (EMM) product family [2], developed at the EC’s Joint Research Centre (JRC), which also includes NewsBrief,4 a live news aggregation system, and NewsExplorer,5 a news summary and analysis system [1]. MedISys has already proved to be a useful and an effective tool, which attracts thousands of users daily. IE technology is a natural direction for further enhancing the functionality that MedISys offers. One reason for this is that IE is able to deliver information about specific incidents of the diseases, whereas IR returns entire matched documents (with an indication which alerts fired). Another reason is that IE could boost precision, since keyword-based queries may trigger on documents which are off-topic but happen to mention the alerts in unrelated contexts, while pattern matching in IE assures that the keywords appear in relevant contexts only.
关 键 词: 医疗信息系统; 公众健康; 全球流行病; 公共卫生机构
课程来源: 视频讲座网
最后编审: 2020-09-28:yumf
阅读次数: 67