LinkedCT中临床试验数据的自动处理Automatic Curation of Clinical Trials Data in LinkedCT |
|
课程网址: | http://videolectures.net/iswc2015_hassanzadeh_automatic_curation/ |
主讲教师: | Oktie Hassanzadeh |
开课单位: | IBM Thomas J.Watson研究中心 |
开课时间: | 2015-11-10 |
课程语种: | 英语 |
中文简介: | 链接临床试验(LinkedCT)项目始于2008年,目标是提供临床试验的链接数据源。数据来源来自ClinicalTrials.gov上发布的XML数据,ClinicalTrials是一个国际临床研究注册中心。自最初发布以来,LinkedCT项目经历了一些重大变化,以提高数据的质量和新鲜度。结果是一个高质量的临床研究链接数据源,每天更新,目前包含超过19.5万个试验、460万个实体和4200万个三元组。在本文中,我们详细描述了该系统,并简要概述了将原始XML数据管理为高质量链接数据所涉及的技术挑战。我们还提供了使用统计数据和外部开发的一些有趣的用例。我们分享了在当前系统的设计和实施中吸取的经验教训,以及我们对该项目未来计划的概述,其中包括使系统开源和使数据免费用于商业用途。 |
课程简介: | The Linked Clinical Trials (LinkedCT) project started back in 2008 with the goal of providing a Linked Data source of clinical trials. The source of the data is from the XML data published on ClinicalTrials.gov, which is an international registry of clinical studies. Since the initial release, the LinkedCT project has gone through some major changes to both improve the quality of the data and its freshness. The result is a high-quality Linked Data source of clinical studies that is updated daily, currently containing over 195,000 trials, 4.6 million entities, and 42 million triples. In this paper, we present a detailed description of the system along with a brief outline of technical challenges involved in curating the raw XML data into high-quality Linked Data. We also present usage statistics and a number of interesting use cases developed by external parties. We share the lessons learned in the design and implementation of the current system, along with an outline of our future plans for the project which include making the system open-source and making the data free for commercial use. |
关 键 词: | 链接临床试验; 技术挑战; 商业用途 |
课程来源: | 视频讲座网 |
数据采集: | 2023-03-06:chenjy |
最后编审: | 2023-03-06:chenjy |
阅读次数: | 31 |