0


知识图识别

Knowledge Graph Identification
课程网址: http://videolectures.net/iswc2013_pujara_graph_identification/  
主讲教师: Jay Pujara
开课单位: 马里兰大学
开课时间: 2013-11-28
课程语种: 英语
中文简介:
大型信息处理系统能够提取大量相互关联的事实,但是不幸的是,将这些候选事实转化为有用的知识是一个巨大的挑战。在本文中,我们展示了如何将关于实体及其关系的不确定性提取转换为知识图。提取形成一个提取图,我们将参考删除噪声,推断缺失信息并确定哪些候选事实应包括在知识图中作为知识图识别的任务。为了执行此任务,我们必须对候选事实及其关联的提取置信度进行联合推理,确定共指实体,并纳入本体约束。我们提出的方法使用概率软逻辑(PSL),这是最近引入的概率建模框架,可以轻松扩展到数百万个事实。我们在源自MusicBrainz音乐社区的合成链接数据语料库和来自NELL项目的现实世界中的提取集(包含超过1M提取和70K本体论关系)上证明了我们方法的强大功能。我们证明,与现有方法相比,我们的方法能够以更低的运行时间实现改进的AUC和F1。
课程简介: Large-scale information processing systems are able to extract massive collections of interrelated facts, but unfortunately transforming these candidate facts into useful knowledge is a formidable challenge. In this paper, we show how uncertain extractions about entities and their relations can be transformed into a knowledge graph. The extractions form an extraction graph and we refer to the task of removing noise, inferring missing information, and determining which candidate facts should be included into a knowledge graph as knowledge graph identification. In order to perform this task, we must reason jointly about candidate facts and their associated extraction confidences, identify co-referent entities, and incorporate ontological constraints. Our proposed approach uses probabilistic soft logic (PSL), a recently introduced probabilistic modeling framework which easily scales to millions of facts. We demonstrate the power of our method on a synthetic Linked Data corpus derived from the MusicBrainz music community and a real-world set of extractions from the NELL project containing over 1M extractions and 70K ontological relations. We show that compared to existing methods, our approach is able to achieve improved AUC and F1 with significantly lower running time.
关 键 词: 概率建模; 图识别
课程来源: 视频讲座网
数据采集: 2020-11-18:zyk
最后编审: 2020-11-18:zyk
阅读次数: 49