0


关于链接开放数据云的显式和隐式模式信息的系统研究

A Systematic Investigation of Explicit and Implicit Schema Information on the Linked Open Data Cloud
课程网址: http://videolectures.net/eswc2013_gottron_cloud/  
主讲教师: Olaf Hartig, Thomas Gottron
开课单位: 科布伦茨 - 兰道大学
开课时间: 2013-07-08
课程语种: 英语
中文简介:
有关链接开放数据(LOD)云中资源的模式信息可以通过双重方式提供:可以通过将RDF类型附加到资源来明确定义。或者通过资源属性的定义隐式提供。在本文中,我们提出了一种方法和指标来分析信息理论属性和模式信息的两种表现形式之间的相互关系。此外,我们实际上对大规模链接数据集执行此类分析。为此,我们提取了有关为Billion Triples Challenge 2012提供的数据集段中定义的类型和属性的架构信息。我们进行了深入分析,并计算了各种熵度量以及编码的互信息。两种类型的架构信息。我们的分析提供了对不同模式特征中编码的信息的深入了解。两个主要发现是隐式模式信息更具有判别性,并且涉及基于类型或属性的模式信息的应用程序仅捕获数据中包含的模式信息的63.5%和88.1%之间。基于这些观察,我们得出了关于LOD未来模式设计以及潜在应用场景的结论。
课程简介: Schema information about resources in the Linked Open Data (LOD) cloud can be provided in a twofold way: it can be explicitly defined by attaching RDF types to the resources. Or it is provided implicitly via the definition of the resources’ properties. In this paper, we present a method and metrics to analyse the information theoretic properties and the correlation between the two manifestations of schema information. Furthermore, we actually perform such an analysis on large-scale linked data sets. To this end, we have extracted schema information regarding the types and properties defined in the data set segments provided for the Billion Triples Challenge 2012. We have conducted an in depth analysis and have computed various entropy measures as well as the mutual information encoded in the two types of schema information. Our analysis provides insights into the information encoded in the different schema characteristics. Two major findings are that implicit schema information is far more discriminative and that applications involving schema information based on either types or properties alone will only capture between 63.5% and 88.1% of the schema information contained in the data. Based on these observations, we derive conclusions about the design of future schemas for LOD as well as potential application scenarios.
关 键 词: 链接开放数据; 云中资源; 信息理论属性
课程来源: 视频讲座网
最后编审: 2019-04-14:lxf
阅读次数: 24