0


在不同主题领域采用关联数据最佳实践

Adoption of Linked Data Best Practices in Different Topical Domains
课程网址: http://videolectures.net/iswc2014_bizer_topical_domains/  
主讲教师: Christian Bizer
开课单位: 柏林自由大学
开课时间: 2014-12-19
课程语种: 英语
中文简介:

链接数据的中心思想是数据发布者通过遵循链接,词汇用法和元数据提供方面的一组最佳实践来支持应用程序发现和集成数据。 2011年,LOD云状态报告分析了不同主题领域内链接数据集对这些最佳实践的采用情况。该报告基于数据集发布者自己通过datahub.io链接数据目录提供的信息。在本文中,我们基于2014年4月对链接数据网络的抓取,回顾并更新了2011年LOD云状态报告的结果。我们分析了采用不同最佳实践的方式,并对此进行了概述数据集之间的联系关系以更新的LOD云图的形式出现,这次不是基于数据集提供者提供的信息,而是基于链接数据搜寻器实际上可以检索到的数据。除其他外,我们发现在2011年至2014年之间,链接数据集的数量大约增加了一倍,在描述某些类型实体的通用词汇上达成了越来越多的共识,并且数据源仍然很少提供出处和许可证元数据。 / p>

课程简介: The central idea of Linked Data is that data publishers support applications in discovering and integrating data by complying to a set of best practices in the areas of linking, vocabulary usage, and metadata provision. In 2011, the State of the LOD Cloud report analyzed the adoption of these best practices by linked datasets within different topical domains. The report was based on information that was provided by the dataset publishers themselves via the datahub.io Linked Data catalog. In this paper, we revisit and update the findings of the 2011 State of the LOD Cloud report based on a crawl of the Web of Linked Data conducted in April 2014. We analyze how the adoption of the different best practices has changed and present an overview of the linkage relationships between datasets in the form of an updated LOD cloud diagram, this time not based on information from dataset providers, but on data that can actually be retrieved by a Linked Data crawler. Among others, we find that the number of linked datasets has approximately doubled between 2011 and 2014, that there is increased agreement on common vocabularies for describing certain types of entities, and that provenance and license metadata is still rarely provided by the data sources.
关 键 词: 链接数据; 数据集
课程来源: 视频讲座网
数据采集: 2020-11-11:zyk
最后编审: 2020-11-11:zyk
阅读次数: 24