0


关系数据库中的三重轻量级链接数据发布

Triplify - Light-weight Linked Data Publication from Relational Databases
课程网址: http://videolectures.net/www09_auer_tlwldp/  
主讲教师: David Aumueller; Sören Auer; Sebastian Hellmann; Sebastian Dietzold; Jens Lehmann
开课单位: 莱比锡大学
开课时间: 2009-05-20
课程语种: 英语
中文简介:
我们提出了Triplify—一种简单但有效的方法来发布来自关系数据库的链接数据。Triplify基于将HTTP-URI请求映射到关系数据库查询。Triplify将生成的关系转换为RDF语句,并在Web上以各种RDF序列化方式发布数据,特别是作为链接数据。开发Triplify的基本原理是,Web上的大部分信息已经以结构化的形式存储,通常作为关系数据库中包含的数据,但Web应用程序仅以HTML混合结构、布局和内容的形式发布这些数据。为了揭示当前Web背后的纯结构化信息,我们将Triplify实现为一个轻量级的软件组件,可以方便地与众多广泛安装的Web应用程序集成和部署。我们的方法包括一种发布更新日志的方法,以支持对链接数据源的增量爬行。Triplify由一个公共关系模式配置库和一个支持rest的数据源注册中心补充。Triplify配置包含许多流行Web应用程序的映射,包括Wordpress、Drupal、Joomla、Gallery和phpBB。我们表明,尽管Triplify的架构很轻,但它可以发布非常大的数据集,例如来自OpenStreetMap项目的160GB的geo数据。
课程简介: We present Triplify - a simplistic but effective approach to publish linked data from relational databases. Triplify is based on mapping HTTP-URI requests onto relational database queries. Triplify transforms the resulting relations into RDF statements and publishes the data on the Web in various RDF serializations, in particular as Linked Data. The rationale for developing Triplify is that the largest part of information on the Web is already stored in structured form, often as data contained in relational databases but published by Web applications merely as HTML mixing structure, layout and content. In order to reveal the pure structured information behind the current Web we implemented Triplify as a light-weight software component, which can be easily integrated and deployed with the numerous widely installed Web applications. Our approach includes a method for publishing update logs to enable incremental crawling of linked data sources. Triplify is complemented by a library of configurations for common relational schemata and a REST-enabled datasource registry. Triplify configurations are provided containing mappings for many popular Web applications, including Wordpress, Drupal, Joomla, Gallery, and phpBB. We show that despite its light-weight architecture Triplify is usable to publish very large datasets, such as 160GB of geo data from the OpenStreetMap project.
关 键 词: 数据库; 语义Web; RDF-资源描述框架; Web搜索; 计算机科学; 语义网
课程来源: 视频讲座网
最后编审: 2021-01-31:nkq
阅读次数: 42