Linked Data in Linguistics for NLP and Web Annotation
Course URL: http://videolectures.net/w3cworkshop2012_hellmann_data/
Lecturer: Sebastian Hellmann
Institution: Leipzig University
Date: 2012-07-22
Language: English
Course description: This presentation introduces three major data pools that have recently been made freely available as Linked Data through a collaborative community process: (1) the DBpedia Internationalization Committee, which is concerned with extracting RDF from the language-specific Wikipedia editions; (2) a configurable extractor, based on DBpedia, that can extract information from all language editions of Wiktionary with manageable effort; (3) the Working Group for Open Linguistic Data, an Open Knowledge Foundation group whose goal is to convert open linguistic data sets to RDF and interlink them. The presentation stresses the role of open licences and RDF in sustaining such pools. It also gives a short update on the recent progress of NIF (Natural Language Processing Interchange Format) within the LOD2 EU project. NIF 2.0 will have many new features, including interoperability with the above-mentioned data pools as well as with major RDF vocabularies such as OLiA, Lemon, and NERD. Furthermore, NIF can be used as an exchange language for Web annotation tools such as AnnotateIt, because it uses robust, Linked Data aware identifiers for annotating websites.
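The description mentions Linked Data aware identifiers for annotating text but does not spell out their form. As a rough illustration only, the sketch below assumes the offset-based `#char=begin,end` fragment style (RFC 5147) that NIF 2.0 later documented. The function name `char_fragment_uri`, the sample sentence, and the `http://example.org/doc1` URI are hypothetical and not taken from the talk.

```python
def char_fragment_uri(document_uri: str, text: str, phrase: str) -> str:
    """Build an offset-based URI for the first occurrence of `phrase` in `text`.

    Assumption: the RFC 5147 style fragment `#char=begin,end`, where `begin`
    and `end` are zero-based character offsets and `end` is exclusive.
    """
    begin = text.find(phrase)
    if begin < 0:
        raise ValueError(f"{phrase!r} does not occur in the text")
    end = begin + len(phrase)
    # Drop any existing fragment so the result carries exactly one.
    base = document_uri.split("#", 1)[0]
    return f"{base}#char={begin},{end}"


# Hypothetical document and sentence, for illustration only.
text = "Leipzig University hosts the DBpedia project."
uri = char_fragment_uri("http://example.org/doc1", text, "DBpedia")
print(uri)
```

Because the identifier is an ordinary URI, an annotation tool could use it as the subject of RDF statements about that substring, for example to link the mention to a DBpedia resource.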
Keywords: linguistic data sets; language interchange
Source: VideoLectures.NET
Data collected: 2020-11-30:zyk
Last reviewed: 2020-11-30:zyk