0


TELIX:一个基于RDF的语言注释模型

TELIX: An RDF-based Model for Linguistic Annotation
课程网址: http://videolectures.net/eswc2012_rubiera_azcona_telix/  
主讲教师: Emilio Rubiera Azcona
开课单位: CTIC基金会
开课时间: 2012-07-04
课程语种: 英语
中文简介:
本文提出将RDF框架应用于语言注释的表示。我们认为RDF是一个合适的数据模型,可以捕获同一文本段上的多个注释,并集成多个注释层。除了使用RDF的思想之外,本文的主要贡献是一个OWL本体,称为telix(文本编码和语言信息交换),它为注释内容建模。这个本体建立在skos~xl词汇表之上,这是一个W3C标准,用于将词汇实体表示为RDF图。我们扩展skos,以获取单词之间的词法关系(例如同义词),并支持词义消歧、形态特征和句法分析等。此外,还定义了特征结构到RDF图的正式映射,从而实现了语言实体的复杂组合。最后,本文还建议使用RDFA作为一种方便的语法,将源文本和语言注释结合在同一个文件中。
课程简介: This paper proposes to apply the RDF framework to the representation of linguistic annotations. We argue that RDF is a suitable data model to capture multiple annotations on the same text segment, and to integrate multiple layers of annotations. Besides the idea of using RDF for this purpose, the main contribution of the paper is an OWL ontology, called TELIX (Text Encoding and Linguistic Information eXchange), which models annotation content. This ontology builds on the SKOS~XL vocabulary, a W3C standard for lexical entities representation as RDF graphs. We extend SKOS in order to capture lexical relations between words (e.g., synonymy), as well as to support word sense disambiguation, morphological features and syntactic analysis, among others. Additionally, a formal mapping of feature structures to RDF graphs is defined, enabling complex composition of linguistic entities. Finally, the paper also suggests the use of RDFa as a convenient syntax that combines source texts and linguistic annotations in the same file.
关 键 词: RDF框架语言; 数据模型; 语义关系
课程来源: 视频讲座网
最后编审: 2019-11-28:lxf
阅读次数: 60