The NITE XML Toolkit meets the ICSI Meeting Corpus: import, annotation, and browsing
Lecturer: Jean Carletta
Institution: University of Edinburgh
Date: 2007-02-25
Language: English
Description: The NITE XML Toolkit (NXT) provides library support for working with multimodal language corpora. We describe work in progress to explore its potential for the AMI project by applying it to the ICSI Meeting Corpus. We discuss converting existing data into the NXT data format; using NXT’s query facility to explore the corpus; hand-annotation and automatic indexing; and the integration of data obtained by applying NXT-external processes such as parsers.
Keywords: multimodal; language corpora; data integration
