0


动员元数据:用于东非语言资源开发的开放数据工具包 (ODK)

Mobilizing Metadata: Open Data Kit (ODK) for Language Resource Development in East Africa
课程网址: http://videolectures.net/rail2020_griscom_mobilizing_metadata/  
主讲教师: Richard T. Griscom
开课单位: 莱顿大学
开课时间: 2020-03-20
课程语种: 英语
中文简介:
语言领域工作者收集和归档元数据作为他们创建的语言资源 (LR) 的一部分,但他们通常在资源受限的环境中工作,这会阻止他们使用计算机进行数据输入。在这种情况下,语言学家必须完成耗时且容易出错的数字化任务,这些任务限制了他们产生的资源和元数据的数量和质量(Thieberger & Berez 2012;Margetts & Margetts 2012)。本文介绍了一种使用开放数据工具包 (ODK) 平台将语言元数据输入移动设备的方法,该平台是为移动数据收集而设计的一套开源工具。该方法被纳入坦桑尼亚的两个基于社区的语言文档项目,涉及 12 名研究人员同时在四个行政区域收集数据(Griscom & Harvey 2019)。通过识别项目特定的数据相关性和冗余,元数据输入系统内置了许多效率。这些包括使用封闭的词汇表、用于不同数据收集器类别的独特数据输入表格,以及用于输入参与者和资源元数据的单独表格。由此产生的系统作为通用双语英语-斯瓦希里语元数据输入工具的持续开发的基础,可供在东非工作的其他研究人员使用。
课程简介: Linguistic fieldworkers collect and archive metadata as part of the language resources (LRs) that they create, but they often work inresource-constrained environments that prevent them from using computers for data entry. In such situations, linguists must completetime-consuming and error-prone digitization tasks that limit the quantity and quality of the resources and metadata that they produce(Thieberger & Berez 2012; Margetts & Margetts 2012). This paper describes a method for entering linguistic metadata into mobiledevices using the Open Data Kit (ODK) platform, a suite of open source tools designed for mobile data collection. The method wasincorporated into two community-based language documentation projects in Tanzania, involving twelve researchers simultaneouslycollecting data in four administrative regions (Griscom & Harvey 2019). Through the identification of project-specific datadependencies and redundancies, a number of efficiencies were built into the metadata entry system. These include the use of closedvocabularies, unique data entry forms for distinct data collector categories, and separate forms for entering participant and resourcemetadata. The resulting system serves as the basis for the ongoing development of general purpose bilingual English-Swahili metadataentry tools, to be made available for use by other researchers working in East Africa.
关 键 词: 语言资源; 开放数据; 语元数据
课程来源: 视频讲座网
数据采集: 2022-03-30:hqh
最后编审: 2022-03-30:hqh
阅读次数: 45