首页语言学
   首页外国语言
0


应对资源不足语言的多语言资源开发挑战:非洲 Wordnet 项目的案例

Navigating Challenges of Multilingual Resource Development for Under-Resourced Languages: The Case of the African Wordnet Project
课程网址: http://videolectures.net/rail2020_griesel_grieselof_multilingual/  
主讲教师: Marissa Griesel
开课单位: 南非大学
开课时间: 2020-03-20
课程语种: 英语
中文简介:
创建一个新的 wordnet 绝非易事,当目标语言资源不足时,就像目前包含在多语言非洲 Wordnet (AfWN) 中的语言一样,开发人员需要严重依赖人类的专业知识。在 AfWN 开发的不同阶段,我们采用了各种快速跟踪方法,以减轻繁琐且耗时的工作。一些方法已被证明是有效的,而另一些方法似乎对工作率几乎没有积极影响。与许多其他资源不足的语言一样,扩展模型是贯穿始终的,因此依赖于英语源数据,例如英语普林斯顿 Wordnet (PWN),然后假设新语言共享该语言,将其翻译成目标语言具有 PWN 的底层结构。本文讨论了在此过程中遇到的一些问题,并指出了(半)自动化质量保证措施和进一步完善 AfWN 以确保加速增长的各种可能性。在本文中,我们旨在强调从实践经验中汲取的一些经验教训,以促进类似项目,特别是其他非洲国家的语言项目。
课程简介: Creating a new wordnet is by no means a trivial task and when the target language is under-resourced as is the case for the languages currently included in the multilingual African Wordnet (AfWN), developers need to rely heavily on human expertise. During the different phases of development of the AfWN, we incorporated various methods of fast-tracking to ease the tedious and time-consuming work. Some methods have proven effective while others seem to have little positive impact on the work rate. As in the case of many other under-resourced languages, the expand model was implemented throughout, thus depending on English source data such as the English Princeton Wordnet (PWN) which is then translated into the target language with the assumption that the new language shares an underlying structure with the PWN. The paper discusses some problems encountered along the way and points out various possibilities of (semi) automated quality assurance measures and further refinement of the AfWN to ensure accelerated growth. In this paper we aim to highlight some of the lessons learnt from hands-on experience in order to facilitate similar projects, in particular for languages from other African countries.
关 键 词: 扩展模型; 多语言非洲; 语言翻译
课程来源: 视频讲座网
数据采集: 2022-03-30:hqh
最后编审: 2022-03-30:hqh
阅读次数: 29