首页考古学
   首页历史学
   首页语言学
0


通过中心语言进行多语言文档检索

Multilingual Document Retrieval Through Hub Languages
课程网址: http://videolectures.net/is2012_skraba_hub_languages/  
主讲教师: Primož Škraba
开课单位: 约瑟夫·斯特凡学院
开课时间: 2012-11-16
课程语种: 英语
中文简介:
在本文中,我们扩展了以前关于跨多语言语料库的文档检索的工作。在此设置中,通常假设我们具有一定的对齐,基于此我们可以学习空间之间的映射。然而,在真正的多语言语料库中,我们通常不会在所有语言之间进行对齐。有一些中心语言与许多其他语言对齐。我们研究了利用这些对齐来学习可能具有很小或没有对齐的地图的有效性。我们测试了几种方法,并在维基百科数据集上研究了各种方法的性能。
课程简介: In this paper we extend previous work on document retrieval across multilingual corpora. In this setting it is often assumed that we have a certain alignment given based on which we can learn mapping between spaces. In true multilingual corpora however, we often do not have alignments between all languages. There are hub languages which have alignments with many other languages. We look at the effectiveness of leveraging these alignments to learn maps which may have small or no alignments given. We test several methods and investigate the performance of various approaches on theWikipedia dataset.
关 键 词: 语言语料库; 文档检索; 空间映射
课程来源: 视频讲座网
最后编审: 2020-07-29:yumf
阅读次数: 62