0


环境法律文件的文件嵌入模型

Document Embedding Models on Environmental Legal Documents
课程网址: http://videolectures.net/sikdd2019_urbancic_environmental_legal_d...  
主讲教师: Živa Urbančič
开课单位: Jožef Stefan研究所
开课时间: 2019-11-14
课程语种: 英语
中文简介:
基于上下文在大型文档库中查找相似文档有很多实际应用,特别是在法律领域。在这篇文章中,我们的重点是收集在大约30万份文件的数据库中的与环境法有关的文件。我们分析了数据库中不同表示模型(称为文档嵌入)的性能,发现由于数据库的大小,评估结果很困难。本文提出的方法也适用于其他文本数据集。
课程简介: Finding similar documents in a big document corpus based on context has many practical applications especially in the legal sector. In this paper, our focus is on the documents related to environmental law which have been collected in a database of approximately 300k documents. We analyzed the performance of different representation models (called document embeddings) on our database and found that evaluating the results is difficult, due to the size of the database. The approaches presented can be applicable for other text datasets.
关 键 词: 环境法律文件; 数据挖掘; 文件嵌入模型; 人工智能
课程来源: 视频讲座网
数据采集: 2022-09-14:cyh
最后编审: 2022-09-19:cyh
阅读次数: 19