基于生成表达式约束的开放数据知识解码Generative Expression Constrained Knowledge-based decoding for Open data |
|
课程网址: | https://videolectures.net/eswc2024_kruit_open_data/ |
主讲教师: | Benno Kruit |
开课单位: | 2024年上海世博会 |
开课时间: | 2024-06-18 |
课程语种: | 英语 |
中文简介: | 本文介绍了GECKO,一个用于荷兰统计局(中央统计局)数据的知识图问答(KGQA)系统。QA在生成相关答案以及防止幻觉方面带来了巨大的挑战。这是在语言模型中发现的一种现象,当仅使用这些模型进行事实QA时会产生问题。为了克服这些局限性,荷兰统计局使用公开可用的OData4数据创建了一个知识图,其中答案生成解码过程是基础的,确保了忠实的答案。在处理问题时,GECKO执行实体和模式检索,进行模式约束表达式解码,在需要时做出假设,并将生成的表达式作为OData4查询执行以检索信息。实现了一种使用编码器-解码器模型执行基于约束知识的表达式解码的新方法。对稀疏和密集实体检索方法进行了评估。虽然编码器-解码器模型没有达到生产就绪性能,但实验表明,使用稀疏实体检索器的基于规则的基线具有很好的结果。此外,定性用户测试的结果为阳性。因此,我们制定了部署建议,帮助指导荷兰统计局数据的用户更快地找到答案。 |
课程简介: | In this paper, we present GECKO, a knowledge graph question answering (KGQA) system for data from Statistics Netherlands (Centraal Bureau voor de Statistiek). QA poses great challenges in means of generating relevant answers, as well as preventing hallucinations. This is a phenomenon found in language models and creates issues when attempting factual QA with these models alone. To overcome these limitations, the Statistics Netherlands’ publicly available OData4 data was used to create a knowledge graph, in which the answer generation decoding process is grounded, ensuring faithful answers. When processing a question, GECKO performs entity and schema retrieval, does schema-constrained expression decoding, makes assumptions where needed and executes the generated expression as an OData4 query to retrieve information. A novel method was implemented to perform the constrained knowledge-based expression decoding using an encoder-decoder model. Both a sparse and dense entity retrieval method were evaluated. While the encoder-decoder model did not achieve production-ready performance, experiments show promising results for a rule-based baseline using a sparse entity retriever. Additionally, the results of qualitative user testing were positive. We therefore formulate recommendations for deployment help guide users of Statistics Netherlands data to their answers more quickly. |
关 键 词: | 生成表达式; 开放数据; 知识解码 |
课程来源: | 视频讲座网 |
数据采集: | 2024-08-08:liyq |
最后编审: | 2024-08-08:liyq |
阅读次数: | 37 |