0


为数据源分配语义标签

Assigning Semantic Labels to Data Sources
课程网址: http://videolectures.net/eswc2015_krishnamurthy_data_sources/  
主讲教师: Ramnandan Krishnamurthy
开课单位: 印度马德拉斯理工学院
开课时间: 2015-07-15
课程语种: 英语
中文简介:
对于能够发现和集成异构数据源有着巨大的需求,这需要将源的属性映射到域本体中定义的概念和关系。在本文中,我们提出了一种找到这些映射的新方法,我们称之为语义标记。先前的方法通常通过使用监督机器学习技术基于从数据中提取的特征来学习模型,从而单独映射每个数据值。我们的方法与现有方法的不同之处在于,我们对与语义标签相对应的数据值采取整体视图,并使用集体处理这些数据的技术,这使得可以从整体上捕捉与语义标签相关联的值的特征属性。我们的方法支持文本和数字数据,并提出了前k个语义标签及其相关的置信度分数。我们的实验表明,与现有系统相比,该方法具有更高的标签预测精度、更低的时间复杂度和更大的可扩展性。
课程简介: There is a huge demand to be able to find and integrate heterogeneous data sources, which requires mapping the attributes of a source to the concepts and relationships defined in a domain ontology. In this paper, we present a new approach to find these mappings, which we call semantic labeling. Previous approaches map each data value individually, typically by learning a model based on features extracted from the data using supervised machine-learning techniques. Our approach differs from existing approaches in that we take a holistic view of the data values corresponding to a semantic label and use techniques that treat this data collectively, which makes it possible to capture characteristic properties of the values associated with a semantic label as a whole. Our approach supports both textual and numeric data and proposes the top k semantic labels along with their associated confidence scores. Our experiments show that the approach has higher label prediction accuracy, has lower time complexity, and is more scalable than existing systems.
关 键 词: 集成异构; 学习模型; 语义标签
课程来源: 视频讲座网
数据采集: 2022-12-12:chenjy
最后编审: 2022-12-12:chenjy
阅读次数: 18