0


图分类的半监督特征选择

Semi-supervised Feature Selection for Graph Classification
课程网址: http://videolectures.net/kdd2010_kong_ssfs/  
主讲教师: Xiangnan Kong
开课单位: 伍斯特理工学院
开课时间: 2010-10-01
课程语种: 英语
中文简介:
图分类问题在过去十年中引起了极大的兴趣。目前对图分类的研究假设存在大量标记的训练图。然而,在许多应用中,图形数据的标签非常昂贵或难以获得,而通常存在大量未标记的图形数据。在本文中,我们研究了用于图分类的半监督特征选择问题,并提出了一种新的解决方案,称为gSSC,以有效地搜索带有标记和未标记图的最佳子图特征。与给定假设特征集的向量空间中的现有特征选择方法不同,我们与子图特征挖掘过程一起以渐进方式对图形数据执行半监督特征选择。我们推导出一个名为gSemi的特征评估标准,用于基于标记图和未标记图来估计子图特征的有用性。然后,我们提出了一种分支定界算法,通过明智地修剪子图搜索空间来有效地搜索最佳子图特征。对几个现实世界任务的实证研究表明,我们的半监督特征选择方法可以通过半监督特征选择有效地提高图分类性能,并且通过使用标记和未标记图修剪子图搜索空间非常有效。
课程简介: The problem of graph classification has attracted great interest in the last decade. Current research on graph classification assumes the existence of large amounts of labeled training graphs. However, in many applications, the labels of graph data are very expensive or difficult to obtain, while there are often copious amounts of unlabeled graph data available. In this paper, we study the problem of semi-supervised feature selection for graph classification and propose a novel solution, called gSSC, to efficiently search for optimal subgraph features with labeled and unlabeled graphs. Different from existing feature selection methods in vector spaces which assume the feature set is given, we perform semi-supervised feature selection for graph data in a progressive way together with the subgraph feature mining process. We derive a feature evaluation criterion, named gSemi, to estimate the usefulness of subgraph features based upon both labeled and unlabeled graphs. Then we propose a branch-and-bound algorithm to efficiently search for optimal subgraph features by judiciously pruning the subgraph search space. Empirical studies on several real-world tasks demonstrate that our semi-supervised feature selection approach can effectively boost graph classification performances with semi-supervised feature selection and is very efficient by pruning the subgraph search space using both labeled and unlabeled graphs.
关 键 词: 图分类; 半监督特征选择; 分支定界算法
课程来源: 视频讲座网
最后编审: 2019-05-11:lxf
阅读次数: 104