0


在大型数据集中挖掘,索引和搜索图形

Mining, Indexing, and Searching Graphs in Large Data Sets
课程网址: http://videolectures.net/mlg07_han_miasg/  
主讲教师: Jiawei Han
开课单位: 伊利诺伊大学
开课时间: 2007-09-06
课程语种: 英语
中文简介:
最近对模式发现的研究已经从挖掘频繁项目集和序列发展到挖掘结构化模式,包括树,格子和图形。作为一般数据结构,图形可以模拟数据之间复杂的关系,广泛应用于Web,社交网络分析和生物信息学。然而,由于存在指数数量的频繁子图,因此在图数据库中挖掘和搜索大图是具有挑战性的。  在本次演讲中,我们介绍了我们最近在开发高效且可扩展的方法,以便在大型数据库中挖掘和搜索图表的过程。我们介绍了gSpan和CloseGraph,这是在图数据库中挖掘频繁图形模式的两种有效方法。然后我们引入基于约束的图挖掘方法。此外,我们引入了图索引方法,gIndex和图近似搜索方法grafil,它们都利用频繁图挖掘来构造紧凑但高效的图索引,并用这种索引结构进行相似搜索。这些方法不仅有助于在海量数据集中挖掘和查询图形模式,还可以在其他领域(包括DB / OS系统和软件工程)中获得广泛的应用。
课程简介: Recent research on pattern discovery has progressed from mining frequent itemsets and sequences to mining structured patterns including trees, lattices, and graphs. As a general data structure, graph can model complicated relations among data with wide applications in Web, social network analysis, and bioinformatics. However, mining and searching large graphs in graph databases is challenging due to the presence of an exponential number of frequent subgraphs. In this talk, we present our recent progress on developing efficient and scalable methods for mining and searching of graphs in large databases. We introduce gSpan and CloseGraph, two efficient methods for mining frequent graph patterns in graph databases. Then we introduce constraint-based graph mining methods. Further, we introduce a graph indexing method, gIndex, and a graph approximate searching method, grafil, both taking advantages of frequent graph mining to construct a compact but highly effective graph index and perform similarity search with such indexing structures. These methods not only facilitate mining and querying graph patterns in massive datasets but also claim broad applications in other fields, including DB/OS systems and software engineering.
关 键 词: 结构化模式; 频繁子图; 图挖掘方法
课程来源: 视频讲座网
最后编审: 2020-06-15:heyf
阅读次数: 39