0


噪声阀值的谱聚类算法

Noise Thresholds for Spectral Clustering
课程网址: http://videolectures.net/nips2011_balakrishnan_clustering/  
主讲教师: Sivaraman Balakrishnan
开课单位: 卡内基梅隆大学
开课时间: 2012-09-06
课程语种: 英语
中文简介:
虽然谱聚类在机器学习中取得了相当大的经验成功,但其理论性质尚未得到充分发展。分析了一种谱聚类算法的性能,表明在一类层次结构相似矩阵上,该算法能够承受随数据点数增长的噪声,同时还能很好地恢复高概率的层次聚类。此外,我们还对以前的k路谱聚类结果进行了改进,得出了谱聚类不出错的条件。此外,利用极大极小分析,我们得到了聚类问题的严格上下界,并将光谱聚类的性能与这些信息理论极限进行了比较。我们还提供了模拟和现实数据的实验,说明了我们的结果。
课程简介: Although spectral clustering has enjoyed considerable empirical success in machine learning, its theoretical properties are not yet fully developed. We analyze the performance of a spectral algorithm for hierarchical clustering and show that on a class of hierarchically structured similarity matrices, this algorithm can tolerate noise that grows with the number of data points while still perfectly recovering the hierarchical clusters with high probability. We additionally improve upon previous results for k-way spectral clustering to derive conditions under which spectral clustering makes no mistakes. Further, using minimax analysis, we derive tight upper and lower bounds for the clustering problem and compare the performance of spectral clustering to these information theoretic limits. We also present experiments on simulated and real world data illustrating our results.
关 键 词: 机器学习; 分层集群; 极大极小分析; k谱聚类
课程来源: 视频讲座网
最后编审: 2020-07-29:yumf
阅读次数: 90