0


具有先验信息的聚类

Clustering with Prior Information
课程网址: http://videolectures.net/nipsworkshops09_versteeg_cwp/  
主讲教师: Greg Ver Steeg
开课单位: 加州理工学院
开课时间: 2010-01-19
课程语种: 英语
中文简介:
总之,我们已经通过分析证明,任何小的(但是有限的)半监督量都会通过将检测阈值移动到其最低可能值来抑制种植二分模型的聚类可检测性的相变。对于簇内和簇之间的链接具有不同权重的图,我们发现半监督导致检测阈值取决于ρ。此外,如果J
课程简介: In summary, we have demonstrated analytically that any small (but finite) amount of semi– supervision suppresses the phase transition in cluster detectability for the planted–bisection model, by shifting the detection threshold to its lowest possible value. For graphs where the links within and across the clusters have different weights, we found that semi–supervision leads to a detection threshold that depends on ρ. Furthermore, if J < K, then for ρ → 0+, the detection threshold converges to a value lower (better) from the one obtained via balancing within–cluster and inter–cluster weights. This suggests that for weighted graphs a small [but generic] semi-supervising can be employed for defining the very clustering structure. This definition is non-trivial, since it performs better than the weight-balancing definition. Note also that for weighted graphs the very notion of the detection threshold is not clear a priori, in contrast to unweighted networks, where the only possible definition goes via the connectivity balance α = γ. To illustrate this unclarity, consider a node connected to one cluster via few heavy links, and to another cluster via many light links. To which cluster this node should belong in principle? Our (speculative) answer is that the proper cluster assignment in this case can be defined via semi-supervising.
关 键 词: 半监督量; 阈值; 二分模型
课程来源: 视频讲座网
最后编审: 2019-09-07:lxf
阅读次数: 64