加权k出现对中心度感知分类方法的影响The influence of weighting the k-occurrences on hubness-aware classification methods |
|
课程网址: | http://videolectures.net/sikdd2011_tomasev_hubness/ |
主讲教师: | Nenad Tomašev |
开课单位: | 约瑟夫·斯特凡学院 |
开课时间: | 2011-11-04 |
课程语种: | 英语 |
中文简介: | 中心度是许多高维数据集中存在的现象。它与k个出现的分布(即其他数据点的k个邻居集中的数据点的出现)的偏斜度有关。最近已经提出了几种集中精力利用这种现象的中心意识方法。在本文中,我们通过考虑各个数据点之间的距离,检验了对k个事件进行加权的潜在影响,这对感知中心性的最近邻居方法(更具体地说是hw kNN,h FNN和HIKNN)具有重要意义。我们证明了这种基于距离的加权既有利又有害,并且以不同的方式影响不同的方法。 |
课程简介: | Hubness is a phenomenon present in many highdimensional data sets. It is related to the skewness in the distribution of k-occurrences, i.e. occurrences of data points in k-neighbor sets of other data points. Several hubnessaware methods that focus on exploiting this phenomenon have recently been proposed. In this paper, we examine the potential impact of weighting the k-occurrences, by taking into account the distance between the respective data points, on hubness-aware nearest-neighbor methods, more specifically hw-kNN, h-FNN and HIKNN. We show that such distance-based weighting can be both advantageous and detrimental and that it influences different methods in different ways. |
关 键 词: | 中心度; 高维数据集中存在; 中心意识方法 |
课程来源: | 视频讲座网 |
最后编审: | 2020-06-03:liush |
阅读次数: | 46 |