0


精确的最近邻搜索的随机分配树

Randomized partition trees for exact nearest neighbor search
课程网址: http://videolectures.net/colt2013_dasgupta_trees/  
主讲教师: Sanjoy Dasgupta
开课单位: 加州大学圣地亚哥分校
开课时间: 2013-08-09
课程语种: 英语
中文简介:
K-D树是最早提出的最近邻搜索空间数据结构之一。在高维空间中,它的有效性降低了,但一些具有随机化和重叠细胞的变体在实践中证明是成功的。我们分析了三个这样的方案。我们证明,对于任何数据集和任何查询点,它们找不到最近邻的概率与捕获点配置困难的简单潜在函数直接相关。然后,我们在两种感兴趣的情况下绑定这个潜在函数:第一,当数据来自双重度量时;第二,当数据是来自主题模型的文档时。
课程简介: The k-d tree was one of the first spatial data structures proposed for nearest neighbor search. Its efficacy is diminished in high-dimensional spaces, but several variants, with randomization and overlapping cells, have proved to be successful in practice. We analyze three such schemes. We show that the probability that they fail to find the nearest neighbor, for any data set and any query point, is directly related to a simple potential function that captures the difficulty of the point configuration. We then bound this potential function in two situations of interest: the first, when data come from a doubling measure, and the second, when the data are documents from a topic model.
关 键 词: 计算机科学; 机器学习; 最近邻搜索
课程来源: 视频讲座网
最后编审: 2019-12-17:lxf
阅读次数: 19