0


用于挖掘社交网络的可扩展协同过滤算法

Scalable Collaborative Filtering Algorithms for Mining Social Networks
课程网址: http://videolectures.net/bsciw08_chang_scfamsn/  
主讲教师: Edward Chang
开课单位: 谷歌公司
开课时间: 2008-12-20
课程语种: 英语
中文简介:
Orkut、MySpace、Hi5和Facebook等社交网站每天吸引了数十亿的访问量,超过了网络搜索的页面访问量。这些社交网站为个人提供建立社区、上传和共享文档/照片/视频以及与其他用户交互的应用程序。以Orkut为例。Orkut拥有数百万个社区,每小时创建数百个社区,上传数万个博客/照片。为了帮助用户查找相关信息,必须提供有效的协作过滤工具来执行推荐,如好友、社区和广告匹配。在这篇演讲中,我将首先描述上述信息爆炸给传统协同过滤算法带来的计算和存储挑战。处理巨大的社会图,不断扩大,应该设计一个有效的算法1)上运行成千上万的并行机器共享存储和加速计算,2)执行增量再培训和更新为实现在线性能,和3)融合信息的多个源减轻信息稀疏。在演讲的第二部分,我将介绍我们最近开发的算法,包括并行谱聚类[1],并行PF-Growth[2],并行组合协同过滤[3],并行LDA,并行谱聚类,并行支持向量机[4]。
课程简介: Social networking sites such as Orkut, MySpace, Hi5, and Facebook attract billions of visits a day, surpassing the page views of Web Search. These social networking sites provide applications for individuals to establish communities, to upload and share documents/photos/videos, and to interact with other users. Take Orkut as an example. Orkut hosts millions of communities, with hundreds of communities created and tens of thousands of blogs/photos uploaded each hour. To assist users to find relevant information, it is essential to provide effective collaborative filtering tools to perform recommendations such as friend, community, and ads matching. In this talk, I will first describe both computational and storage challenges to traditional collaborative filtering algorithms brought by aforementioned information explosion. To deal with huge social graphs that expand continuously, an effective algorithm should be designed to 1) run on thousands of parallel machines for sharing storage and speeding up computation, 2) perform incremental retraining and updates for attaining online performance, and 3) fuse information of multiple sources for alleviating information sparseness. In the second part of the talk, I will present algorithms we recently developed including parallel Spectral Clustering [1], parallel PF-Growth [2], parallel combinational collaborative filtering [3], parallel LDA, parallel spectral clustering, and parallel Support Vector Machines [4].
关 键 词: 挖掘社交网络; 可扩展性协同算法
课程来源: 视频讲座网
最后编审: 2019-10-28:lxf
阅读次数: 30