0


在Facebook处理结构化和非结构化数据

Dealing with structured and unstructured data at Facebook
课程网址: http://videolectures.net/eswc2011_backstrom_facebook/  
主讲教师: Lars Backstrom, Marko Grobelnik
开课单位: 约瑟夫·斯特凡学院
开课时间: 2011-07-07
课程语种: 英语
中文简介:
Facebook在过去五年中经历了巨大的增长。在这里,我们将首先看一下伴随这种增长的一些基本统计数据和趋势。然后我们将深入探讨两个不同的主题。首先,我们将看一下在Facebook上使数据更加结构化的总体趋势。拥有更多结构化数据可以更轻松地管理,理解和利用它。我将简要讨论为实现Facebook每天进行的大规模数据分析而构建的工具(Hive)。在演讲的第二部分,我将深入探讨有助于Facebook发展的系统之一的细节:你可能知道的人。该系统在Facebook上生成大量的朋友关系,并且通过使用越来越复杂的机器学习技术,我们已经能够对系统自其最初发布以来使用的排名进行大幅改进。
课程简介: Facebook has undergone tremendous growth in the last five years. Here we will start by looking at some basic statistics and trends that have accompanied this growth. We'll then dive into two different topics. First, we will look at a general trend to make data more structured at Facebook. Having more structured data makes it easier to manage, understand, and leverage it. I will briefly discuss the tools (Hive) that have been built to enable the massive-scale data analysis that goes on at Facebook on a daily basis. In the second part of the talk, I will dive into the details of one of the systems that has contributed to the growth of Facebook: People You May Know. This system generates a significant number of the friend connections on Facebook, and by using increasingly sophisticated machine learning techniques, we have been able to make large improvements to the ranking used by the system since its original launch.
关 键 词: 基本统计数据; 总体趋势; 管理
课程来源: 视频讲座网
最后编审: 2020-07-25:csy
阅读次数: 77