0


社交媒体中的IR(IRSM)

IR in Social Media (IRSM)
课程网址: http://videolectures.net/russir08_maykov_irsm/  
主讲教师: Alexey Maykov, Matthew Hurst
开课单位: 微软公司
开课时间: 2008-11-04
课程语种: 英语
中文简介:
我们将社交媒体定义为Web上的用户生成内容。 社交媒体包括但不限于:博客,usenet,论坛。 教程的第一部分非常技术性和实践性。 我们将通过博客,微博,usenet展示数据采集的具体细节。 我们将展示现有的数据集并展示如何使用它们。 在第二部分中,我们将讨论使用获得的数据的具体细节。 我们将介绍关键字提取和其他数据挖掘技术。 垃圾邮件已成为互联网用户的主要问题,涵盖网络搜索以及通信的大多数方面,包括电子邮件,即时消息,论坛。 最近博客的流行推动了博客垃圾邮件的激增,其中包括splog,垃圾评论,引用垃圾邮件和ping垃圾邮件等多种版本。 在本次演讲中,我们将讨论在博客媒体中与其他类型的垃圾邮件打击垃圾邮件的差异和共性。 该论述将得到基于实际数据的结果和示例的支持
课程简介: We define Social Media as a user-generated content on a Web. Social Media includes but not limited to: blogs, usenet, forums. The first part of a tutorial is pretty technical and hands-on. We will show specifics of a data acquisition from blogs, microblogs, usenet. We will present our existing data sets and show how to use them. In the second part we will talk about specifics of using obtained data. We will cover keyword extraction and other data mining techniques. Spam has become a major problem for Internet users and covers web search as well as most aspects of communication including email, IM, discussion forums. The recent popularity of blogging has spurned a surge in blog spam, with many flavors including splogs, comment spam, trackback spam and ping spam. In this talk we will discuss the differences and commonalities of combating spam in the blog medium vs. other types of spam. The exposition will be supported by results and examples based on real data.
关 键 词: 社交媒体; 数据挖掘技术; 垃圾邮件
课程来源: 视频讲座网
最后编审: 2020-04-13:chenxin
阅读次数: 54