Truth Discovery with Multiple Conflicting Information Providers on the Web
Course URL: http://videolectures.net/kdd07_yin_tdwmci/
Lecturer: Xiaoxin Yin
Institution: Microsoft Research
Date: 2007-08-14
Language: English
Course description: The world-wide web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the web. Moreover, different web sites often provide conflicting information on a subject, such as different specifications for the same product. In this paper we propose a new problem called Veracity, i.e., conformity to truth, which studies how to find true facts from a large amount of conflicting information on many subjects that is provided by various web sites. We design a general framework for the Veracity problem, and invent an algorithm called TruthFinder, which utilizes the relationships between web sites and their information, i.e., a web site is trustworthy if it provides many pieces of true information, and a piece of information is likely to be true if it is provided by many trustworthy web sites. Our experiments show that TruthFinder successfully finds true facts among conflicting information, and identifies trustworthy web sites better than popular search engines.
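The mutual dependence described above (a site is trustworthy if its facts are true, and a fact is likely true if trustworthy sites provide it) can be illustrated with a small fixed-point iteration. The sketch below is a simplified illustration of that idea only, not the authors' TruthFinder formulation: it ignores interactions between conflicting facts, and the names `estimate_trust`, `claims`, `initial_trust`, and `iterations` are hypothetical.

```python
from collections import defaultdict


def estimate_trust(claims, initial_trust=0.9, iterations=20):
    """Simplified mutual-reinforcement sketch (assumption, not the paper's exact method).

    claims: dict mapping a site name to the set of facts it provides.
    Returns (trust per site, confidence per fact), both in [0, 1].
    """
    # Every site starts with the same assumed trustworthiness.
    trust = {site: initial_trust for site in claims}

    # Invert the mapping: which sites provide each fact?
    providers = defaultdict(set)
    for site, facts in claims.items():
        for fact in facts:
            providers[fact].add(site)

    confidence = {}
    for _ in range(iterations):
        # A fact is wrong only if every site providing it is wrong,
        # treating sites as independent (a simplifying assumption).
        for fact, sites in providers.items():
            p_all_wrong = 1.0
            for site in sites:
                p_all_wrong *= 1.0 - trust[site]
            confidence[fact] = 1.0 - p_all_wrong

        # A site's trust is the average confidence of the facts it provides.
        for site, facts in claims.items():
            if facts:
                trust[site] = sum(confidence[f] for f in facts) / len(facts)

    return trust, confidence


# Illustrative data: three hypothetical sites disagree on one product weight.
example_claims = {
    "site_a": {("laptop_weight", "1.2 kg")},
    "site_b": {("laptop_weight", "1.2 kg")},
    "site_c": {("laptop_weight", "1.5 kg")},
}
trust, confidence = estimate_trust(example_claims)
for fact in sorted(confidence, key=confidence.get, reverse=True):
    print(fact, round(confidence[fact], 4))
```

In this toy data the value backed by two sites ends up with higher confidence than the value backed by one, and the two agreeing sites end up with higher trust. A fuller treatment would also need to lower the confidence of a fact when conflicting facts are well supported, which this sketch does not attempt.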
Keywords: web information; true information; correct information
Source: VideoLectures.NET
Data collected: 2023-04-03:chenxin01
Last reviewed: 2023-05-22:chenxin01