Tagging with Queries: How and Why?
Course URL: http://videolectures.net/wsdm09_antonellis_twq/
Lecturers: Ioannis Antonellis; Jawed Karim; Hector Garcia-Molina
Institution: Stanford University
Date: 2009-03-12
Language: English
Abstract: Web search queries capture the information needs of search engine users. Search engines store these queries in their logs and analyze them to guide their search results. In this work, we argue that web users, and not only search engines, can benefit from the data stored in these logs. We first show how clickthrough logs can be collected in a distributed fashion using the HTTP Referer field in web server access logs. We then perform a set of experiments to study the information value of search engine queries when treated as "tags" or "labels" for the web pages that both appear as results and are actually clicked on by users. We ask how much extra information these query tags provide for web pages by comparing them to tags from the del.icio.us bookmarking site and to the page text. We find that, for a large fraction of the Web, query tags are both numerous (on average 250 tags per URL) and new (on average 125 tags per URL are not present in the page text).
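The collection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes access logs in the common Apache/nginx "combined" format, assumes the search engine passes the query in a URL parameter such as `q`, and splits each query into lowercase words to use as tags. The names `collect_query_tags`, `query_from_referer`, `new_tags`, and the sample log lines are all hypothetical.

```python
import re
from collections import defaultdict
from urllib.parse import urlsplit, parse_qs

# Assumption: "combined" log format, where the request line and the
# referer are double-quoted fields separated by status and size.
LOG_PATTERN = re.compile(
    r'"(?:GET|POST|HEAD) (?P<path>\S+) [^"]*" \d{3} \S+ "(?P<referer>[^"]*)"'
)

# Assumption: URL parameters in which search engines carry the query.
QUERY_PARAMS = ("q", "p", "query")


def query_from_referer(referer):
    """Return the search query carried in a referer URL, or None."""
    params = parse_qs(urlsplit(referer).query)
    for name in QUERY_PARAMS:
        if name in params and params[name][0].strip():
            return params[name][0].strip().lower()
    return None


def collect_query_tags(log_lines):
    """Map each requested path to the set of query words that led to it."""
    tags = defaultdict(set)
    for line in log_lines:
        match = LOG_PATTERN.search(line)
        if not match:
            continue  # line is not in the assumed format
        query = query_from_referer(match.group("referer"))
        if query:
            tags[match.group("path")].update(query.split())
    return dict(tags)


def new_tags(query_tags, page_text):
    """Return the query tags that do not occur as words in the page text."""
    words = set(re.findall(r"\w+", page_text.lower()))
    return {tag for tag in query_tags if tag not in words}


# Hypothetical log lines: one visit from a search result, one from a plain link.
SAMPLE_LOG = [
    '203.0.113.5 - - [12/Mar/2009:10:00:00 +0000] "GET /jazz.html HTTP/1.1" '
    '200 5120 "http://www.google.com/search?q=miles+davis+albums" "Mozilla/5.0"',
    '203.0.113.9 - - [12/Mar/2009:10:05:00 +0000] "GET /jazz.html HTTP/1.1" '
    '200 5120 "http://www.example.org/links.html" "Mozilla/5.0"',
]

tags = collect_query_tags(SAMPLE_LOG)
print(sorted(tags["/jazz.html"]))  # ['albums', 'davis', 'miles']
print(new_tags(tags["/jazz.html"], "Miles Davis was a jazz trumpeter."))  # {'albums'}
```

Because each site owner can run this over their own server logs, no access to the search engine's internal logs is needed, which is the sense in which the collection is distributed. Treating individual words rather than whole queries as tags is a simplification chosen for this sketch.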
Keywords: computer science; Semantic Web; annotation; tags
Source: VideoLectures.NET
Last reviewed: 2019-10-24:lxf
Views: 43