语义搜索入门2009Introduction to Semantic Search 2009 |
|
课程网址: | http://videolectures.net/semsearch09_mika_intro/ |
主讲教师: | Peter Mika |
开课单位: | 雅虎公司 |
开课时间: | 2009-05-27 |
课程语种: | 英语 |
中文简介: | 近年来,我们在网络和企业规模上都看到了对搜索技术的巨大兴趣和大量经济利用。然而,现有搜索设备中的用户查询和资源内容的表示几乎完全通过简单的基于语法的资源内容描述和信息需求来实现,例如在主要的以关键字为中心的范例中(即与包匹配的关键字查询) - 单词文件表示)。另一方面,语义技术领域的最新进展已经产生了工具和标准,其允许以高度的表达性以正式方式阐明领域知识。同时,语义存储库和推理引擎现在才发展到一种状态,在这种情况下,查询和处理这些知识可以扩展到真实的IR场景。与这些发展并行的是,在过去几年中,我们也看到了将IR的思想与RDF / OWL数据,民俗分类,微格式集合或语义标记的自然文本中的搜索问题相适应的重要结果的出现。这些场景的共同点是搜索不是关注文档集合,而是关注元数据(可能链接或嵌入文本信息)。元数据存储中的搜索和排名是研讨会解决的另一个关键主题。因此,语义技术现在处于为IR问题提供重要贡献的状态。在这种情况下,语义搜索系统出现了一些挑战。其中包括:*如何利用语义技术来捕获用户的信息需求? *如何在不强制用户能够处理难以查询的语法的情况下,将用户的信息需求转换为富有表现力的正式查询? *如何从文档(用户)中提取(获取)富有表现力的资源描述? *如何有效地存储和查询富有表现力的资源描述? *如何处理模糊的信息需求和不完整的资源描述? *如何评估语义搜索系统并与标准IR系统进行比较? |
课程简介: | In recent years we have witnessed tremendous interest and substantial economic exploitation of search technologies, both at web and enterprise scale. However, the representation of user queries and resource content in existing search appliances is still almost exclusively achieved by simple syntax‐based descriptions of the resource content and the information need such as in the predominant keyword-centric paradigm (i.e. keyword queries matched against bag‐of‐words document representation). On the other hand, recent advances in the field of semantic technologies have resulted in tools and standards that allow for the articulation of domain knowledge in a formal manner at a high level of expressivity. At the same time, semantic repositories and reasoning engines have only now advanced to a state where querying and processing of this knowledge can scale to realistic IR scenarios. In parallel to these developments, in the past years we have also seen the emergence of important results in adapting ideas from IR to the problem of search in RDF/OWL data, folksonomies, microformat collections or semantically tagged natural text. Common to these scenarios is that the search is focused not on a document collection, but on metadata (which may be possibly linked to or embedded in textual information). Search and ranking in metadata stores is another key topic addressed by the workshop. As such, semantic technologies are now in a state to provide significant contributions to IR problems. In this context, several challenges arise for Semantic Search systems. These include, among others: * How can semantic technologies be exploited to capture the information need of the user? * How can the information need of the user be translated to expressive formal queries without enforcing the user to be capable of handling the difficult query syntax? * How can expressive resource descriptions be extracted (acquired) from documents (users)? * How can expressive resource descriptions be stored and queried efficiently on a large scale? * How can vague information needs and incomplete resource descriptions be handled? * How can semantic search systems be evaluated and compared with standard IR systems? |
关 键 词: | 语义库; 搜索引擎; 语义技术 |
课程来源: | 视频讲座网 |
最后编审: | 2020-06-29:wuyq |
阅读次数: | 38 |