0


将IPTC新闻架构引入语义Web

Bringing the IPTC News Architecture into the Semantic Web
课程网址: http://videolectures.net/iswc08_troncy_biptc/  
主讲教师: Raphaël Troncy
开课单位: EURECOM
开课时间: 2008-11-24
课程语种: 英语
中文简介:
为了简化新闻交流,国际新闻通信委员会(IPTC)开发了NewsML架构(NAR),这是一种基于XML的模型,专门用于多种语言,如NewsML G2和EventsML G2。作为该体系结构的一部分,特定的受控词汇表(如IPTC新闻代码)与其他行业标准词汇表一起用于对新闻项目进行分类。虽然新闻仍然主要以文本为基础的故事形式出现,但这些故事通常以图形、图像和视频的形式出现。媒体特定的元数据格式,如EXIF、DIG35和XMP,用于描述媒体。在单个生产过程中使用不同的元数据格式会导致新闻生产链本身的互操作性问题。它还排除了与现有网络知识资源的链接,并阻碍了为搜索和浏览新闻内容构建统一的最终用户界面。为了允许这些不同的元数据标准在单个信息环境中互操作,我们为IPTC新闻架构设计了OWL本体,并与其他多媒体元数据标准相链接。我们将IPTC新闻代码转换为SKOS同义词库,并演示如何使用自然语言处理和多媒体分析丰富新闻元数据,并将其与语义Web上已正式形成的现有知识集成。我们讨论了用于开发本体的方法,并为我们的设计决策提供了理论依据。我们提供了将模式重新设计为本体并形式化其隐含语义的指南。为了证明我们的本体基础设施的适当性,我们提供了一个用于搜索和浏览新闻项的探索性环境。
课程简介: For easing the exchange of news, the International Press Telecommunication Council (IPTC) has developed the NewsML Architecture (NAR), an XML-based model that is specialized into a number of languages such as NewsML G2 and EventsML G2. As part of this architecture, specific controlled vocabularies, such as the IPTC News Codes, are used to categorize news items together with other industry-standard thesauri. While news is still mainly in the form of text-based stories, these are often illustrated with graphics, images and videos. Media-specific metadata formats, such as EXIF, DIG35 and XMP, are used to describe the media. The use of different metadata formats in a single production process leads to interoperability problems within the news production chain itself. It also excludes linking to existing web knowledge resources and impedes the construction of uniform end-user interfaces for searching and browsing news content. In order to allow these different metadata standards to interoperate within a single information environment, we design an OWL ontology for the IPTC News Architecture, linked with other multimedia metadata standards. We convert the IPTC NewsCodes into a SKOS thesaurus and we demonstrate how the news metadata can then be enriched using natural language processing and multimedia analysis and integrated with existing knowledge already formalized on the Semantic Web. We discuss the method we used for developing the ontology and give rationale for our design decisions. We provide guidelines for re-engineering schemas into ontologies and formalize their implicit semantics. In order to demonstrate the appropriateness of our ontology infrastructure, we present an exploratory environment for searching and browsing news items.
关 键 词: 故事形式; 生产过程; 信息环境
课程来源: 视频讲座网
数据采集: 2023-03-15:chenjy
最后编审: 2023-03-15:chenjy
阅读次数: 26