0


图像和视频的标注在互联网时代

Image and Video Tagging in the Internet Era
课程网址: http://videolectures.net/s3mr2011_hua_tagging/  
主讲教师: Xian-Sheng Hua
开课单位: 微软公司
开课时间: 2011-07-18
课程语种: 英语
中文简介:
自动将视觉内容转换为文本描述长期以来一直是许多多媒体,计算机视觉和机器学习研究人员的梦想。最近几年进行了大量研究的图像和视频标记可以被视为实现这一宏伟目标的更现实的步骤。特别是,互联网上媒体数据和媒体用户的爆炸性增长,以及用户之间,数据之间以及用户与数据之间的联系,给我们带来了挑战和机遇。在本讲座中,我们将首先回顾过去十年中多媒体标签的演变,然后介绍最先进的基于学习的标记方法,然后总结互联网环境中的手动标记方案并呈现互联网规模数据 - 可扩展图像和视频标记的驱动方法。最后,我们将讨论模型,数据和用户在多媒体标签系统中的作用,并研究可持续的生态系统,以便在互联网环境中进行多媒体标记。我们还将讨论该领域有前景的研究和发展方向。
课程简介: Automatically converting visual content into textual description has long been a dream of a number of multimedia, computer vision and machine learning researchers. Image and video tagging, which has been studied heavily in recently years, can be regarded as a more realistic step to that ambitious goal. Especially, the explosion of media data and media users on the Internet, as well as the connections among users, among data and between users and data, bring us both challenges and opportunities. In this lecture, firstly we will review the evolution of multimedia tagging in the past decade, and then introduce state-of-the-art learning based tagging approaches, followed by summarizing manual tagging schemes on the Internet environment and presenting Internet-scale data-driven methods for scalable image and video tagging. Finally we will discuss the roles of models, data and users in multimedia tagging systems and study an sustainable ecosystem for multimedia tagging on the Internet environment. We will also discuss promising research and development directions in this area.
关 键 词: 视觉内容; 文本自动转换; 计算机视觉; 机器学习研究
课程来源: 视频讲座网
最后编审: 2021-06-27:zyk
阅读次数: 74