0


网络时代的图像与视频标注

Image and Video Tagging in the Internet Era
课程网址: http://videolectures.net/s3mr2011_hua_tagging/  
主讲教师: Xian-Sheng Hua
开课单位: 微软公司
开课时间: 2011-07-18
课程语种: 英语
中文简介:

将视觉内容自动转换为文字描述一直是许多多媒体,计算机视觉和机器学习研究人员的梦想。近年来,对图像和视频标记进行了大量研究,可以认为它是实现这一宏伟目标的更现实的一步。特别是,互联网上媒体数据和媒体用户的爆炸式增长,以及用户之间,数据之间以及用户与数据之间的联系给我们带来了挑战和机遇。在本讲座中,我们将首先回顾过去十年中多媒体标记的发展,然后介绍基于学习的标记方法的最新发展,然后总结互联网环境中的手动标记方案,并介绍可缩放图像的互联网规模数据驱动方法和视频标记。最后,我们将讨论模型,数据和用户在多媒体标记系统中的作用,并研究一个可持续的生态系统,以在Internet环境中进行多媒体标记。我们还将讨论该领域中有希望的研发方向。

课程简介: Automatically converting visual content into textual description has long been a dream of a number of multimedia, computer vision and machine learning researchers. Image and video tagging, which has been studied heavily in recently years, can be regarded as a more realistic step to that ambitious goal. Especially, the explosion of media data and media users on the Internet, as well as the connections among users, among data and between users and data, bring us both challenges and opportunities. In this lecture, firstly we will review the evolution of multimedia tagging in the past decade, and then introduce state-of-the-art learning based tagging approaches, followed by summarizing manual tagging schemes on the Internet environment and presenting Internet-scale data-driven methods for scalable image and video tagging. Finally we will discuss the roles of models, data and users in multimedia tagging systems and study an sustainable ecosystem for multimedia tagging on the Internet environment. We will also discuss promising research and development directions in this area.
关 键 词: 机器学习; 计算机视觉
课程来源: 视频讲座网
数据采集: 2020-12-29:zyk
最后编审: 2021-01-08:yumf
阅读次数: 41