0


面向动态大规模视频搜索

Towards On-the-fly Large Scale Video Search
课程网址: http://videolectures.net/bmvc2013_zisserman_video_search/  
主讲教师: Andrew Zisserman
开课单位: 牛津大学
开课时间: 2014-07-30
课程语种: 英语
中文简介:

我们希望能够在图像或视频数据集中找到任何内容。演讲将描述我们在视觉搜索中寻找大规模视频数据集中的人,特定对象和类别的过程。新颖之处在于,可以在运行时通过文本查询指定感兴趣的项目,然后使用从Google图像搜索中下载的图像即时学习该项目的判别式分类器。我们将比较该问题的最新编码方法,并讨论在实现这种实时系统的三个重要性能指标之间进行最佳权衡的选择:(i)准确性,(ii)内存占用量和(iii)速度。我们还将描述实现“全面召回”的步骤。 BBC广播的大规模视频数据集将进行演示。这是与Relja Arandjelovic,Ken Chatfield和Omkar Parkhi的共同合作。

课程简介: We would like to be able to find anything in an image or video dataset. The talk will describe our progress on visual search for finding people, specific objects and categories in large scale video datasets. The novelty is that the item of interest can be specified at run time by a text query, and a discriminative classifier for that item is then learnt on-the-fly using images downloaded from Google Image search. We will compare state of the art encoding methods for the problem, and discuss the choices in achieving the best trade-off between three important performance measures for a realtime system of this kind, namely: (i) accuracy, (ii) memory footprint, and (iii) speed. We will also describe steps to achieving `total recall'. There will be demonstrations on a large scale video dataset of BBC broadcasts. This is joint work with Relja Arandjelovic, Ken Chatfield and Omkar Parkhi.
关 键 词: 数据集中; 图像学习
课程来源: 视频讲座网
数据采集: 2020-11-15:zyk
最后编审: 2020-11-15:zyk
阅读次数: 25