0


Im2Text:使用100万张带标题的照片描述图像

Im2Text: Describing Images Using 1 Million Captioned Photographs
课程网址: http://videolectures.net/nips2011_ordonez_captioned/  
主讲教师: Vicente Ordonez
开课单位: 石溪大学
开课时间: 2012-09-06
课程语种: 英语
中文简介:
我们使用大型字幕照片集开发和演示自动图像描述方法。一个贡献是我们的技术,用于自动收集执行大量Flickr查询的新数据集,然后将噪声结果过滤到具有相关视觉相关字幕的100万个图像。这样的集合允许我们使用相对简单的非参数方法来处理极其具有挑战性的描述生成问题,并产生令人惊讶的有效结果。我们还开发了包含许多现有技术的方法,但是相当嘈杂,估计图像内容以产生更令人满意的结果。最后,我们为图像字幕引入了一种新的客观性能度量。
课程简介: We develop and demonstrate automatic image description methods using a large captioned photo collection. One contribution is our technique for the automatic collection of this new dataset - performing a huge number of Flickr queries and then filtering the noisy results down to 1 million images with associated visually relevant captions. Such a collection allows us to approach the extremely challenging problem of description generation using relatively simple non-parametric methods and produces surprisingly effective results. We also develop methods incorporating many state of the art, but fairly noisy, estimates of image content to produce even more pleasing results. Finally we introduce a new objective performance measure for image captioning.
关 键 词: 字幕照片集; 非参数方法; 图像字幕
课程来源: 视频讲座网
最后编审: 2019-09-06:lxf
阅读次数: 98