显示、出席和讲述：具有视觉注意力的神经图像标题生成][Show, Attend and Tell: Neural Image Caption Generation with Visual Attention]_MOOC(慕课)境外开放课程

首页 → 计算机科学技术
首页 → 计算机应用

显示、出席和讲述：具有视觉注意力的神经图像标题生成 Show, Attend and Tell: Neural Image Caption Generation with Visual Attention


课程网址:	http://videolectures.net/icml2015_xu_visual_attention/
主讲教师:	Kelvin Xu
开课单位:	蒙特利尔大学
开课时间:	2015-12-05
课程语种:	英语
中文简介:	受机器翻译和对象检测领域最近工作的启发，我们介绍了一种基于注意力的模型，该模型可以自动学习描述图像的内容。我们描述了如何使用标准反向传播技术以确定性的方式训练该模型，并通过最大化变分下界来随机训练。我们还通过可视化展示了模型如何能够自动学习将目光固定在显著物体上，同时在输出序列中生成相应的单词。我们在三个基准数据集上以最先进的性能验证了注意力的使用：Flickr8k、Flickr30k和MS COCO。
课程简介:	Inspired by recent work in machine translation and object detection, we introduce an attention based model that automatically learns to describe the content of images. We describe how we can train this model in a deterministic manner using standard backpropagation techniques and stochastically by maximizing a variational lower bound. We also show through visualization how the model is able to automatically learn to fix its gaze on salient objects while generating the corresponding words in the output sequence. We validate the use of attention with state-of-the-art performance on three benchmark datasets: Flickr8k, Flickr30k and MS COCO.
关键词:	视觉注意; 神经图像; 标题生成
课程来源:	视频讲座网
数据采集:	2023-06-08：chenxin01
最后编审:	2023-06-08：chenxin01
阅读次数:	42

服务热线：0574-88229129
电子邮件：info_lib@nbt.edu.cn
信息服务：图书馆305室
系统研发：图书馆303室

图书馆学生服务群：437507696
图书馆教工服务群：1038697975
QQ在线咨询
2013-2025 © 浙大宁波理工学院图书馆