0


图像与语言的顺序嵌入

Order-Embeddings of Images and Language
课程网址: http://videolectures.net/iclr2016_vendrov_order_embeddings/  
主讲教师: Ivan Vendrov
开课单位: 多伦多大学
开课时间: 2016-05-27
课程语种: 英语
中文简介:
多义修辞、文本蕴涵和图像字幕可以被视为单词、句子和图像上单一视觉语义层次结构的特殊情况。在本文中,我们提倡显式地建模该层次结构的偏序结构。为了实现这一目标,我们介绍了一种学习有序表示的通用方法,并展示了它如何应用于涉及图像和语言的各种任务。我们表明,与当前的超词预测和图像标题检索方法相比,所产生的表示法提高了性能。
课程简介: Hypernymy, textual entailment, and image captioning can be seen as special cases of a single visual-semantic hierarchy over words, sentences, and images. In this paper we advocate for explicitly modeling the partial order structure of this hierarchy. Towards this goal, we introduce a general method for learning ordered representations, and show how it can be applied to a variety of tasks involving images and language. We show that the resulting representations improve performance over current approaches for hypernym prediction and image-caption retrieval.
关 键 词: 语义层次; 图像语言; 顺序结构
课程来源: 视频讲座网
数据采集: 2023-04-16:chenxin01
最后编审: 2023-05-21:chenxin01
阅读次数: 26