Multimodal Integration for Meeting Group Action Segmentation and Recognition
Course URL: http://videolectures.net/mlmi04uk_hames_mimga/
Lecturer: Marc Al Hames
Institution: Technical University of Munich
Date: 2007-02-25
Language: English
Course description: We address the problem of segmentation and recognition of sequences of multimodal human interactions in meetings. These interactions can be seen as a rough structure of a meeting, and can be used either as input for a meeting browser or as a first step towards a higher semantic analysis of the meeting. A common lexicon of multimodal group meeting actions, a shared meeting data set, and a common evaluation procedure enable us to compare the different approaches. We compare three different multimodal feature sets and four modelling infrastructures: a higher semantic feature approach, multi-layer HMMs, a multi-stream DBN, and a multi-stream mixed-state DBN for disturbed data.
Keywords: multimodal human interaction sequences; rough structure; modelling infrastructures
Source: VideoLectures.NET
Last reviewed: 2019-06-30:yuh