0


用于多模式交互的动态贝叶斯网络

Dynamic Bayesian Networks for Multimodal Interaction
课程网址: http://videolectures.net/mlmi04uk_jebara_dbnmi/  
主讲教师: Tony Jebara
开课单位: 哥伦比亚大学
开课时间: 2007-02-25
课程语种: 英语
中文简介:
动态贝叶斯网络(DBN)提供了超越经典隐马尔可夫模型的自然升级路径,并且当时态数据包含更高阶结构,多模态或多人交互时变得特别相关。我们描述了动态贝叶斯网络的几个实例,这些实例可用于对单个,双人和多人活动中的音频,视频和触觉频道的时间现象进行建模。这些模型包括输入输出隐马尔可夫模型,开关卡尔曼滤波器,以及最常见的动态系统树(DST)。这些模型用于学习社交活动中的音频视频交互,多人游戏中的视频交互以及机器人腹腔镜检查中的触觉视频交互。使用广义期望最大化方法从无监督设置中的数据估计模型参数。随后,这些模型可以预测,综合和分类各种类型的丰富的多模式人类活动。示出了手势交互,音频视频对话,足球比赛和手术训练评估的实验。
课程简介: Dynamic Bayesian networks (DBNs) offer a natural upgrade path beyond classical hidden Markov models and become especially relevant when temporal data contains higher order structure, multiple modalities or multi-person interaction. We describe several instantiations of dynamic Bayesian networks that are useful for modeling temporal phenomena spanning audio, video and haptic channels in single, two-person and multi-person activity. These models include input-output hidden Markov models, switched Kalman filters and, most generally, dynamical systems trees (DSTs). These models are used to learn audio-video interaction in social activities, video interaction in multi-person game playing and haptic-video interaction in robotic laparoscopy. Model parameters are estimated from data in an unsupervised setting using generalized expectation maximization methods. Subsequently, these models can predict, synthesize and classify various types of rich multimodal human activity. Experiments in gesture interaction, audio-video conversation, football game playing and surgical drill evaluation are shown.
关 键 词: 动态贝叶斯网络; 隐马尔可夫模型; 音频视频交互
课程来源: 视频讲座网
最后编审: 2019-06-30:yuh
阅读次数: 57