Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room |
|
Course URL: | http://videolectures.net/mlmi04uk_chu_asrsa/ |
Lecturer: | Stephen M. Chu |
Institution: | IBM |
Date: | 2007-02-25 |
Language: | English |
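Abstract (translated from Chinese): | An important step toward deploying speech technology widely as a functional component of human-machine interfaces is to free users from close-talking or desktop microphones and to enable far-field operation in a variety of natural communication environments. In this work, we consider far-field automatic speech recognition and speech activity detection in conference rooms. The experiments are conducted on the smart room platform provided by the CHIL project. **The first half of the paper** discusses the development of speech recognition systems for the seminar transcription task. In particular, we study the effect of combining parallel recognizers in single-channel and multi-channel settings. **In the second half of the paper**, we describe a speech activity detection algorithm based on fusing phonetic likelihood scores and energy features. The results show that the proposed technique can handle non-stationary noise events and achieves good performance on the CHIL seminar corpus. |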
Abstract (original English): | An important step to bring speech technologies into wide deployment as a functional component in man-machine interfaces is to free the users from close-talk or desktop microphones, and enable far-field operation in various natural communication environments. In this work, we consider far-field automatic speech recognition and speech activity detection in conference rooms. The experiments are conducted on the smart room platform provided by the CHIL project. **The first half** of the paper addresses the development of speech recognition systems for the seminar transcription task. In particular, we look into the effect of combining parallel recognizers in both single-channel and multi-channel settings. **In the second half** of the paper, we describe a novel algorithm for speech activity detection based on fusing phonetic likelihood scores and energy features. It is shown that the proposed technique is able to handle non-stationary noise events and achieves good performance on the CHIL seminar corpus. |
Keywords: | speech technology; human-machine interface; communication environment |
Source: | VideoLectures.NET |
Last reviewed: | 2020-04-30:chenxin |
Views: | 83 |