首页 → 自然科学
首页 → 数学
首页 → 工程与技术科学
首页 → 数学
首页 → 工程与技术科学
AMI会议录音转录制度的发展The Development of the AMI System for the Transcription of Speech in Meetings |
|
| 课程网址: | http://videolectures.net/mlmi04uk_hain_dasts/ |
| 主讲教师: | Thomas Hain |
| 开课单位: | 谢菲尔德大学 |
| 开课时间: | 2007-02-25 |
| 课程语种: | 英语 |
| 中文简介: | 在会议式会议中收集的语音的自动处理引起了人们对该领域的几个大型项目的极大兴趣。本文描述了在AMI(增强型多方互动)项目的背景下开发会议的基线自动语音转录系统。我们提出了几种对处理这些数据很重要的技术,并以字误码率(WER)表示性能。转录此数据的一个重要方面是音频预处理方面的必要灵活性。真实世界的系统必须处理灵活的输入,例如通过在房间中使用麦克风阵列或随机放置的麦克风。描述了自动分割和麦克风阵列处理技术,并讨论了对WER的影响。本文介绍的系统及其组件产生了竞争性的表现,并为该领域的未来研究奠定了基础。 |
| 课程简介: | The automatic processing of speech collected in conference style meetings has attracted considerable interest with several large scale projects devoted to this area. This paper describes the development of a baseline automatic speech transcription system for meetings in the context of the AMI (Augmented Multiparty Interaction) project. We present several techniques important to processing of this data and show the performance in terms of word error rates (WERs). An important aspect of transcription of this data is the necessary flexibility in terms of audio pre-processing. Real world systems have to deal with flexible input, for example by using microphone arrays or randomly placed microphones in a room. Automatic segmentation and microphone array processing techniques are described and the effect on WERs is discussed. The system and its components presented in this paper yield compettive performance and form a baseline for future research in this domain. |
| 关 键 词: | 自动语音转录系统; 数据; 灵活性 |
| 课程来源: | 视频讲座网 |
| 最后编审: | 2019-06-30:yuh |
| 阅读次数: | 113 |
