用于语音工具和服务的波兰语阅读语音语料库Polish Read Speech Corpus for Speech Tools and Services |
|
课程网址: | http://videolectures.net/clarinannualconference2016_korzinek_spee... |
主讲教师: | Danijel Koržinek |
开课单位: | 波兰-日本信息技术学院 |
开课时间: | 2017-08-21 |
课程语种: | 英语 |
中文简介: | 本文描述了CLARIN项目波兰财团开展的语音处理活动。该项目这一部分的目的是开发特定的工具,允许自动和半自动处理大量声学语音数据。这些工具包括以下几个方面:字形到音素转换、语音到文本对齐、语音活动检测、说话人日记、关键字识别和自动语音音译。此外,为了开发这些工具,在开放许可下录制并发布了一个大型高质量录音室语音语料库,以鼓励波兰语音研究领域的发展。所有工具和资源都发布在波兰CLARIN网站上。本文讨论了该项目目前的局限性和未来的计划。 |
课程简介: | This paper describes the speech processing activities conducted at the Polish consortium of the CLARIN project. The purpose of this segment of the project was to develop specific tools that would allow for automatic and semi-automatic processing of large quantities of acoustic speech data. The tools include the following: grapheme-to-phoneme conversion, speech-to-text alignment, voice activity detection, speaker diarization, keyword spotting and automatic speech transliteration. Furthermore, in order to develop these tools, a large high-quality studio speech corpus was recorded and released under an open license, to encourage development in the area of Polish speech research. All the tools and resources were released on the the Polish CLARIN website. This paper discusses the current limitations and future plans of the project. |
关 键 词: | 语音处理; 声学语音数据; 语音语料库 |
课程来源: | 视频讲座网 |
数据采集: | 2022-02-14:zkj |
最后编审: | 2022-02-14:zkj |
阅读次数: | 71 |