0


书籍自适应和书籍依赖模型加速早期音乐的数字化

Book-Adaptive and Book-Dependent Models to Accelerate Digitization of Early Music
课程网址: http://videolectures.net/mbc07_eck_bab/  
主讲教师: Douglas Eck
开课单位: 蒙特利尔大学
开课时间: 2008-02-01
课程语种: 英语
中文简介:
光学音乐识别(OMR)使早期音乐收藏能够大规模数字化。这种数字化项目的工作流程还包括扫描和预处理,但是用于纠正自动识别错误的专业人工成本主导了这两个步骤的成本。为了减少OMR过程中的识别错误数量,我们提出了一种创新的应用程序,用于隐藏马尔可夫模型(HMM)的最大后验(MAP)自适应,以构建书籍自适应模型,利用人类编辑生成的新学习数据工作,这是任何音乐数字化项目的一部分。我们还尝试使用生成的数据从头开始构建与书籍相关的模型,在有足够的校正数据可用后,这些模型有时优于书籍自适应模型。我们的实验表明,这些方法可以将人类编辑成本降低一半以上,并且它们特别适合于早期或降级文档等高度可变的来源。
课程简介: Optical music recognition (OMR) enables early music collections to be digitized on a large scale. The workflow for such digitisation projects also includes scanning and preprocessing, but the cost of expert human labour to correct automatic recognition errors dominates the cost of these other two steps. To reduce the number of recognition errors in the OMR process, we present an innovative application of maximum a posteriori (MAP) adaptation for hidden Markov models (HMMs) to build book-adaptive models, taking advantage of the new learning data generated from human editing work, which is part of any music digitization project. We also experimented with using the generated data to build book-dependent models from scratch, which sometimes outperform the book-adaptive models after enough corrected data is available. Our experiments show that these approaches can reduce human editing costs by more than half and that they are especially well suited to highly variable sources like early or degraded documents.
关 键 词: 光学音乐识别; 数字化; 马尔可夫模型
课程来源: 视频讲座网
最后编审: 2019-05-16:cjy
阅读次数: 36