

What/When Causal Expectation Modelling in Monophonic Pitched and Percussive Audio
Course URL: http://videolectures.net/mbc07_hazan_wwc/
Lecturer: Amaury Hazan
Institution: Pompeu Fabra University
Date: 2008-02-01
Language: English
Course description: A causal system for representing a musical stream and generating further expected events is presented. Starting from an auditory front-end that extracts low-level features (e.g. spectral shape, MFCC, pitch) and mid-level features such as onsets and beats, an unsupervised clustering process builds and maintains a set of symbols that represent musical stream events using both timbre and time descriptions. Time events are represented as inter-onset intervals relative to the beats. These symbols are then processed by an expectation module based on Prediction by Partial Match (PPM), a multiscale technique based on N-grams. To characterise the system's capacity to generate an expectation that matches its transcription, we use a weighted average F-measure, which takes into account the uncertainty associated with the unsupervised encoding of the musical sequence. The potential of the system is demonstrated on audio streams that contain drum loops or monophonic singing voice. In preliminary experiments, we show that the induced representation is useful for generating expectation patterns in a causal way. During exposure, we observe a globally decreasing prediction entropy combined with structure-specific variations.
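Keywords: musical stream; expected events; clustering process; musical sequence encoding; monophonic voice
Source: VideoLectures.NET
Last reviewed: 2020-05-29, 吴雨秋 (volunteer course editor)
Views: 44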
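
The description above mentions an N-gram-based partial-match expectation module and a prediction entropy that decreases during exposure. The following is a minimal sketch of that idea, not the lecturer's implementation: it assumes the front-end has already produced symbols as hypothetical (timbre cluster, inter-onset interval in beats) pairs, and it replaces full partial-match prediction with a plain back-off to the longest context seen so far (no escape probabilities). The names `BackoffNGram`, `entropy` and `run_causal` are illustrative only.

```python
import math
from collections import Counter, defaultdict


class BackoffNGram:
    """Simplified multi-order N-gram predictor (assumption: plain back-off,
    without the escape probabilities a full partial-match scheme would use)."""

    def __init__(self, max_order=2):
        self.max_order = max_order
        # counts[context tuple] -> Counter of symbols observed after that context
        self.counts = defaultdict(Counter)

    def update(self, history, symbol):
        """Record `symbol` under every context length from 0 to max_order."""
        for k in range(self.max_order + 1):
            if k <= len(history):
                context = tuple(history[len(history) - k:])
                self.counts[context][symbol] += 1

    def predict(self, history):
        """Return the next-symbol distribution from the longest seen context."""
        for k in range(min(self.max_order, len(history)), -1, -1):
            context = tuple(history[len(history) - k:])
            seen = self.counts.get(context)
            if seen:
                total = sum(seen.values())
                return {s: n / total for s, n in seen.items()}
        return {}  # nothing observed yet


def entropy(dist):
    """Shannon entropy of a distribution, in bits."""
    return sum(-p * math.log2(p) for p in dist.values() if p > 0)


def run_causal(sequence, max_order=2):
    """Predict each symbol from past symbols only, then learn from it."""
    model = BackoffNGram(max_order)
    history, entropies = [], []
    for symbol in sequence:
        dist = model.predict(history)
        entropies.append(entropy(dist) if dist else None)
        model.update(history, symbol)
        history.append(symbol)
    return model, entropies


if __name__ == "__main__":
    # Hypothetical drum-loop symbols: (timbre cluster, IOI in beats).
    loop = [("kick", 0.5), ("hat", 0.5), ("snare", 0.5), ("hat", 0.5)]
    model, entropies = run_causal(loop * 4)
    print(entropies)
```

Because each prediction is made before the model is updated with the observed symbol, the loop is causal in the sense used in the description. On a strictly repeating pattern the per-event entropy falls as longer contexts become available, which is the kind of trend the description reports for real audio.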