0


用于音频场景分析的稳定自回归模型的学习字典

Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis
课程网址: http://videolectures.net/icml09_cho_ldsamasa/  
主讲教师: Youngmin Cho
开课单位: 加州大学圣地亚哥分校
开课时间: 2009-08-26
课程语种: 英语
中文简介:
本文探讨了基追踪在声场分析中的应用。我们的工作目标是检测混合音频信号中何时存在某些声音。我们关注的是这样一种情况:在大量可能的信号源中,一个较小但未知的数字结合并重叠以产生观测到的信号。为了推断哪些声音存在,我们将观察到的信号分解为少量有源源的线性组合。我们将推理转化为线性回归的正则化形式,其稀疏解产生的分解具有很少的活动源。我们用时域波形的自回归模型来描述单个声源的声学变异性。当我们没有关于个别来源的先验知识时,这些自回归模型的系数必须从音频例子中学习。我们分析了这些模型的动态稳定性,并展示了如何用简单的凸优化代替一个不确定的特征值问题来估计稳定模型。我们通过学习音符词典和使用这些词典来分析钢琴、大提琴和小提琴的复调录音来演示我们的方法。
课程简介: In this paper, we explore an application of basis pursuit to audio scene analysis. The goal of our work is to detect when certain sounds are present in a mixed audio signal. We focus on the regime where out of a large number of possible sources, a small but unknown number combine and overlap to yield the observed signal. To infer which sounds are present, we decompose the observed signal as a linear combination of a small number of active sources. We cast the inference as a regularized form of linear regression whose sparse solutions yield decompositions with few active sources. We characterize the acoustic variability of individual sources by autoregressive models of their time domain waveforms. When we do not have prior knowledge of the individual sources, the coefficients of these autoregressive models must be learned from audio examples. We analyze the dynamical stability of these models and show how to estimate stable models by substituting a simple convex optimization for a difficult eigenvalue problem. We demonstrate our approach by learning dictionaries of musical notes and using these dictionaries to analyze polyphonic recordings of piano, cello, and violin.
关 键 词: 追求音频场景分析; 线性组合; 自回归模型
课程来源: 视频讲座网
最后编审: 2019-12-07:lxf
阅读次数: 46