0


移位树:一种基于模型的时间序列分类方法

ShiftTree: an Interpretable Model-Based Approach for Time Series Classification
课程网址: http://videolectures.net/ecmlpkdd2011_hidasi_shifttree/  
主讲教师: Balázs Hidasi
开课单位: 布达佩斯理工大学
开课时间: 2011-11-30
课程语种: 英语
中文简介:
时间序列数据挖掘的高效算法具有利用时间序列属性的特殊时间结构的共同点。为了将时间维度的信息容纳到过程中,我们提出了一种基于实例级别游标的索引技术,该技术与决策树算法相结合。这有利于以下几个原因:(a)它对时间水平噪声(例如渲染,时移)不敏感,(b)其工作方法可以被解释,使得分类过程的解释更容易理解,并且(c) )它可以管理不同长度的时间序列。将实现的名为ShiftTree的算法与使用不同距离度量的众所周知的基于实例的时间序列分类器1NN进行比较,用于公共基准时间序列数据库的所有20个数据集和另外两个公共时间序列数据集。在这些基准数据集上,我们的实验表明,基于新模型的算法的平均精度略高于最有效的基于实例的方法,并且有多个数据集,其中基于模型的分类器超出了基于实例的方法的准确性。我们还通过对SIGKDD 2007时间序列分类挑战的20个数据集进行盲测,对我们的算法进行了评估。为了提高模型精度并避免模型过度拟合,我们也提供了森林方法。
课程简介: Efficient algorithms of time series data mining have the common denominator of utilizing the special time structure of the attributes of time series. To accommodate the information of time dimension into the process, we propose a novel instance-level cursor based indexing technique, which is combined with a decision tree algorithm. This is beneficial for several reasons: (a) it is insensitive to the time level noise (for example rendering, time shifting), (b) its working method can be interpreted, making the explanation of the classification process more understandable, and (c) it can manage time series of different length. The implemented algorithm named ShiftTree is compared to the well-known instance-based time series classifier 1-NN using different distance metrics, used over all 20 datasets of a public benchmark time series database and two more public time series datasets. On these benchmark datasets, our experiments show that the new model-based algorithm has an average accuracy slightly better than the most efficient instance-based methods, and there are multiple datasets where our model-based classifier exceeds the accuracy of instance-based methods. We also evaluated our algorithm via blind testing on the 20 datasets of the SIGKDD 2007 Time Series Classification Challenge. To improve the model accuracy and to avoid model overfitting, we provide forest methods as well.
关 键 词: 时间序列数据; 索引技术; 决策树算法
课程来源: 视频讲座网
最后编审: 2019-04-02:cwx
阅读次数: 72