0


评估Shapelet变换的改进

Evaluating Improvements to the Shapelet Transform
课程网址: https://videolectures.net/videos/kdd2016_bostrom_shapelet_transfo...  
主讲教师: Aaron Bostrom
开课单位: KDD 2016研讨会
开课时间: 2025-02-04
课程语种: 英语
中文简介:
Shapelet树算法于2009年提出,是一种寻找相位无关序列的新方法,可用于时间序列分类。形状集发现算法是O(n2m4),其中n是案例的数量,m是序列的长度。有几种方法试图提高找到形状的速度。ShapeletTransform将查找减少到一次,FastShapelets通过PAA和SAX平滑并减少序列长度。然而,这两种技术都无法枚举UCR存储库中最大数据集上的所有形状。我们首先评估FastShapelet算法作为一种变换是否更好,其次为shapelet变换提供一个契约分类器,通过计算基本操作的数量,我们可以估计算法的运行时间,并对数据进行采样以满足这一契约。我们发现,虽然FastShapeletTransform确实大大减少了查找形状集的操作次数,但它并不比FastShapelets好多少,也不能与ShapeletTransform竞争。采样数据的工厂方法与ShapeletTransform相比具有竞争力,在某些情况下,尽管速度快得多,但我们看到了微小的改进。
课程简介: The Shapelet tree algorithm was proposed in 2009 as a novel way to find phase independent subsequences which could be used for time series classification. The shapelet discovery algorithm is O(n2m4), where n is the number of cases, and m is the length of the series. Several methods have sought to increase the speed of finding shapelets. The ShapeletTransform reduces the finding to a single pass, and FastShapelets smooths and reduces the series lengths through PAA and SAX. However neither of these techniques can enumerate all shapelets on the largest of the datasets present in the UCR repository. We first evaluate whether the FastShapelet algorithm is better as a transform, and secondly provide a contract classifier for the shapelet transform, by calculating the number of fundamental operations we can estimate the run time of the algorithm, and sample the data to fulfil this contract. We found that whilst the FastShapeletTransform does drastically reduce the operation count of finding shapelets it is not significantly better than FastShapelets, nor can it compete with the ShapeletTransform. The factory method for sampling the data is competitive with the ShapeletTransform and in some cases we see minor improvements despite being much faster.
关 键 词: 时间序列; 契约分类器; 采样数据
课程来源: 视频讲座网
数据采集: 2025-02-21:liyq
最后编审: 2025-02-27:liyq
阅读次数: 4