
MalStone: Towards a Benchmark for Analytics on Large Data Clouds
Course URL: http://videolectures.net/kdd2010_grossman_mtba/
Lecturer: Robert Grossman
Institution: Open Data Group, USA
Date: 2010-10-01
Language: English

Course description: Developing data mining algorithms that are suitable for cloud computing platforms is currently an active area of research, as is developing cloud computing platforms appropriate for data mining. Currently, the most common benchmarks for cloud computing are the Terasort and related benchmarks. Although the Terasort Benchmark is quite useful, it was not designed for data mining per se. In this paper, we introduce a benchmark called MalStone that is specifically designed to measure the performance of cloud computing middleware that supports the type of data-intensive computing common when building data mining models. We also introduce MalGen, a utility for generating data on clouds that can be used with MalStone.
Keywords: cloud computing; data mining; data generation
Source: VideoLectures.NET
Data collected: 2021-03-07:zyk
Last reviewed: 2021-03-10:zyk
Views: 76