0


一种针对机器学习而基于现场可编程门阵列的映射归约框架

FPGA-based MapReduce Framework for Machine Learning
课程网址: http://videolectures.net/nipsworkshops09_xu_fpga/  
主讲教师: Ningyi Xu
开课单位: 微软公司
开课时间: 2010-01-19
课程语种: 英语
中文简介:
机器学习算法在我们的日常生活中变得越来越重要。然而,在大规模数据集上的培训通常非常缓慢。FPGA是一个可重构的平台,可以实现高并行性和数据吞吐量。在加速基于FPGA的机器学习算法方面已经做了很多工作。在本文中,我们通过实现一个用于机器学习算法的片上MapReduce框架,将Google的MapReduce模型应用于FPGA。处理器调度程序实现了最大的计算资源利用率和负载平衡。针对许多机器学习算法的特点,设计了一种通用的数据访问方案,使大规模数据集的数据吞吐量最大化。这个框架将任务控制、同步和通信隐藏在设计人员之外,以缩短开发周期。在RankBoost加速的案例研究中,与基于CPU的设计相比,实现了高达31.8倍的加速,这与完全手动设计的版本相当。我们还讨论了其他两种机器学习算法(SVM和PageRank)的实现,以演示框架的功能。
课程简介: Machine learning algorithms are becoming increasingly important in our daily life. However, training on very large scale datasets is usually very slow. FPGA is a reconfigurable platform that can achieve high parallelism and data throughput. Many works have been done on accelerating machine learning algorithms on FPGA. In this paper, we adapt Google's MapReduce model to FPGA by realizing an on-chip MapReduce framework for machine learning algorithms. A processor scheduler is implemented for the maximum computation resource utilization and load balancing. In accordance with the characteristics of many machine learning algorithms, a common data access scheme is carefully designed to maximize data throughput for large scale dataset. This framework hides the task control, synchronization and communication away from designers to shorten development cycles. In a case study of RankBoost acceleration, up to 31.8x speedup is achieved versus CPU-based design, which is comparable with a fully manually designed version. We also discuss the implementations of two other machine learning algorithms, SVM and PageRank, to demonstrate the capability of the framework.
关 键 词: 机器学习算法; 并行性; 数据吞吐量; 资源利用率; 处理器调度程序
课程来源: 视频讲座网
最后编审: 2020-06-15:wuyq
阅读次数: 32