

浏览全部资源
扫码关注微信
辽宁工程技术大学软件学院,辽宁,葫芦岛,125105
Published Online:25 April 2020,
Published:2020
移动端阅览
Performance Prediction Framework for CUDA Programs[J]. Acta Electronica Sinica, 2020, 48(4): 654-661.
Performance Prediction Framework for CUDA Programs[J]. Acta Electronica Sinica, 2020, 48(4): 654-661. DOI: 10.3969/j.issn.0372-2112.2020.04.006.
为对CUDA并行程序内核性能进行分析和预测,从而指导并行程序设计及性能优化,提出一种性能预测框架.1)从GPU编程模型和设备架构细节入手,以线程束为研究单位,通过整合与GPU程序用时密切相关的软硬件基本特征,定义了并行空间闲置度、流处理器线程束负载、并行效应因子等高层次性能相关特征.2)基于上述特征,框架针对线程负载均衡型GPU程序,评估内核函数在不同问题规模以及执行配置下的执行时间.3)依据性能评估原理提出了内核函数执行配置参数的优化策略.验证实验结果表明,该框架在两种典型情境下对现有程序性能的平均预测准确率分别达到89%和94%,客观归纳了高层次特征与程序性能间的相关关系,且能定性分析并行算法性能水平.
In order to analyze and predict the performance of CUDA program kernel and guide parallel program design and performance optimization
a performance prediction framework is proposed. This paper starts with the GPU programming model and hardware architecture details
with warp as the research unit. By integrating hardware and software factors closely related to GPU program time
high-level performance-related features such as device parallel space idle degree (DPSID)
number of streaming multiprocessor warp (NSMW) are defined. Based on the above features
a framework for evaluating the execution time of kernel functions under different problem sizes and execution configurations is built for thread load balancing GPU programs. The principle of optimizing configuration parameters of kernel function execution is put forward to guide optimizing program performance. The experimental results show that the average prediction accuracy of the framework is 89% and 94% in the two scenarios
respectively.
0
Views
100
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621