电子学报 ›› 2016, Vol. 44 ›› Issue (6): 1300-1306.DOI: 10.3969/j.issn.0372-2112.2016.06.006

• 学术论文 • 上一篇    下一篇

考虑温度/功耗/热导之间相互作用的单循环迭代热分析算法

潘月斗1,2, 王嘉琪1,2, 唐亮3, 骆祖莹3   

  1. 1. 北京科技大学自动化学院, 北京 100083;
    2. 北京科技大学钢铁流程先进控制教育部重点实验室, 北京 100083;
    3. 北京师范大学信息科学与技术学院, 北京 100875
  • 收稿日期:2014-07-22 修回日期:2015-06-25 出版日期:2016-06-25
    • 作者简介:
    • 潘月斗 男,1966年出生,博士,副教授.主要从事交流电动机智能控制理论,研究及高速高精交流电动机驱动系统的计算机数字控制系统设计.E-mail:ydpan@ustb.edu.cn;王嘉琪 男,1989年出生,硕士研究生.主要从事电热综合分析、高性能并行计算方面的研究工作.E-mail:3706157@qq.com
    • 基金资助:
    • 国家自然科学基金 (No.51331002)

TPG_Sli:Single-Loop Iterative Thermal Analysis Algorithm Considering Interactions Among Temperature,Power and Heat Conductance

PAN Yue-dou1,2, WANG Jia-qi1,2, TANG Liang3, LUO Zu-ying3   

  1. 1. School of Automation, University of Science and Technology Beijing, Beijing 100083, China;
    2. Key Laboratory of Advanced Control of Iron and Steel Process (Ministry of Education), Beijing 100083, China;
    3. College of Information Science and Technology, Beijing Normal University, Beijing 100875, China
  • Received:2014-07-22 Revised:2015-06-25 Online:2016-06-25 Published:2016-06-25
    • Supported by:
    • National Natural Science Foundation of China (No.51331002)

摘要:

随着纳米工艺的不断改进,温度对漏电流功耗和热导的影响日益显著.考虑温度/功耗/热导相互作用的3D芯片热分析需要采用迭代方法对温度进行精确求解,即先用功耗密度向量和热导矩阵来求解温度向量,再用求解出来的温度向量来刷新功耗密度向量和热导矩阵.为了提高3D芯片热分析的效率,本文以一个设定温度值下的均匀热导矩阵作为预条件,先提出了一种双循环、内循环低迭代次数的高效求解算法TPG-FTCG.鉴于TPG-FTCG具有超快的内循环收敛速度,本文省去了TPG-FTCG算法的内循环部分,提出了一种单循环、低迭代次数的TPG求解算法TPG-Sli.基于GPU(Graphics Processing Unit)并行加速技术,本文编写并改进了TPG-Sli的GPU加速算法.实验数据表明:与采用经典高效的ICCG算法进行3D芯片热分析的TPG-ICCG算法相比,在足够小的误差范围内,TPG-Sli的GPU加速算法可以获得120倍的速度提升.

关键词: 算法, 热分析, 快速傅里叶变换, GPU并行

Abstract:

With the improvement of the nanometer technology, the influences among temperature, leakage power and heat conductance become increasingly significant and it should be taken into account in 3D chip comprehensive thermal analysis to solve the accurate temperature based on the iterative solution.The comprehensive thermal analysis method uses the nodal power density vector and the heat conductance matrix to solve the nodal temperature vector, and then, refreshes power density and heat conductance with the obtained nodal temperature.In order to improve the efficiency of 3D chip comprehensive thermal analysis, this work uses the heat conductance matrix as the precondition under a setting temperature.Then it proposes an efficient algorithm TPG-FTCG (CG with the Fast Transform-based Preconditioner) which has double-loop and lower inner-loop iterations.According to TPG-FTCG's fast inner-loop convergence rate, this work removes TPG-FTCG's inner-loop part then proposes a more efficient TPG solving algorithm TPG-Sli (Single-loop iterative), which only has single-loop iterative and fewer iterations.Based on the GPU parallel computing, this work compiles and refines TPG-Sli's GPU-parallel-computing algorithm.Experimental results demonstrate that:On the premise of precision losing, the TPG-Sli's GPU algorithm can achieve about 120X speedup compared with the TPG-ICCG algorithm, which uses the classical and efficient ICCG to deal with the 3D chip comprehensive thermal analysis.

Key words: algorithm, thermal analysis, Fast Fourier transform, GPU parallel computing

中图分类号: