摘要 Amdahl's law is a simple and fundamental tool for understanding the evolution of performance as a function of parallelism. Following a recent trend on timing and power analysis of general purpose many-core chip using this law, we develop a novel PIP Panalytical model for evaluating the performance and power of hierarchical on-chip large-scale parallel architectures with the core number, super-node size, processing element number, and function unit number taken into consideration. We thereby investigate the influence of workload characteristics (Thread-level parallel TLP, Instruction-level parallel IL Pand Data-level parallel DLP) on resource allocation with the restriction of performance and power. The results provide some feasible options to design TOPS level DS Parchitecture as well as a theoretical basis for making the design more scalable.
Abstract:Amdahl's law is a simple and fundamental tool for understanding the evolution of performance as a function of parallelism. Following a recent trend on timing and power analysis of general purpose many-core chip using this law, we develop a novel PIP Panalytical model for evaluating the performance and power of hierarchical on-chip large-scale parallel architectures with the core number, super-node size, processing element number, and function unit number taken into consideration. We thereby investigate the influence of workload characteristics (Thread-level parallel TLP, Instruction-level parallel IL Pand Data-level parallel DLP) on resource allocation with the restriction of performance and power. The results provide some feasible options to design TOPS level DS Parchitecture as well as a theoretical basis for making the design more scalable.
This work is supported by the National Natural Science Foundation of China (No.61070036, No.61133007).
通讯作者:
YANG Hui, CHEN Shuming
E-mail: huihui19870124@126.com;smchen@nudt.edu.cn
作者简介: WU Tiebin was born in 1987. He received the B.E. and M.E. degrees from the School of Computer Science, National University of Defense Technology (NUDT), Changsha, China, in 2009 and 2011, respectively. Her research interests include modeling and simulation of DS Psystem and micro-architecture.
引用本文:
YANG Hui, CHEN Shuming, WU Tiebin. Parameterized Integrated Power and Performance (PIPP) Model for Ultra High-Performance of TOPS level DSP[J]. 电子学报, 2013, 22(4): 707-711.
YANG Hui, CHEN Shuming, WU Tiebin. Parameterized Integrated Power and Performance (PIPP) Model for Ultra High-Performance of TOPS level DSP. Chinese Journal of Electronics, 2013, 22(4): 707-711.
[1] Hao Xuefei, Chen Jie, Zhao Danfeng, Zhou Chaoxian, “Algorithms and architecture implentations of MIMO OFDM basebandr eceiver based on the SIMD DS Pcore”, Journal of Electronics,Vol.23, No.5, pp.763-768, 2006.[2] M.Woh, S. Seo, S. Mahlke, “AnySP: Anytime anywhere anyways ignal processing”, Proc. of ACM International Symposium on Computer Architecture, Austin, Texas, USA, pp.81-91, 2009.[3] H.Y. Dai, X.S. Wang, “A new polarimetric method by usings patial polarization characteristics of scanning antenna”, IEEET rans. on Antennas and Propag., Vol.60, No.3, pp.1653-1656,2 012.[4] Giuseppe Ascia, Vincenzo Catania, Maurizio, “EPIC-Explorer:A parameterized VLIW-based platform framework for designs pace exploration”, Proc. of 1st Workshop on Embedded Systemsf or Real-Time Multimedia (ESTIMedia), Newport Beach, CA, USA, pp.65-72, 2003.[5] D. Chandra, F. Guo, S. Kim, Y. Solihin, “Predicting interthreadc ache contention on a chip multi-processor architecture”,I nternational Symposium on High-performance Computer Architecture( HPCA), Washington DC, USA, pp.340-351, 2005.[6] M. Hill, M. Marty, “Amdahl's law in the multicore era”, IEEE Computer Society, Vol.41, No.7, pp.33-38, 2008.[7] D.H. Woo, H.S. Lee, “Extending Amdahl's law for energyefficientc omputing in the many-core era”, IEEE Computer Society,Vol.41, No.12, pp.24-31, 2008.[8] R. Ge, K.W. Cameron, “Power-Aware Speedup”, IEEE International Parallel and Distributed Processing Symposium( IPDPS), Long Beach, California, USA, Vol.41, No.7, pp.1-10,2 007.[9] R. Ge, X.Z. Feng, K.W. Cameron, “Modeling and evaluatinge nergy-performance efficiency of parallel processing on multicoreb ased power aware systems”, IEEE International Parallel and Distributed Processing Symposium (IPDPS), California,USA, Vol.30, No.7, pp.1-8, 2009.[10] G.M. Amdahl, “Validity of the single-processor approach toa chieving large-scale computing capabilities”, AFIPS Conference Proceedings, Vol.30, No.3, pp.483-485, 1967.[11] J.L. Hennessy, D.A. Patterson, Computer Architecture: AQ uantitative Approach, Fourth Edition, Publishing House ofE lectronics Industry, Beijing, China, pp.32-36, 2007. (in Chinese)