Abstract: The probabilistic transfer matrix (PTM) is a more accurate approach for estimating the effects of soft errors on gate-level circuit reliability. However, the existing method has so far been applicable only to small-scale circuits. This paper presents a serial method of circuit reliability calculation based on PTM. The circuit under evaluation is partitioned into modules small enough for their reliability to be calculated by the original PTM method. The reliability of the whole circuit is then calculated from the serial reliability model and the reliability values of all modules. A suitable circuit partition parameter (i.e., the partition width) is obtained through experiments with 74-series circuits. The proposed method is applied to the ISCAS85 benchmark circuits, and the results show that it can be used for larger non-redundant combinational circuits. Using some ISCAS85 circuits, the new method is also verified to be consistent with the reliability calculation method recommended by the MIL-HDBK-217 standard.
Keywords: high-level circuit reliability evaluation; serial reliability model; probabilistic transfer matrix; circuit partition
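The abstract above does not give the formula behind its serial reliability model. As a minimal sketch, assuming the textbook series model (the circuit works only if every module works, and module failures are independent), the overall reliability is the product of the module reliabilities. The function name and the module values below are hypothetical, and the per-module PTM calculation is not shown.

```python
from math import prod


def serial_reliability(module_reliabilities):
    """Reliability of a series system: the product of its modules' reliabilities.

    Assumes independent module failures (textbook series model); this is an
    illustration, not the paper's exact procedure.
    """
    values = list(module_reliabilities)
    for r in values:
        if not 0.0 <= r <= 1.0:
            raise ValueError(f"reliability must lie in [0, 1], got {r}")
    return prod(values)


# Hypothetical per-module reliabilities, e.g. as produced by a PTM evaluation.
circuit_reliability = serial_reliability([0.99, 0.98, 0.995])
```

Under this model, adding modules can only lower the overall figure, which is why the partition width matters: it trades per-module PTM cost against the number of factors in the product.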
LIU Yi, LI Ming-xiu, ZHANG Xin, LI He, JIAO Lin, QIAN De-pei
Vol. 37, Issue 2, Pages: 248-252(2009)
Abstract: To support I/O operations inside transactions, this paper proposes a hardware transactional memory system architecture based on multi-core processors and existing cache coherence mechanisms. It supports transactions by adding a transactional buffer and related hardware and software. I/O operations within transactions are implemented through partial commit based on a commit lock, together with blocking and waking up of transactional threads. This solution solves or avoids the problems faced by I/O operations within transactions, including rollback, transaction migration, and transactional buffer overflow. The system has been implemented in simulation, and its performance is evaluated with five benchmark applications. Simulation results show that transactional programs executed on the system outperform traditional lock-based programs.
Abstract: The influence of electromigration on the creep fracture behavior of lead-free solder micro-interconnections was characterized under different current densities (0.8~1.27 × 10⁴ A/cm²) and holding times (0~96 hours). The experimental results show that the creep damage process of the lead-free solder joints is clearly accelerated by the electromigration effect, and that increases in both current density and holding time raise the creep strain rate and shorten the creep lifetime of the joints. In addition, electromigration causes a fracture mode transition at the cathode interface of the solder joints, from ductile fracture in joints without electromigration to brittle fracture in joints subjected to strong electromigration at high current density and long holding time.
FENG Yi, XU Jing-wei, YI Jiang-fang, TONG Dong, CHENG Xu
Vol. 37, Issue 2, Pages: 258-265(2009)
Abstract: Verification of clock domain crossing (CDC) designs is crucial to SoC functional verification. Traditional model checking methods for CDC designs do not consider the completeness of properties. However, a complete set of design properties is the basis for model checking, and incomplete properties lead to bug escapes. To generate complete properties for CDC designs, we first propose a property generation method based on finite state automata. Then, to address the exponential explosion problem, we propose a metastability-based data type reduction strategy. Experimental results on two typical CDC designs show that our approach not only achieves 100% property coverage but also discovers a bug missed by traditional methods. Meanwhile, the verification time for model checking is greatly reduced.
WANG Jin-hui, GONG Na, GENG Shu-qin, HOU Li-gang, WU Wu-chen, DONG Li-min
Vol. 37, Issue 2, Pages: 266-271(2009)
Abstract: A charge self-compensation technique, in which the P-type dynamic node charges the N-type dynamic node, is proposed in this paper. Using this technique together with the dual-threshold-voltage and multiple-supply-voltage techniques, a low-power, high-performance Zipper CMOS domino full adder is designed. A novel power distribution method is also introduced; with it, the optimal path of the proposed full adder is found accurately so as to minimize power. Simulation results show that, compared with the standard, dual-threshold-voltage, and multiple-supply Zipper CMOS domino full adders under similar delay, the active power of the proposed full adder is reduced by up to 37%, 5%, and 7%, and the static power by up to 41%, 20%, and 43%, respectively. Finally, the dependence of the leakage current on the sleep-state combination of input and clock signals is analyzed, and the optimal sleep state is obtained.
DENG Song, WANG Ru-chuan, ZHANG Yu, ZHANG Jian-feng
Vol. 37, Issue 2, Pages: 272-277(2009)
Abstract: Grid resource allocation is an NP-hard optimization problem. To optimize grid resource allocation, this paper proposes a grid resource allocation model, analyzes in detail three different cases for the relative numbers of resources and tasks, and puts forward a new algorithm called the Grid Resource Allocation Algorithm based on Parallel GEP (GRA-PGEP). The algorithm adopts a nonlinear encoding based on resources and tasks together with an inversion operation, and a coarse-grained model is applied in its design. Simulation experiments show that GRA-PGEP outperforms traditional GEP and GA in optimization success rate, average convergence generation, and time consumption.
Keywords: gene expression programming; grid; resource allocation; coarse-grained model
Abstract: A novel topic segmentation algorithm is proposed in this paper. The algorithm first partitions the text into blocks. It then constructs whole-length lexical chains to analyze the multiple subtopics of the text. By constructing a graph that describes which blocks cover which subtopics, similar blocks that describe the same subtopic can be grouped together. To handle cases where segmentation points fall inside blocks, the algorithm segments the blocks a second time. Experimental results demonstrate that, by analyzing the topics of the text, the algorithm removes interference caused by irrelevant features from the segmentation results. The block-subtopic graph combines the similarities of both adjacent and non-adjacent blocks and increases segmentation precision. The second segmentation pass makes the segmentation results more reasonable.
SUN Sheng-li, DAI Dong-bo, HUANG Zhen-hua, ZHANG Qi-xun, ZHOU Li-xin
Vol. 37, Issue 2, Pages: 285-293(2009)
Abstract: The management and analysis of uncertain, probabilistic data streams has attracted considerable attention within the database community, and skyline query processing remains an open question. Although previous work has addressed skyline computation over static data or traditional data streams, skyline computation over probabilistic data streams is still unresolved. We propose an efficient algorithm, SOPDS, to handle this problem. Built on a more adaptable grid index, a set of heuristic rules such as probability bounding, progressive refinement, pre-elimination, and selective compensation is developed to improve the overall performance of SOPDS by reducing both CPU overhead and memory consumption. Extensive experiments demonstrate that SOPDS achieves high overall performance.
Keywords: probabilistic data stream; skyline; progressive refinement; pre-elimination strategy
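The abstract above does not define the skyline over probabilistic data, so the following is only a brute-force sketch under one common definition, stated here as an assumption: each tuple exists independently with a given probability, and its skyline probability is its own existence probability times the probability that none of its dominators exists. Smaller values are assumed better in every dimension. This is not SOPDS; it uses no grid index and none of the heuristic rules the abstract lists.

```python
def dominates(a, b):
    """True if point a dominates point b: no worse in every dimension
    and strictly better in at least one (smaller is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def skyline_probabilities(window):
    """Brute-force skyline probabilities for a window of uncertain tuples.

    `window` is a list of (point, existence_probability) pairs. Tuples are
    assumed to exist independently (an assumption of this sketch).
    """
    result = []
    for i, (point, prob) in enumerate(window):
        skyline_prob = prob
        for j, (other, other_prob) in enumerate(window):
            if i != j and dominates(other, point):
                # The tuple is on the skyline only if this dominator is absent.
                skyline_prob *= 1.0 - other_prob
        result.append(skyline_prob)
    return result


# Hypothetical window contents: (point, existence probability).
window = [((1, 1), 0.5), ((2, 2), 0.8), ((0, 3), 1.0)]
probabilities = skyline_probabilities(window)
```

The quadratic cost of this loop on every window update is the kind of overhead that bounding and pre-elimination rules are meant to avoid.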
Abstract: This paper studies the impact of switch service performance on the buffer resources and end-to-end delay of a typical 2D mesh network on chip. We find that, under a low drop rate, the buffer resources used in a 2D mesh network on chip are very difficult to reduce even with very high switch service performance. Based on this observation, a Star-Subnet-Based-Mesh (SSBM) network on chip is presented. With the same network scale and drop rate, SSBM achieves lower end-to-end delay, uses fewer buffer resources, and occupies less chip area than the 2D mesh topology, at the cost of a little switch service performance.
Keywords: network on chip; star-subnet; 2D mesh; switch service performance
Abstract: A self-adaptive discrete particle swarm algorithm is proposed. To overcome premature convergence, a repulsive process is introduced to increase swarm diversity, and a metric for measuring swarm diversity is designed. The attractive and repulsive processes change adaptively during a run. To speed up convergence, a strategy for controlling the inertia weight is proposed, in which the weight changes dynamically with the iterations in the different running phases of the algorithm. Moreover, the performance of the algorithm can be further enhanced by combining it with local search strategies. Finally, the proposed algorithm is applied to the TSP and FSSP and compared with other related algorithms. The experimental results show its superiority.
HUANG Kai, YIN Liao, LIN Feng-yi, GE Hai-tong, YAN Xiao-lang
Vol. 37, Issue 2, Pages: 305-311(2009)
Abstract: The rapid development of embedded applications is making SoC design more complex. How to design multiprocessor SoCs efficiently and reliably is becoming a challenge for designers. To address this challenge, a new multiprocessor prototype and its SoC design methodology are proposed in this paper. The prototype combines multiple processors and their communication into a single software-hardware prototype at different abstraction levels. Seamless refinement from high-level abstraction to low-level VLSI implementation makes it possible to design and verify the software/hardware interface and to improve the software/hardware architecture efficiently. An H.264 decoder experiment shows the feasibility of the multiprocessor prototype in both function and physical implementation. The seamless refinement method based on this prototype helps ensure the correctness of the SoC design and supports optimization of its software/hardware architecture.
Keywords: multi-processor prototype; System on Chip (SoC); software-hardware co-design
DENG Rang-yu, CHEN Hai-yan, DOU Qiang, XU Wei-xia, XIE Lun-guo, DAI Ze-fu, LI Yong-jin, XIA Jun, LUO Li, ZHANG Min-xuan
Vol. 37, Issue 2, Pages: 312-317(2009)
Abstract: A heterogeneous multi-core processor can integrate the merits of many types of architecture, so it can achieve peak performance as high as that of processors with special-purpose architectures while remaining as flexible as traditional processors. Designing a memory subsystem suited to YHFT64-3, a heterogeneous multi-core processor with 18 floating-point functional units, is challenging. In this paper, a parallel stream memory subsystem architecture is presented for YHFT64-3, its design idea is described, and the principle of finding and capturing parallel data streams at several levels is detailed. Test results show that the proposed application-specific memory subsystem improves system performance significantly. The performance of the 500 MHz YHFT64-3 is 2 to 3 orders of magnitude higher than that of YHFT64-2 at the same working frequency and is close to that of a 1.6 GHz Itanium 2 at lower cost.
TANG Gui-fen, YANG Wei-feng, HUANG Shuang-lin, LI Wei
Vol. 37, Issue 2, Pages: 318-324(2009)
Abstract: Progressive spatial join queries can be widely applied in systems such as spatial data integration and online spatial aggregation. This paper proposes an efficient progressive spatial join query algorithm. We add a spatial query strategy to the existing progressive join query architecture so that it can be applied to spatial join queries. We present a dynamic concurrent flush policy based on resident degree to handle memory overflow, which makes the memory-join phase more efficient. We also propose an incomplete join query algorithm based on BEA, which reduces redundant I/O and CPU cost in the disk-join phase. Extensive experiments show that our technique delivers results significantly faster than previous methods.
ZHAO Ke-ke, PENG Tian-qiang, LIU Sheng-tao, LI Qi-bin
Vol. 37, Issue 2, Pages: 325-328(2009)
Abstract: We propose a new shot transition detection technique for news video, based on the K-L transform and the characteristics of the human visual system. First, the algorithm applies the K-L transform to the RGB color space to produce an image parameter model consisting of eigenvectors. Second, it calculates the frame distance from the direction differences of the first and second principal axes in two adjacent frames and from weighting coefficients. Third, it uses a fixed-length sliding window around the current frame to detect candidate transitions. Finally, by calculating the differences between the head frame, which is defined as the candidate transition frame, and the other frames in the local window, it obtains and analyzes the model variance curve to locate and classify the shot transitions while also handling flash shots. The experimental results show that the proposed algorithm outperforms traditional methods and is robust to illumination changes and camera motion.
Abstract: At present, grid task scheduling algorithms focus on 1×n grids, namely one scheduler and n resources, but neglect m×n grids. We build a model of the m×n grid using M/M/1 queueing systems and propose the concept of a task scheduling Nash equilibrium among multiple schedulers. The optimization objective of each scheduler is the mean completion time per task. The Nash equilibrium is solved using PSO. Simulations show that the new algorithm is better than the algorithm based on mean scheduling strategies in terms of the mean number of tasks finished per unit time, the mean network load, and the mean load of grid resources.
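For the M/M/1 model mentioned in the abstract above, the mean time a task spends at a resource (waiting plus service) is the standard result 1 / (μ − λ), valid only when the arrival rate λ is below the service rate μ. The sketch below applies that formula to one scheduler's objective. How load from several schedulers combines at a resource is an assumption of this sketch (rates simply add), and all names and values are hypothetical; the PSO search for the equilibrium is not shown.

```python
def mm1_mean_completion_time(arrival_rate, service_rate):
    """Mean time in an M/M/1 system: 1 / (mu - lambda), for lambda < mu."""
    if arrival_rate < 0 or service_rate <= 0:
        raise ValueError("rates must be non-negative (service rate positive)")
    if arrival_rate >= service_rate:
        raise ValueError("queue is unstable: arrival rate >= service rate")
    return 1.0 / (service_rate - arrival_rate)


def scheduler_mean_completion_time(own_rates, other_rates, service_rates):
    """Mean completion time per task for one scheduler.

    own_rates[i]   : rate at which this scheduler sends tasks to resource i
    other_rates[i] : total rate sent to resource i by the other schedulers
    Assumes the rates at each resource add up (an assumption of this sketch).
    """
    total = sum(own_rates)
    if total <= 0:
        raise ValueError("scheduler must send a positive task rate")
    return sum(
        (own / total) * mm1_mean_completion_time(own + other, mu)
        for own, other, mu in zip(own_rates, other_rates, service_rates)
        if own > 0
    )


# Hypothetical example: one scheduler splitting 2 tasks/s over two resources.
mean_time = scheduler_mean_completion_time([1.0, 1.0], [0.0, 0.0], [3.0, 2.0])
```

In a Nash equilibrium of this kind, no scheduler can lower its own value of this objective by changing only its own split while the others' rates stay fixed.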
Abstract: To develop high-power terahertz sources, a large-radius coaxial cavity gyrotron is studied in this paper. Theoretical calculations show that, in a coaxial cavity, the cutoff frequency of the TE0n mode is almost proportional to n and inversely proportional to the distance between the inner and outer conductors. Based on these characteristics, a large-radius coaxial cavity 0.3 THz TE04 mode gyrotron is designed. Numerical calculations and PIC simulations show that large-radius coaxial gyrotrons have several advantages over waveguide gyrotrons: the coaxial cavity is comparatively larger, so the output power can be increased, and the separation between the TE0n and TEmn modes is increased, which greatly eases mode competition.
Abstract: A novel compact dual-band Wilkinson power divider (PD) is studied, which consists of a tri-section impedance transformer integrated with a spiral defected ground structure (SP-DGS). By using the SP-DGS to satisfy the impedance and phase conditions, the required characteristics of the PD, namely equal power split, impedance matching at all ports, and good isolation between the two output ports, are achieved at two arbitrary given frequencies simultaneously. The main advantage of this design is that the required reactance is accurately realized by the SP-DGS. The design process and formulas, which are strict but flexible, are also given. Finally, a 2.4 GHz/5.8 GHz dual-band PD is developed. Measured results show that the maximum passband insertion losses are 0.1 dB and 0.4 dB, and that the return loss and isolation are both better than 15 dB over a wide frequency range.
Abstract: Based on the master equation method for the single-electron transistor (SET) and a simplified version of Lientschnig's SET model, this paper presents a Verilog-A behavioral model for the SET, which is verified using Cadence Spectre. The model is shown to be reasonably accurate and fast for SET logic circuit simulation. It offers an efficient method for co-simulation of SET circuits and hybrid circuits.
Abstract: Sparse representations of images over over-complete dictionaries are widely applied in image processing owing to their sparsity, integrity, and separability. This paper proposes a dictionary learning algorithm and applies it to image denoising. The dictionary learning problem is expressed as a box-constrained quadratic program, and a fast projected gradient method is introduced to solve it. The learned dictionary describes the image content effectively. Experimental results show that, compared with wavelet-based denoising methods, the learning-based algorithm has better denoising ability, preserves more image detail, and improves the peak signal-to-noise ratio.
Keywords: sparse representation; basis pursuit; matching pursuit; dictionary learning; quadratic program
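To illustrate the kind of solver the abstract above refers to, here is a minimal sketch of plain (non-accelerated) projected gradient descent for a box-constrained quadratic program, minimizing 0.5·xᵀQx − bᵀx subject to lower ≤ x ≤ upper. It is not the paper's fast variant or its dictionary learning formulation; the function name, the fixed step size, and the small example problem are all assumptions made for illustration.

```python
def projected_gradient_box_qp(Q, b, lower, upper, step, iterations=500):
    """Minimize 0.5*x^T Q x - b^T x subject to lower <= x <= upper.

    Plain projected gradient descent: take a gradient step, then clip each
    coordinate back into its box. For a symmetric positive semidefinite Q,
    a step of at most 1/L (L = largest eigenvalue of Q) is a safe choice.
    """
    n = len(b)
    # Start from the projection of the origin onto the box.
    x = [min(max(0.0, lower[i]), upper[i]) for i in range(n)]
    for _ in range(iterations):
        # Gradient of the objective: Q x - b.
        grad = [sum(Q[i][j] * x[j] for j in range(n)) - b[i] for i in range(n)]
        # Gradient step followed by projection (coordinate-wise clipping).
        x = [min(max(x[i] - step * grad[i], lower[i]), upper[i]) for i in range(n)]
    return x


# Hypothetical 2-variable problem; the unconstrained minimizer (1, -2)
# lies outside the box [0, 1] x [0, 1].
solution = projected_gradient_box_qp(
    Q=[[2.0, 0.0], [0.0, 2.0]],
    b=[2.0, -4.0],
    lower=[0.0, 0.0],
    upper=[1.0, 1.0],
    step=0.4,
)
```

Projection onto a box is just coordinate-wise clipping, which is what makes this constraint type cheap to handle at every iteration.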
Abstract: Most existing direction-of-arrival (DOA) and polarization estimation methods based on electromagnetic vector sensor arrays assume that each electromagnetic vector sensor is strictly aligned with the reference Cartesian coordinates, i.e., that no orientation errors exist. In practice, however, orientation errors are unavoidable; they make the manifold of the vector sensor deviate from its ideal value and degrade the estimation performance of existing methods. It is therefore necessary to calibrate the orientation errors of the electromagnetic vector sensors. Based on the first-order Taylor approximation of the manifold and a known source, a new calibration algorithm for the orientation errors of electromagnetic vector sensors is proposed, and the corresponding expression for the stochastic Cramér-Rao bound (CRB) is derived. The performance of the new method is confirmed through numerical examples.
Keywords: electromagnetic vector sensor; polarization; direction of arrival; orientation errors; calibration
Abstract: CABAC is the entropy coding method adopted in the H.264/AVC main profile, and combining RDO with CABAC can improve the bit rate by around 20%. However, the computational complexity also increases drastically. Parallel algorithms are an efficient way to speed up the encoder, but because of the adaptive nature of CABAC, there are strict data dependencies between the encoding of macroblocks, which makes it hard to exploit parallelism in the encoder. In this paper, an efficient parallel algorithm for an H.264/AVC encoder with CABAC entropy coding is proposed, based on the Macro-Block Region Partition (MBRP) parallel method and a CABAC bit-rate estimation technique. The H.264 encoder is partitioned into a mode decision part and a bit-stream producing part, related as producer and consumer. CABAC in the RDO mode decision is replaced with bit-rate estimation, which removes the data dependencies introduced by CABAC. The mode decision part adopts the MBRP parallel algorithm, while the bit-stream producing part runs on a single processor and forms a task pipeline with the mode decision part. Simulation results on a 4-processor simulator show that the proposed parallel algorithm achieves speedups above 4.7 while maintaining similar R-D performance.
Keywords: H.264; CABAC; bit-rate estimation; RDO (rate-distortion optimization) mode decision; multi-processor; parallel algorithm; MBRP (macro-block region partition)
Abstract: In this paper, we propose an on-demand mechanism for data cache leakage power management, which manages data cache activity according to the demand of memory access instructions. Specifically, the mechanism keeps the whole data array in leakage-saving mode whenever it finds no memory access instructions; once a load instruction is detected, it employs two data cache access control policies and a dynamic selection scheme to capture the access demand of the load address early in the pipeline. Experimental results demonstrate that data cache leakage power is reduced by 85.4% on average, while performance increases by 4.41%. Compared with traditional methods, the proposed mechanism achieves better results in both power and performance.
LIU Wen-yuan, ZHANG Jiang-xiao, HU Qing-hua, GU Xiu-zhi
Vol. 37, Issue 2, Pages: 367-371(2009)
Abstract: Divisible electronic cash (E-cash) schemes based on a trusted third party (TTP) have drawbacks such as low efficiency and layer-by-layer computation from the root node when the user spends E-cash. Dispensing with the TTP, this paper introduces the concepts of direct computation and repeated division into off-line divisible E-cash for the first time and presents a new off-line divisible E-cash system without a trusted third party. In the new system, the total amount of E-cash spent by the user is [...] times that of previous schemes ([...] is the binary tree layer), different nodes divided from the same E-cash are unlinkable, and the computation for spending at any node is the same, which enhances the overall efficiency of the system.
Abstract: Because heuristic algorithms usually ignore the difference between an instruction and an instruction instance, we improve an existing heuristic algorithm to obtain the GreedyHeur algorithm. It calculates the weights of custom instructions from their instruction instances and then selects custom instruction instances with a greedy strategy according to the weights of their instructions. To find better custom instructions than heuristic algorithms can, we introduce an algorithm (ISDE) that integrates the greedy strategy with a differential evolution algorithm. Simple encoding and efficient fitness evaluation help ISDE find the best combination of custom instructions quickly. Experiments show that our algorithms find better custom instruction candidates more quickly and efficiently than the heuristic algorithm.
Keywords: differential evolution algorithm; greedy strategy; instruction set extension; instruction selection
Abstract: Timeliness is a key security property of practical non-repudiation protocols. This paper analyzes an existing representative multi-party non-repudiation protocol using a formal method and finds that it lacks support for timeliness. An improved protocol is then proposed that fixes the flaw by both adding time limit information to protocol messages and modifying the sequence of protocol steps.
Abstract: For localization services in two-dimensional space, a theorem on the optimal selection of reference nodes is proposed on the basis of an analysis of location error areas. The theorem indicates that purposeful selection of reference nodes minimizes the location error in indoor localization, and it provides a theoretical foundation for the layout and selection of reference nodes in indoor environments. Based on this theorem, the reference node optimization selection (RNOS) algorithm is proposed by improving the traditional polygon positioning algorithm. The algorithm is based on the relationship between the positions of the reference nodes and the position of the unknown node. By selecting the optimal reference nodes when calculating the unknown node's position, more accurate location information can be obtained. Simulation results indicate that the RNOS algorithm meets the real-time localization requirements of pervasive terminals and achieves good localization precision.
Abstract: A procedure for the parametric design and numerical simulation of sunflower heat sinks is developed. The procedure is implemented in parametric design and numerical simulation software written in VB, which generates high-quality hexahedral meshes automatically. The numerical simulation is also performed automatically, and the entropy generation rate of the fan-sink assembly is obtained without manual intervention. Geometric parametric studies, in which one parameter at a time is chosen as the control variable, are carried out quickly with the procedure to illustrate the effect of various design parameters on the performance of the fan-sink assembly. These studies provide physical insight into the flow and heat transfer characteristics of the heat sinks and a deeper understanding of their optimization. A multi-parameter constrained structural optimization procedure, based on a combination of Design of Experiments (DOE), Response Surface Models (RSM), Genetic Algorithm (GA), and Mixed Integer Optimization (MOST), is performed on the fan-sink assembly with parallel CFD to minimize the entropy generation rate. Results show that the entropy generation rate of the optimal heat sink is 4.5% lower than that of the initial design.
Abstract: Based on the H.248 protocol and an estimation model of subjective video quality, a bandwidth allocation strategy for video communications in next generation networks is presented. Then, for different changes in network status, two bandwidth allocation algorithms are proposed: one guarantees the QoS (quality of service) of the different classes of terminals, and the other maximizes the total subjective video quality of all terminals. Experimental results show that, when the network status worsens, the average PSNR (peak signal-to-noise ratio) of the recovered video can be increased by more than 15 dB using the proposed algorithm. In addition, when the network status has degraded too much to guarantee the QoS of all active terminals, the proposed bandwidth allocation algorithm still keeps the total subjective video quality of all terminals as high as possible.
Keywords: next generation network; video communications; H.248 protocol; Quality of Service
LI Xin, Janet M. Wang, ZHANG Ying, TANG Wei-qing, WU Hui-zhong
Vol. 37, Issue 2, Pages: 398-403(2009)
Abstract: Process variations affect the analysis of interconnects. Taking this impact into account, an interconnect model with process variations is proposed, and a crosstalk analysis method based on the spectral stochastic method is presented. Coupled interconnects with process variations are decoupled when the interconnect segments are analyzed. The crosstalk is then analyzed by the Stochastic Galerkin Method (SGM) and the Stochastic Collocation Method (SCM), respectively. Finally, an expression for the crosstalk noise is obtained by a complex approximation method. Experimental results demonstrate that the proposed method not only evaluates the crosstalk noise of non-uniform coupled interconnects effectively but also shows good computational efficiency.
QIAN Jiang-bo, WANG Yong-li, CHEN Zheng, CHEN Hua-hui, JIN Guang
Vol. 37, Issue 2, Pages: 404-409(2009)
Abstract: Accelerating data processing is one of the key problems in DSMS (Data Streams Management System) research. Many researchers improve processing speed through query optimization, which can saturate the CPU. Load shedding is a candidate option when a DSMS executing aggregate operations becomes overloaded. However, load shedding cannot be applied to join operations because it may lose many results. To greatly accelerate join processing, we present a novel hardware processor implementation for window join evaluation over multiple data streams. To handle variations in join conditions, we propose a set of instructions that can be executed on the processor. In addition, we introduce a multi-core processor to improve scalability and flexibility. Experimental results show that the processor outperforms software by more than ten times.
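For readers unfamiliar with the operation the abstract above accelerates, here is a minimal software sketch of a time-based sliding window join over two streams. It shows only the semantics a software baseline would compute; it says nothing about the paper's hardware design or instruction set. The event layout, the two-stream restriction, and the window rule (a tuple expires once it is more than `window` time units old) are assumptions of this sketch.

```python
from collections import deque


def window_join(events, window, predicate):
    """Sliding window join over two streams.

    events    : iterable of (timestamp, stream_id, value), sorted by
                timestamp, with stream_id 0 or 1
    window    : tuples older than `timestamp - window` are expired
    predicate : join condition, called as predicate(left_value, right_value)
    Returns the list of (left_value, right_value) matches in arrival order.
    """
    windows = (deque(), deque())  # one time-ordered buffer per stream
    results = []
    for timestamp, stream_id, value in events:
        # Expire tuples that have fallen out of the window on both streams.
        for buffer in windows:
            while buffer and buffer[0][0] < timestamp - window:
                buffer.popleft()
        # Probe the opposite stream's window with the new tuple.
        for _, other_value in windows[1 - stream_id]:
            left, right = (value, other_value) if stream_id == 0 else (other_value, value)
            if predicate(left, right):
                results.append((left, right))
        # Insert the new tuple into its own stream's window.
        windows[stream_id].append((timestamp, value))
    return results


# Hypothetical events: values are (key, payload); join on equal keys.
events = [
    (1, 0, ("k1", "a")),
    (2, 1, ("k1", "b")),
    (3, 1, ("k2", "c")),
    (10, 0, ("k1", "d")),
    (11, 1, ("k1", "e")),
]
matches = window_join(events, window=5, predicate=lambda l, r: l[0] == r[0])
```

Every arriving tuple probes the whole opposite window, so the cost grows with the window size and the arrival rate, and dropping input tuples would silently drop join results, which is consistent with the abstract's point about load shedding.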
Abstract: A methodology for building behavioral models of large-scale analog circuits is presented. Any analog circuit can be equivalently described by a network consisting of Sources, Amplifiers (an amplifier can be considered a virtual filter with gain), Switches, and Impedances. This network is defined as the SASI structure of the analog circuit. Owing to the equivalence between this abstracted network and the Hammerstein model, the macro behavior of analog circuits can be modeled using the Hammerstein method. These macro behavioral models can then be described in a standard HDL (Verilog-A, VHDL-AMS, and so on). The top-down methodology is validated using an in-house designed infrared remote control receiver as an example.
Abstract: A unified inverse transform architecture based on 8×8 blocks for the Video Codec One standard is proposed in this paper. Through data reconstruction, four different inverse transform modes can share the same architecture, which simplifies the hardware design. At the same time, the proposed inverse transform hardware architecture uses fewer hardware resources. Experiments show that, when clocked at 108 MHz, the proposed design can perform real-time inverse transforms for standard-definition and high-definition video decoders.
Abstract: The interpolation mechanism of the single-input single-output (SISO) fuzzy controller is further analyzed. Using symbolic computation, analytic expressions are derived for the control function of the SISO fuzzy controller and for the difference between the control function and the interpolation expression when the implication operator is the min operator or the product operator. An upper bound on this difference is then given, and the number of fuzzy inference rules is selected.
Abstract: Web service discovery has become a key problem with the development of web services. The current service discovery standard, UDDI, lacks support for semantic inference, and its mechanism cannot search by service capabilities and properties, which limits its performance. This problem can be addressed by adopting OWL-S in web service discovery. In this paper, a matching algorithm for service discovery is proposed based on the semantic similarity of concepts, with the aim of improving both the recall and the precision of web service search. The algorithm relies on a semantic similarity measure function that produces a numeric degree of similarity between ontology entity concepts. Experimental results are presented to show the effectiveness of the algorithm.
Keywords: web service discovery; UDDI; web ontology language for services (OWL-S); semantic matching algorithm; similarity measure function