ACTA ELECTRONICA SINICA

Select

SURVEY AND REVIEW

A Review of Coupled LDPC Codes for Streaming Communications

WANG Qian-fan, YANG Jia-yi, WANG Yin-chu, CAI Sui-hua, MA Xiao

ACTA ELECTRONICA SINICA. 2024, 52(8): 2913-2932. https://doi.org/10.12263/DZXB.20240167

Abstract (2825) Download PDF (965) HTML (2675)

Knowledge map

Save

Streaming communication is an essential communication scenario in optical fiber communication networks and mobile communication networks. Unlike traditional intermittent or block-oriented communication, the data transmission in streaming communication exhibits typical continuous streaming characteristics. Coupled codes, compared to traditional block codes, have shown significant performance improvement in streaming communication scenarios. Additionally, they inherit low encoding and decoding latency. These advantages make coupled codes an important candidate for channel coding in streaming communication scenarios. This paper first reviews existing coupled LDPC codes, including product-like coupled LDPC codes, partially reencoded coupled LDPC codes, spatially coupled LDPC (SC-LDPC) codes, and globally coupled LDPC (GC-LDPC) codes. Following that, this paper introduces a series of improved designs for coupled LDPC codes based on free-ride codes, and introduces a new class of coupled LDPC codes based on block Markov superposition transmission (BMST) techniques. Finally, this paper concludes with a discussion on future prospects and research directions of the coupled LDPC codes.

Select

PAPERS

A Node Classification Method Based on Graph Attention and Improved Transformer

LI Xin, LU Wei, MA Zhao-yi, ZHU Pan, KANG Bin

ACTA ELECTRONICA SINICA. 2024, 52(8): 2799-2810. https://doi.org/10.12263/DZXB.20230515

Abstract (2529) Download PDF (930) HTML (2472)

Knowledge map

Save

Currently, graph Transformers mainly add auxiliary modules in the traditional Transformer framework to model graph data. However, these methods have not improved the original Transformer architecture. Their data modeling accuracy needs to be further enhanced. Thus, this paper suggests a node classification method based on graph attention and improved Transformer. In the proposed framework, a topology enhancement based node embedding is constructed for graph structure reinforcement learning. Then, a secondary mask based multi-head attention is developed for aggregation and update. Finally, pre-Norm and skip connection are introduced to improve the interlayer structure of Transformer, which can avoid the over-smoothing problem caused by feature convergence. Experimental results demonstrate that compared to 6 typical baseline models, our method is able to achieve optimal evaluation results on all different indicators. Moreover, it can simultaneously handle the node classification task for both small and medium datasets and comprehensively improve the classification performance.

Select

PAPERS

A Spatiotemporal Fusion Algorithm of Remote Sensing Images Based on Cross-Scale Similarity Prior

FANG Shuai, WAN Qi, CAO Yang

ACTA ELECTRONICA SINICA. 2024, 52(6): 2037-2052. https://doi.org/10.12263/DZXB.20221147

Abstract (2294) Download PDF (507) HTML (2141)

Knowledge map

Save

The trade-off between spatial and temporal resolution of satellite images leads to spatial and temporal contradictions in image sequences. Spatiotemporal image fusion provides a solution to generate high spatial resolution and high temporal resolution images to satisfy various earth observation applications. The spatiotemporal fusion algorithm based on sparse representation establishes the relationship between high and low spatial resolution images by jointly training the dictionary and sparse coding representation, which provides a unified fusion framework for phenological change and type change. However, the multi-source remote sensing images come from different sensors, and the relationship model between high and low spatial resolution images implies the sensor mapping. This inevitably leads to that the model is device dependent. To solve the problem, we decompose the multi-source remote sensing spatiotemporal fusion process into two sub-problems, device dependent sensor bias correction and device independent spatiotemporal fusion. The sensor bias correction can be used as a preprocessing module to improve the universality and accuracy of subsequent fusion models. When there are large space scale gaps between high and low spatial resolution image, the assumption that “the sparse coefficients of high and low spatial resolution images are the same” will bring about very significant fusion errors. To solve the problem, we optimize the objective function of sparse representation using cross-scale similarity prior. Intermediate-scale images are constructed to reduce ambiguity of cross-scale similar patches and improve the accuracy of cross-scale similar patches. Experimental results in three typical scenarios demonstrate the generalization ability of our algorithm. The contrastive experiments show that on the BOREAS dataset, compared to suboptimal indicators, SSIM (Structural SIMilarity) is improved by 4.2%, SAM (Spectral Angle Mapper) is increased by 4.6%; On the CIA dataset, compared to suboptimal indicators, SSIM is increased by 2.7%, and SAM is increased by 12.8%; On the LGC dataset, compared to suboptimal indicators, SSIM is increased by 7.1%, and SAM is increased by 16.3%. Our algorithm is superior to other compared methods in spatial and spectral performance.

Select

PAPERS

Boundary Feature Fusion and Foreground Guidance for Camouflaged Object Detection

LIU Wen-xi, ZHANG Jia-bang, LI Yue-zhou, LAI Yu, NIU Yu-zhen

ACTA ELECTRONICA SINICA. 2024, 52(7): 2279-2290. https://doi.org/10.12263/DZXB.20230668

Abstract (2226) Download PDF (1334) HTML (2175)

Knowledge map

Save

Camouflage object detection aims to detect highly concealed objects hidden in complex environments, and has important application value in many fields such as medicine and agriculture. The existing methods that combine boundary priors excessively emphasize boundary area and lack the ability to represent the internal information of camouflaged objects, resulting in inaccurate detection of the internal area of the camouflaged objects by the model. At the same time, existing methods lack effective mining of foreground features of camouflaged objects, resulting in the background area being mistakenly detected as camouflaged object. To address the above issues, this paper proposes a camouflage object detection method based on boundary feature fusion and foreground guidance, which consists of several stages such as feature extraction, boundary feature fusion, backbone feature enhancement and prediction. In the boundary feature fusion stage, the boundary features are first obtained through the boundary feature extraction module and the boundary mask is predicted. Then, the boundary feature fusion module effectively fuses the boundary features and boundary mask with the lowest level backbone features, thereby enhancing the camouflage object’s boundary position and internal region features. In addition, a foreground guidance module is designed to enhance the backbone features using the predicted camouflage object mask. The camouflage object mask predicted by the previous layer of features is used as the foreground attention of the current layer features, and performing spatial interaction on the features to enhance the network’s ability to recognize spatial relationships, thereby enabling the network to focus on fine and complete camouflage object areas. A large number of experimental results in this paper on four widely used benchmark datasets show that the proposed method outperforms the 19 mainstream methods compared, and has stronger robustness and generalization ability for camouflage object detection tasks.

Select

SURVEY AND REVIEW

A Survey of Network Attack Investigation Based on Provenance Graph

QIU Jing, CHEN Rong-rong, ZHU Hao-jin, XIAO Yan-jun, YIN Li-hua, TIAN Zhi-hong

ACTA ELECTRONICA SINICA. 2024, 52(7): 2529-2556. https://doi.org/10.12263/DZXB.20231057

Abstract (2074) Download PDF (1513) HTML (1974)

Knowledge map

Save

Investigating network attacks is crucial for the implementation of proactive defenses and the formulation of tracing countermeasures. With the rise of sophisticated and stealthy network threats, the need to develop efficient and automated methods for investigations has become a pivotal aspect of advance intelligent network attack and defense capabilities. Existing studies have focused on modeling system audit logs into provenance graphs that represent causal dependencies of attack events. Leveraging the powerful associative analysis and semantic representation capabilities of provenance graphs, complex and stealthy network attacks can be effectively investigated, yielding superior results compared to conventional methods. This paper offers a systematic review of the literature on provenance-graph-based attack investigation, categorizing the diverse methodologies into three principal groups: causality analysis, deep representation learning, and anomaly detection. For each category, the paper succinctly presents the workflows and the core frameworks that underpin these methodologies. Additionally, it delves into the optimization techniques for provenance graphs and chronicles the evolution of these technologies from theoretical constructs to their application in industrial settings. This study methodically aggregates and reviews datasets prevalently utilized in attack investigation research, offering a comprehensive comparative analysis of representative techniques alongside their associated performance metrics, specifically within the ambit of provenance graph-based methodologies. Subsequently, it delineates the prospective directions for future research and development within this specialized field, thereby providing a structured roadmap for advancing the domain's academic and practical applications.

Select

PAPERS

A Fast Simplified Model for X-ray Pulsar Photon Arrival Time Conversion

ZHANG Ze-wei, BAO Wei-min, FANG Hai-yan, SU Jian-yu, LI Xiao-ping, YAO Yun-feng

ACTA ELECTRONICA SINICA. 2024, 52(9): 2939-2949. https://doi.org/10.12263/DZXB.20221003

Abstract (1988) Download PDF (380) HTML (1951)

Knowledge map

Save

A high-precision X-ray photon arrival time conversion model is crucial to the accuracy of X-ray Pulsar-based Navigation. Aiming at the current problem that the complete model is complex and the simplified model has limited accuracy, a fast simplified model with accuracy no less than the existing simplified model is proposed in this paper. Through the derivation of the existing complete model, the influence of each delay item on the accuracy of the model is theoretically analyzed, and it is pointed out that the Roemer delay is still the key to the accuracy of the simplified model. A fast simplified model was obtained by changing the expression of the Roemer delay and its second-order expansion, and considering the ease of access to physical quantities in practical application. The accuracy and computational efficiency of the proposed model are analyzed by using the complete model and the proposed simplified model to time-transform the measured photon data of NICER (Neutron star Interior Composition Explorer) and HXMT (Hard X-ray Modulation Telescope) satellites. Furthermore, the influence of orbital altitude and pulsar angular position measurement errors on the accuracy of the simplified model is analyzed by numerical simulation, and the accuracy and computational efficiency of the simplified model in the application of Earth orbit at different altitudes are discussed. The results show that the computational efficiency of the simplified model proposed in this paper is improved 50% than that of the Sheikh’s simplified model and 10% than the fei’s model, without causing a decrease in accuracy.

Select

PAPERS

Refocusing for Three-Dimensional Rotating Ship Targets in SAR Images Based on Minimum Entropy Criteria and Generative Adversarial Network

HUA Qing-long, ZHANG Yun, REN Hang, JIANG Yi-cheng, XU Dan

ACTA ELECTRONICA SINICA. 2024, 52(8): 2900-2912. https://doi.org/10.12263/DZXB.20230465

Abstract (1979) Download PDF (528) HTML (1915)

Knowledge map

Save

In synthetic aperture radar (SAR) system, the three-dimensional rotation of ship targets in the presence of a medium and high sea state would lead to time-varying Doppler spectrum and image defocusing, which will adversely affect the subsequent information interpretation of ship targets in SAR images. Aiming at the refocusing problem of three-dimensional rotating ship targets, this paper proposes a SAR refocusing method for three-dimensional rotating ship target based on minimum entropy criterion and generative adversarial network, and designs the network structure of generator and discriminator. The generator transforms the defocused complex SAR ship image into range-Doppler domain, and estimates the phase error coefficient by range unit using phase error coefficient estimation network, and realizes the compensation of multi-order phase errors. The discriminator is composed of a complex-valued convolutional neural network, and all its elements, including convolution layer, activation function, feature mapping and parameters, are extended to the complex domain. The minimum entropy criterion and adversarial loss are introduced into the loss function to achieve unsupervised training and avoid the problem that it is difficult to obtain the target labeling samples of non-cooperative ships. Experiments on simulated data and Gaofen-3 data show that the proposed method achieves significant improvements in both refocusing accuracy and efficiency.

Select

PAPERS

Robust Identification of Nonlinear State-Space System Based on Dual Heavy-Tailed Noise Distributions

LIU Xin, HAI Yang, DAI Wei

ACTA ELECTRONICA SINICA. 2024, 52(9): 3052-3064. https://doi.org/10.12263/DZXB.20230957

Abstract (1962) Download PDF (1139) HTML (1896)

Knowledge map

Save

The state space model is a common and important model structure for automation and control. In this paper, the robust identification of nonlinear state-space model corrupted by outliers is investigated. The outliers imposed on both the state transition process and the output measurement process are considered and a more comprehensive and robust identification algorithm is proposed. To ensure the robustness of the proposed algorithm, two independent heavy-tailed Student's t-distributions are used to describe the state noise and the output noise, respectively. Then the particle smoothing method is applied to estimate the posterior distribution of the unknown states. Finally, the expectation maximization algorithm is used to realize the parameter estimation problem. The mathematical decomposition of the Student's t-distribution is employed in the identification process which brings two main advantages: (1) facilitating the derivation and implementation of the proposed algorithm; (2) providing a more clearer explanation of the robustness of the algorithm. The usefulness of the proposed algorithm is demonstrated via the numerical and mechanical examples.

Select

PAPERS

A Reconfigurable Heterogeneous Memory Architecture and Memory Controller

JIN Xiao-zhong, LIU Hai-kun, LAI Hao, MAO Fu-bing, ZHANG Yu, LIAO Xiao-fei, JIN Hai

ACTA ELECTRONICA SINICA. 2024, 52(9): 3038-3051. https://doi.org/10.12263/DZXB.20221257

Abstract (1898) Download PDF (1371) HTML (1837)

Knowledge map

Save

Heterogeneous memory systems composed of traditional dynamic random access memory (DRAM) and new non-volatile memory (NVM) can be organized in a horizontal architecture or a hierarchical architecture. The horizontal DRAM/NVM architecture often requires page migration technologies to improve memory access performance. However, hot page monitoring and migration implemented in operating systems would cause significant software performance overhead. The hardware-supported hierarchical architecture even increases the memory access latency for big data applications with poor data locality due to the deeper memory hierarchy. To this end, this paper proposes a reconfigurable heterogeneous memory architecture that can be converted between horizontal and hierarchical architectures at runtime to dynamically adapt the memory access characteristics of different applications. We design a DRAM/NVM heterogeneous memory controller (HMC) based on the new instruction set architecture RISC-V (Reduced Instruction Set Computing-V). The HMC uses a few hardware counters for memory access monitoring and analyzing, and achieves dynamic address mapping and eﬀicient page migration between DRAM and NVM pages. Experimental results show that the DRAM/NVM hybrid memory controller can improve application performance by 43%.

Select

PAPERS

Ultrawideband and High Absorption Composite Absorber Enabled by Double Layers of Metasurface Complementary Enhancement Design

MA Zhe-yi-pei, JIANG Chao, LIU Yan-qiong, LI Jia-le

ACTA ELECTRONICA SINICA. 2024, 52(8): 2668-2678. https://doi.org/10.12263/DZXB.20230609

Abstract (1876) Download PDF (561) HTML (1828)

Knowledge map

Save

In this study, a design method of multilayer structure composite absorber based on double layers of metasurface is proposed. The designed composite absorber consists of two layers of metasurface, top absorption enhanced skin and several support dielectric slabs. The unit cells of metasurface Ⅰ and Ⅱ are separately irregular-shaped metal patches connected by chip resistors and hexagonal metal rings loaded with chip resistors; the top absorption enhanced skin is a fiberglass enforced epoxy laminate; support dielectric slabs adopt PMI foam. The simulation results indicate that the absorption frequency bands with reflection coefficients below -10 dB and -20 dB are 2.80~23.64 GHz and 3.56~22.56 GHz, respectively. The measurement results show that, the absorption frequency bands with reflection coefficients below -10 dB and -20 dB are 2.36~23.87 GHz and 3.17~23.16 GHz, respectively; the reflection coefficient curves obtained by simulation and test have good consistency, which verifies the effectiveness of the design method. The simulation and test results show that the frequency band of -10 dB reflection coefficient at 50° oblique incidence is basically consistent with that at normal incidence, while the offset of start and stop frequencies is less than 0.8 GHz; further, the simulation and test results show that when the oblique incidence angle is 60°, the fractional bandwidth of the reflection coefficient below -10 dB is up to 141.8%, which indicates that the composite absorber designed in this study has the incidence stability in a wide angle range. In addition, the mechanism of ultrawideband and high absorption of the structure and the influence of the main structure parameters are analyzed; the results demonstrate that the top absorption enhanced skin can improve absorptivity of the whole structure by up to 0.2(1.0 represents 100% absorption), and the complementary enhancement design of the two layers of metasurface absorption frequency bands can improve oblique incidence stability obviously.

Select

PAPERS

A CatBoost Optimization-Based Fault Diagnosis Model for Photovoltaic Arrays

PENG Zi-ran, XU Huai-shun, XIAO Shen-ping

ACTA ELECTRONICA SINICA. 2024, 52(7): 2418-2428. https://doi.org/10.12263/DZXB.20240236

Abstract (1859) Download PDF (816) HTML (1830)

Knowledge map

Save

CSCD(1)

Most of the photovoltaic power stations are located in remote areas with complex terrain, which are affected by the external environment and prone to various faults. The traditional PV array fault diagnosis methods have the problems of low accuracy and low utilization of PV data. Aiming at the above problems, in this paper, we first improve the sparrow search algorithm (SSA) by introducing the Levy flight strategy and the dynamic adjustment strategy of the step factor to reduce the risk of the SSA algorithm falling into the local optimum and improve the optimization ability of the SSA algorithm. Then the improved levy adjustment sparrow search algorithm (LASSA) is used to optimize the key hyperparameters of the CatBoost model, and a photovoltaic array fault diagnosis model LASSA-based on CatBoost and using LASSA as the optimization strategy is proposed. CatBoost for accurate diagnosis of short-circuit, open-circuit, aging and shadow masking faults in PV arrays. The experimental results show that the fault diagnosis accuracy of the LASSA-CatBoost model is 99.7%, which is 3.6% higher compared to the CatBoost model before optimization. Compared with the existing PV array fault diagnosis models, the LASSA-CatBoost model has higher accuracy and stability.

Select

PAPERS

Design of a Millimeter-Wave Dual-Band Low Phase Noise VCO in 45 nm CMOS SOI Process

CHEN Zhe, WANG Pin-qing, ZHOU Pei-gen, CHEN Ji-xin, HONG Wei

ACTA ELECTRONICA SINICA. 2024, 52(7): 2161-2169. https://doi.org/10.12263/DZXB.20230645

Abstract (1813) Download PDF (714) HTML (1777)

Knowledge map

Save

This paper presents the design of a millimeter-wave dual-band low phase noise voltage-controlled-oscillator in 45 nm CMOS SOI (Complementary Metal Oxide Semiconductor Silicon On Insulator) process, which covers bands of 24.25~27.5 GHz and 37~43.5 GHz for 5G millimeter-wave communications. Based on the transistor’s high performance as the RF switch in SOI process, the switched cap-bank and switched inductor topology are proposed in this paper, to enhance the quality factor Q for the wide-band tuning inductance and capacitance, increase the VCO (Voltage Controlled Oscillator) operating bandwidth, and lower the phase noise performance. Meanwhile, the switched capacitor is also adopted in the output matching network for good matching and stable output power in dual-bands. Measured results show that the designed VCO covers the bands of 24.25~27.5 GHz and 37~43.5 GHz for 5G millimeter-wave communication standards as in WRC-19, with output power of -4.8~0 dBm in low band and -6.4~-2.3 dBm in high band. The measured phase noise is -105.1 ‍dBc/Hz@1 MHz offset for the 24.482 GHz carrier, and -95.3 dBc/Hz@1 MHz offset for the 43.308 GHz carrier. The DC power consumption for the core circuit is 15.3~18.5 mW, and the core area is 0.198 mm². The corresponding FoM (Figure of Merit) and FoM_T for low (high) band is -181.3 dBc/Hz (-175.4 dBc/Hz), and -194.3 dBc/Hz (-188.3 dBc/Hz), respectively.

Select

PAPERS

Correlation Filtering Tracking Algorithm Based on Adaptive Aspect-Ratio

ZHONG Yu-bin, YANG Peng, DOU Lei

ACTA ELECTRONICA SINICA. 2024, 52(6): 2112-2122. https://doi.org/10.12263/DZXB.20230162

Abstract (1794) Download PDF (505) HTML (1702)

Knowledge map

Save

Due to the irregular deformation of target in the tracking process, it is unable to accurately estimate the target scale, while using the scale model with fixed aspect ratio. In this paper, we propose an aspect-ratio-based correlation filtering tracking algorithm to address this problem. Based on the fDSST (fast Discriminative Scale Space Tracking) algorithm, first train and learn an aspect-ration model to update the aspect ratio of the target, which could help to obtain a more accurate target scale. On this basis, this paper designs a smoothing correction scheme and an adaptive learning rate mechanism to alleviate the model drift and achieve more accurate tracking. The results of comparative experiments on OTB100, VOT2016 and VOT2018 datasets show that the proposed algorithm improves the performance of the baseline algorithm. Especially, the overall precision and success rate of the proposed algorithm on OTB100 are 9.6% and 6.2% higher than those of fDSST.

Select

PAPERS

Optimization and Capacitance Characteristics of 1 500 V Super Junction Power MOS Devices

CHONG Yi-ning, LI Jue, QIAO Ming

ACTA ELECTRONICA SINICA. 2024, 52(7): 2271-2278. https://doi.org/10.12263/DZXB.20230845

Abstract (1794) Download PDF (569) HTML (1736)

Knowledge map

Save

In this paper, the design of high-voltage super junction power MOS (Metal Oxide Semiconductor) device is carried out by using the semi-super junction structure, the super junction cell structure is designed based on the Sentaurus TCAD (Technology Computer Aided Design) simulation platform, and the breakdown voltage and on-resistance of the high-voltage super junction power MOS devices are optimized, and then the characteristics of parasitic capacitance are explored. Finally, based on multiple epitaxial processes, a high-voltage super junction power MOS device with a simulated breakdown voltage of 1 658 V, a process simulation breakdown voltage of 1 598 V and a specific on-resistance value of 303 mΩ·cm² has been independently designed, which reduced the specific on-resistance value by about 50% compared with the same withstand voltage device. At the same time, the influence of four main structural parameters, namely super junction doping concentration and thickness and voltage support layer doping concentration and thickness, on the parasitic capacitance characteristics of the device has been explored.

Select

PAPERS

Structure-Wise Feature Reconstruction for Hyperspectral Image Classification

XING Chang-da, WANG Mei-ling, XU Yong-chang, WANG Zhi-sheng

ACTA ELECTRONICA SINICA. 2024, 52(9): 3010-3022. https://doi.org/10.12263/DZXB.20230077

Abstract (1765) Download PDF (1201) HTML (1722)

Knowledge map

Save

Feature extraction is a key operation for hyperspectral image (HSI) classification. For current classification approaches, they usually ignore the information preservation and spatial distribution in feature extraction, which may export features with low information utilization and disordered distribution, generating unsatisfactory prediction results. To remedy such deficiencies, a novel method based on structure-wise feature reconstruction is proposed for the HSI classification. This method can reduce the information loss and improve the information preservation during the process of feature extraction. In addition, the distribution is also fully considered to enhance the discriminability and separability. In this proposed method, considering the reconstruction idea and the self-expression theory, a structure-wise feature reconstruction model is constructed to extract the features of the HSI, which can improve the information utilization of original information from the HSI and describe the structure reflecting the well-ordered distribution. Here, an optimization with alternative updating is presented to solve the above constructed model. The support vector machine is finally used to classify the extracted features and predict the labels of the HSI. The Salinas, Pavia Center, Botswana, and Houston datasets are used for experimental validation. Results show that the proposed method achieves the better classification performance compared with some state-of-the-art approaches, which is averagely higher 2.6%, 3.9%, 3.3% at OA (Overall Accuracy), AA (Average Accuracy), and Kappa indexes.

Select

PAPERS

Quantum Approximate Optimization Algorithm for Graph Partitioning

YUAN Zhi-qiang, YANG Si-chun, RUAN Yue, XUE Xi-ling, TAO Tao

ACTA ELECTRONICA SINICA. 2024, 52(6): 2025-2036. https://doi.org/10.12263/DZXB.20220784

Abstract (1713) Download PDF (642) HTML (1591)

Knowledge map

Save

Quantum approximate optimization algorithm (QAOA) is an algorithm framework for solving combinatorial optimization problems. It is regarded as one of the promising candidates to demonstrate the advantages of quantum computing in the near future. Within the QAOA framework, the symmetries of quantum states induced by the binary encoding scheme restrain the performance of QAOA. Inspired by the Dicke state preparation algorithm, we proposed a new encoding scheme that eliminated the symmetry of quantum states representing solutions. Beyond that, we also proposed a novel evolution operator, star graph (SG) mixer, and its corresponding SG algorithm. The quantum circuit implementation of the SG algorithm on IBM Q showed the SG algorithm has an average performance improvement of about 25.3% over the standard QAOA algorithm in solving the graph partitioning problem.

Select

PAPERS

Convex Solution for Target Localization in Passive MIMO Radar Using Delay, Doppler and Angle Measurements

YANG Jing, LIU Cheng-cheng, HUANG Jie, LI Xia

ACTA ELECTRONICA SINICA. 2024, 52(6): 2091-2102. https://doi.org/10.12263/DZXB.20221276

Abstract (1704) Download PDF (791) HTML (1611)

Knowledge map

Save

A convex-optimum localization algorithm based on semidefinite relaxation is proposed for moving target localization from time delay, Doppler shift and angle of arrival measurements in distributed multiple-input multiple-output radar. This algorithm alleviates the threshold effect that the positioning error deviates from the Cramer-Rao lower bound (CRLB) when the measurement error is large. First, the localization problem is formulated as a maximum likelihood estimation problem, which is reformulated as a weighted least squares problem with constraints by introducing auxiliary variables and then a convex semidefinite programming (SDP) problem by performing semidefinite relaxation. The SDP problem is solved efficiently by using the interior-point method to obtain the target position and velocity estimates. Since the local optimal solution of the convex optimization problem is the global optimal solution, the proposed algorithm has good global convergence. Simulation results demonstrate that the proposed algorithm approaches the CRLB, and achieves higher localization accuracy and robustness than existing algorithms at relatively large measurement noise levels.

Select

PAPERS

The Semi-Automatic Classification Data Labeling Method Based on Dispute About Weak Label

LI Zi-qiang, YANG Wei, YANG Xian-feng, LUO Lin

ACTA ELECTRONICA SINICA. 2024, 52(8): 2891-2899. https://doi.org/10.12263/DZXB.20230648

Abstract (1701) Download PDF (619) HTML (1652)

Knowledge map

Save

CSCD(1)

At present, deep active learning (DAL) in the classification data labeling work has achieved outstanding success. How to select samples to improve the performance of models is still a difficult problem in deep active learning. We proposes a semi-automatic classification data labeling method based on weak label dispute (Dispute about Weak Label-based Deep Active Learning, DWLDAL). The method iteratively selects samples that is difficult for model to distinguish, and manually annotate these sample. This method contains pseudo label generator and weak label generator, pseudo label generator is trained on accurately annotated datasets to generate pseudo label for unlabeled data; weak label generator is trained on random data subset with pseudo labels. Weak label generator committee are used to determine which unlabeled data is the most controversial and should be manually annotated. We conducted experimental validation on the common datasets IMDB (Internet Movie Database), 20NEWS (20NEWSgroup), and chnsenticorp (chnsenticorp_htl_all) to address the issue of text classification. Three different voting decision-making methods are evaluated from the perspective of the accuracy of data annotation and classification tasks. The F ₁ score of data annotation in DWLDAL method is 30.22%, 14.07% and 2.57% higher than that in the existing method Snuba, respectively. The F ₁ score of classification task in DWLDAL method is 1.01%, 22.72% and 4.83% higher than that in Snuba method, respectively.

Select

PAPERS

Efficient Multimodal Contribution Aware Network for Assessment of Microvascular Invasion in Hepatocellular Carcinoma

JIA Xi-bin, YU Gao-yuan, WANG Luo, DENG Yu-hui, YANG Da-wei, YANG Zheng-han

ACTA ELECTRONICA SINICA. 2024, 52(6): 2053-2066. https://doi.org/10.12263/DZXB.20220919

Abstract (1697) Download PDF (709) HTML (1570)

Knowledge map

Save

Microvascular invasion (MVI) is an important factor for early recurrence and poor long-term prognosis in patients with hepatocellular carcinoma (HCC) after resection or transplantation. Therefore, it is of great clinical value to evaluate whether MVI exists in patients with HCC before operation. In recent years, deep learning has provided a valuable solution for MVI image diagnosis and evaluation. Nevertheless, due to the difficulties of data annotation and collection, the current researches mostly use computed tomography (CT) or magnetic resonance imaging (MRI) methods to collect single modal sequences in images independently, which lacks the comprehensive application of multimodal sequences in various imaging methods. In order to make more effective use of multimodal data of CT and MRI images and improve diagnosis efficiency under few-shot scenarios, an efficient multimodal montribution aware network is proposed in this paper. The modality grouping convolution and efficient multimodal adaptive weighting module in this network are used to to learn the diagnostic contribution of each modal information of CT or MRI under complex and diverse MVI representation with little computational cost introduced. The experiment is carried out on the clinical dataset collected by the third-class hospital. Result show that with the support of a small amount of labeled data,our method can achieve better MVI diagnostic performance than many deep neural networks based on attention mechanism,which provides an effective reference for professional doctors’ diagnostic analysis.

Select

PAPERS

Differential Privacy-Based Double Energy Auction Privacy-Preserving on Consortium Blockchain

JIANG Shun-rong, SHI Kun, ZHOU Yong

ACTA ELECTRONICA SINICA. 2024, 52(9): 3023-3037. https://doi.org/10.12263/DZXB.20221299

Abstract (1695) Download PDF (1334) HTML (1678)

Knowledge map

Save

Micro-grid is a distributed small-scale power generation and distribution system, which has realized the circular flow of electricity through adjacent energy trading according to the different needs of prosumers. In order to develop optimal price and transaction strategies in energy trading of micro-grid, we proposed a double sealed bid (DSB) auction scheme according to the characteristics of consortium blockchain. Except met key economic properties (individual rationality, budget balance, and so on), this scheme would determine the final winner based on the users' offers, bids, volumes, average price and other factors. In the meanwhile, in order to protect the personal privacy of users in the auction process, we proposed the blockchain-based differential privacy (BDP) algorithm based on the differential privacy theory and the characteristics of the DSB auction scheme, which was satisfied with differential privacy demands and mean validity through privacy analysis and data validity analysis. Finally, we applied the BDP algorithm to the DSB auction scheme and realized a safe and efficient double energy auction privacy-preserving scheme—differential privacy-based double auction on blockchain (DPDAB), which not only developed the optimal price and transaction strategy but also protected the users' privacy in the process of auction. In addition, we analyzed the influence of the BDP algorithm on auction data and the data computation time overhead on the auction scheme through experiments, and proved the validity of the DPDAB scheme in terms of average benefit, user satisfaction and social welfare through comparative experiments.

Select

PAPERS

Research on the Deep Learning Method Based on Data Feature Relevance and Adaptive Differential Privacy

KANG Hai-yan, WANG Xiao-shi

ACTA ELECTRONICA SINICA. 2024, 52(6): 1963-1976. https://doi.org/10.12263/DZXB.20220892

Abstract (1659) Download PDF (1262) HTML (1536)

Knowledge map

Save

In the deep learning privacy protection based on differential privacy, the length of the training period and the allocation of the privacy budget directly restrict the utility of the deep learning model. In the existing methods of deep learning combined with differential privacy, the model training cycle is limited and the budget allocation of a large number of feature privacy is unreasonable, which leads to poor security and availability of the model. We propose a method of deep learning methods based on data feature relevance and adaptive differential privacy (RADP). First, the method uses the layer-by-layer correlation propagation algorithm to calculate the average correlation of each feature parameter and the output result on the original data set on the pre-trained model and uses the information entropy-based method to calculate the average correlation of each feature parameter. According to the privacy metric, the Laplace noise is adaptively added to the average correlation; on this basis, according to the average correlation of each feature parameter, the privacy budget is allocated reasonably, Laplace noise is added to the feature parameters; finally, theoretical analysis shows that the method proposed in this paper satisfies $ε$ -differential privacy and take into account security and availability. Based on the experimental results on 3 real datasets MNIST, Fashion-MNIST, and CIFAR-10, the accuracy and average loss of RADP are better than those of the AdLM (Adaptive Laplace Mechanism) method,the DPSGD (Differential Privacy with Stochastic Gradient Descent) method and the DPDLIGDO (Differentially Private Deep Learning with Iterative Gradient Descent Optimization) method. Moreover, the stability of RADP method can still be maintained well.

Select

PAPERS

Multi-Graph Learning Based on Structure-Aware

FU Dong-lai, GAO Ze-an

ACTA ELECTRONICA SINICA. 2024, 52(7): 2407-2417. https://doi.org/10.12263/DZXB.20230565

Abstract (1654) Download PDF (497) HTML (1578)

Knowledge map

Save

Multi-graph learning is a very important learning paradigm. Compared with multi-instance learning, in multi-graph learning, a bag represents an object, and each graph in the bag corresponds to a sub-object. This data representation method can express the structural information of sub-objects. However, existing multi-graph learning methods not only implicitly assume that the graphs in the bag satisfy independent and identical distribution, but also mostly adopt the technical idea of transforming multi-graph learning problems into multi-instance learning problems. This type of multi-graph learning method easily loses the structural information of the graph itself and the relationships between graphs. In response to the above problems, a multi-graph learning method based on structure awareness is proposed to effectively learn the structural information of the graph itself and the relationships between graphs. This method uses graph kernels to retain the structural information of the graph itself by calculating the similarity between graphs, expresses the structural information between graphs by generating bag-level graphs, and designs a bag encoder to effectively learn the structural information between graphs. Experimental results on the NCI(1), NCI(109), and AIDB datasets show that compared with existing methods, the proposed method improved by 5.97%, 3.44%, 4.48%, and 2.56% in accuracy, precision, F ₁ value, and AUC respectively. In terms of recall rate decreased by 2.12%.

Select

PAPERS

Sampling-Rate Offset Estimation for Wireless Acoustic Sensor Networks in Low SNR Environments

SHI Qing, YANG Fei-ran, CHEN Xian-mei, YANG Jun

ACTA ELECTRONICA SINICA. 2024, 52(6): 2131-2140. https://doi.org/10.12263/DZXB.20230339

Abstract (1654) Download PDF (617) HTML (1441)

Knowledge map

Save

The performance of existing sampling rate offset (SRO) estimation algorithms can be degraded significantly in low signal-to-noise ratio (SNR) conditions. To address this problem, we propose the frequency-sliding double-cross correlation processing (FS-DXCP) algorithm based on the subband secondary generalized cross-correlation function to estimate SRO. The proposed algorithm adopts a frequency-domain sliding window to construct the subband SGCC function matrix of the sensor signals. Then, by utilizing the singular value decomposition (SVD), we adaptively mitigate the influence of low SNR frequency bins on estimating secondary generalized cross-correlation functions. Finally, a higher precision SRO estimation is achieved by tracking the maximum point of the estimated SGCC function. Computer simulations show that the root mean squared error of the proposed method for sampling rate offset is 4.21 ppm when the SNR is -5 dB, which is about 8.17 ppm lower than that of the double-cross correlation processing with phase transform (DXCP-PHAT) algorithm. The proposed algorithm effectively improves the estimation accuracy of the SRO in low SNR conditions.

Select

SURVEY AND REVIEW

Research Progress of Two-Dimensional Electrical Conductivity and Field Effect Transistors of Diamond

ZHANG Jin-feng, ZHANG Jin-cheng, REN Ze-yang, SU Kai, HAO Yue

ACTA ELECTRONICA SINICA. 2024, 52(6): 2151-2160. https://doi.org/10.12263/DZXB.20240103

Abstract (1647) Download PDF (566) HTML (1543)

Knowledge map

Save

Diamond surface-channel field-effect transistor utilizes two-dimensional hole gas (2DHG) on the hydrogen-terminated diamond surface as the channel to realize the control on output current by input voltage, and it is the mainstream structure of diamond electronic devices. The 2DHG conductivity has a large range of controllable sheet density and a high saturation drift velocity. This paper reviewed the research progress of diamond field-effect transistors in DC, frequency, and power characteristics, and revealed that low mobility is the main limiting factor for the development of diamond-based low-power high-speed digital circuits, high-frequency devices, and high-power microwave devices. It summarized the theoretical and experimental research of a new doping mechanism similar to modulation doping that emerged for the diamond surface conductivity recently. At room temperature the 2DHG Hall mobility has increased to 680 cm²/Vs, and the relevant square resistance has decreased from about 10 kΩ/sq to 1.4 kΩ/sq, which is expected to cause a great improvement in the performance of diamond field-effect transistors.

Select

PAPERS

Organic-Light-Emitting-Diode-on-Silicon Micro-Display Based on Super Pixel Strategy

WANG Xin-rui, JI Yuan, ZHANG Yin, CHEN Hong-gang, MU Ting-zhou

ACTA ELECTRONICA SINICA. 2024, 52(7): 2291-2299. https://doi.org/10.12263/DZXB.20230049

Abstract (1621) Download PDF (471) HTML (1629)

Knowledge map

Save

Based on super pixel technology, a digital driven strategy for color silicon OLED (Organic Light Emitting Diode) micro-display is proposed. By reusing adjacent pixel information, a single pixel can be used for imaging multiple adjacent pixels to greatly improve the display resolution. A digital driving circuit for color OLEDoS (Organic Light Emitting Diode on Silicon) micro-display is designed. Under the condition of 120 Hz frame rate, 256 grey levels and 4K display resolution can be achieved while the circuit area and data transmission per second are only 50% of the traditional driving mode. The test results show that the average current range of OLED pixel realized by the driving circuit is 13.1 pA~3.74 nA, which can meet the demand of near-eye display of micro display.

Select

PAPERS

Knowledge Assisted Integrated Identification of Aerial Targets

CUI Yi-han, LIANG Yan, SONG Qian-qian, ZHANG Hui-xia, WANG Fan

ACTA ELECTRONICA SINICA. 2024, 52(9): 2961-2970. https://doi.org/10.12263/DZXB.20230440

Abstract (1610) Download PDF (1223) HTML (1547)

Knowledge map

Save

With the increasing complexity of modern battlefield environment and the upgrading of aviation equipment technology, massive multi-source heterogeneous sensor data inevitably appear inconsistent and incomplete problems. Traditional multi-sensor fusion method ignores sensor features correlation, and forms a closed data-driven recognition system of sensors. Whereas expert cognition, domain experience, attribute rules and other knowledge can instruct model construction and inference recognition of comprehensive target recognition in the form of expert experience, rule constraints and so on, this paper presents a method of knowledge assisted integrated identification of aerial targets. First of all, a military combat knowledge map of typical aerial target features is constructed, and key feature parameters are extracted to establish a target identification framework model. Then data basic trust assignment and evidence conflict credibility are constructed at recognition and decision recognition level respectively. Besides, time-domain fusion rules for high-conflict evidence is formulated to adjust timing fusion weights by using historical data. Finally, type recognition of multi-sensor is hierarchically realized through static reasoning and dynamic fusion. This study recognition accuracy is better than the existing algorithms in typical aerial target recognition tasks, demonstrating the effectiveness of the proposed algorithm.

Select

PAPERS

Parameter Estimation Algorithm for Bistatic FDA-MIMO Radar Based on Tensor Framework

GUO Yue-hao, WANG Xian-peng, LAN Xiang, SU Ting

ACTA ELECTRONICA SINICA. 2024, 52(6): 2103-2111. https://doi.org/10.12263/DZXB.20230172

Abstract (1604) Download PDF (512) HTML (1520)

Knowledge map

Save

Frequency diversity array (FDA) radar was proposed by Antonik and Wicks in 2006. Since there is a frequency offset between each adjacent antenna of FDA radar, there exists two-dimensional dependence on range and angle in the transmitting array. For bistatic FDA-multiple input multiple output (MIMO) radar, direction of departure (DOD)- direction of arrival (DOA)-range information is coupled in the transmitting steering vector. How to decouple the three information has become the focus of research. In this paper, aiming at the problem of target parameter estimation of bistatic FDA-MIMO radar, a reduced-dimension multiple signal classification (RD-MUSIC) parameter estimation algorithm based on tensor framework is proposed. Firstly, in order to decouple the DOD and range information in the transmitting array, it is necessary to divide the transmitting array into subarrays. Then the signal subspace is obtained by high-order-singular value decomposition, and the two-dimensional spatial spectral function is constructed. Secondly, the dimension of spatial spectrum is reduced by Lagrange algorithm, so that it is only related to DOA, and the DOA estimation is obtained. Then the frequency increment between subarrays is used to decouple the DOD and range information, and eliminate the phase ambiguity at the same time. Finally, the DOD and range estimation automatically matched with DOA estimation are obtained. The proposed algorithm uses the multidimensional structure of high-dimensional data to improve the estimation accuracy. Meanwhile, the proposed RD-MUSIC algorithm can effectively reduce the computational complexity. Numerical experiments show the superiority of the proposed algorithm.

Select

PAPER

DRE-3DC: Document-Level Relation Extraction with Three-Dimensional Representation Combination Modeling

WANG Yu, WANG Zhen, WEN Li-qiang, LI Wei-ping, ZHAO Wen

ACTA ELECTRONICA SINICA. 2024, 52(9): 2950-2960. https://doi.org/10.12263/DZXB.20221187

Abstract (1599) Download PDF (161) HTML (1591)

Knowledge map

Save

The task of document-level relation extraction aims to extract facts from multiple sentences of unstructured documents, which is a key step in the construction of domain knowledge graph and knowledge answering application. The task requires that the model not only capture the complex interactions between entities based on the structural features of documents, but also deal with the serious long-tail category distribution problem. Existing table-based relation extraction models try to solve this issue, but they mainly model documents in two-dimensional “entity/entity” space, and use multi-layer convolutional network or restricted self-attention mechanism to extract the interaction features between entities, which cannot avoid the influence of category overlap and capture the directional features of relationships, resulting in the lack of decoupled semantic information of interaction. For the above challenges, this paper proposes a new document-level relation extraction model, named DRE-3DC (Document-Level Relation Extraction with Three-Dimensional Representation Combination Modeling), in which the “entity/entity” modeling extend to the form of three-dimensional “entity/entities/relationship” modeling method. Based on the deformable convolution in triple attention mechanism, the model effectively distinguishes and integrates the interaction features under different semantic space and adaptively captures the document structural features. At the same time, we propose a multi-task learning method to enhance the perception of relation category combination of documents to alleviate the long-tail distribution problem. The experimental results reveal better score on DocRED and Revisit-DocRED dataset respectively. The effectiveness of the proposed method was verified by ablation experiment, comparative analysis and example analysis.

Select

PAPERS

Experimental Comparative Study on Gain and Noise Characteristics of All-Fiber Few-Mode Amplifier Under Different Pumping Methods

ZHANG Xin-yi, FANG Yi-hong, HUANG Xi-heng, ZENG Yan, QIN Yu-wen, XU Ou, LI Jiang-ping

ACTA ELECTRONICA SINICA. 2024, 52(6): 2074-2082. https://doi.org/10.12263/DZXB.20230078

Abstract (1576) Download PDF (659) HTML (1500)

Knowledge map

Save

An all-fiber few-mode erbium-doped amplifier was built to compare the effects of different pump modes and different pump directions on the gain characteristics of the three signal modes, LP₀₁, LP_11a, and LP_11b. The experimental results show that the amplifier has the best performance under forward LP₁₁ pumping. The signal gain is more than 20 dB, the differential modal gain (DMG) is less than 0.9 dB and the noise figure is less than 9.6 dB in the whole C band. At a signal input power of -10 dBm/mode, the gain of all three signal modes at 1 550 nm exceeds 20.8 dB, the DMG is as low as 0.3 dB, the noise figure of LP₀₁ signaling light is lower than 6.2 dB, and the noise figure of LP₁₁ signaling light is lower than 9.6 dB. Comparing the different pumping directions under the four pumping schemes, it can be found that the noise figure of the forward-pumped amplifier is the smallest, but the gains of the three signal modes are also smaller, while the gain of the higher-order signal modes is increased by using the backward-pumped one, but the noise figure will also become larger. Comparing the pumping modes, it can be found that compared with the LP₀₁ pumping, the LP₁₁ pumping can significantly increase the gain of the LP₁₁ signaling light, and has less effect on the gain of the LP₀₁ signaling light, which can reduce the DMG value.

Select

PAPERS

Multimodal Cooperative Traffic Flow Prediction Model Based on Error Compensation

WU Yu-xuan, YU Hui-qun, FAN Gui-sheng

ACTA ELECTRONICA SINICA. 2024, 52(8): 2878-2890. https://doi.org/10.12263/DZXB.20230523

Abstract (1574) Download PDF (802) HTML (1532)

Knowledge map

Save

Since the traffic flow is affected by multiple factors such as periodic characteristics and unexpected conditions, the prediction accuracy of existing models cannot satisfy the practical requirements. Under this background, this paper proposes a multimodal collaborative traffic flow prediction model based on error compensation (MCEC). To address the problem that traditional prediction models cannot take account of time series and covariates, this paper proposes a feature expansion method based on wavelet analysis, which introduces a clustering algorithm to obtain holiday labeling features, and uses congestion index, traffic accident map, and weather information as expanded features, and decomposes them on multiple scales. In the training phase, a multimodal collaborative model training was designed by adopting ARIMA (AutoregRessive Integrated Moving Average) model, LSTM (Long-Short-Term Memory network), a restricted dynamic time regularization technique, and a self-attentive mechanism to achieve the effect of fully learning each part of the data and optimally matching the model. In the error compensation stage, the obtained corresponding process values are input into the error compensation module based on SVR (Support Vector Regression) to learn and compensate the errors of each component, and reconstruct the prediction results. The MCEC is validated using a publicly available real highway data set. The results of a large number of comparison experiments at multiple time intervals show that the MAPE (Mean Absolute Percentage Error) of MCEC in traffic flow prediction reaches 17.02%,which has a higher prediction accuracy than other prediction models such as LSTM-SVR, ConvLSTM (Convolutional Long Short-Term Memory network), ST-GCN (Spatial Temporal Graph Convolutional Networks), MFFB (Multi-stream Feature Fusion Block), Transformer, indicating the validity and reasonableness of the MCEC model.

Select

PAPERS

Any-to-Any Voice Conversion Using Double Exchange Representation Separation

ZHANG Zi-xu, JIAN Zhi-hua

ACTA ELECTRONICA SINICA. 2024, 52(6): 2141-2150. https://doi.org/10.12263/DZXB.20230246

Abstract (1571) Download PDF (335) HTML (1457)

Knowledge map

Save

In any-to-any voice conversion, the encoder was usually utilized to disentangle the same speaker’s speech and then the decoder was used for self-reconstruction in the training phase, but the decoder in the conversion phase coupled the content information of source speech and the personality characteristics of target speech. Therefore, there existed performance mismatch between the decoder in the conversion phase and the training phase, which deteriorated the performance of voice conversion. This paper proposed a voice conversion method named DERS-VC (Double Exchange Representation Separation Voice Conversion) using double exchange representation separation. In self-reconstruction process of the training phase, the proposed method applied the speech of the same speaker to simulate the voice of different target speakers for self-supervised training. Meanwhile, the conversion invariance loss and the cycle consistency loss were introduced, and the cycle process of separation was conducted by double exchange representation separation to make the self-reconstructed speech closer to the original speech. The experimental results demonstrated that DERS-VC had an average reduction of 4.03% in MCD (Mel-Cepstral Distortion), and had an increment of 3.62% in MOS (Mean Opinion Score), compared with the AGAIN-VC (Activation Guidance and Adaptive Instance Normalization Voice Conversion) method, and the quality and similarity of converted speech both had been improved. This shows that the method of double exchange representation separation can decrease the mismatch of the decoder and improve the performance of any-to-any voice conversion.

Select

PAPERS

A High-Performance Model Predictive Control Strategy Based on Si/SiC Cascaded H-Bridge Inverter

GUO Zi-yue, QUAN Hui-min, PENG Zi-shun, DAI Yu-xing

ACTA ELECTRONICA SINICA. 2024, 52(9): 3000-3009. https://doi.org/10.12263/DZXB.20230094

Abstract (1569) Download PDF (1215) HTML (1501)

Knowledge map

Save

Si/SiC cascaded H-bridge inverters enable a combination of different devices to ensure low output current total harmonic distortion (THD) and high device efficiency. However, this also presents the challenge of switching and assigning Si/SiC cells. In this paper, a model predictive control (MPC) with variable weight is designed to select the total switch state and assign the cell switch combination. In this method, a variable weight based on the switching loss of the device is introduced into the cost function of selecting the total switching state of the inverter and the switching combination of Si/SiC cells, to improve the efficiency and output current harmonic distortion rate of the inverter. The effectiveness of variable-weight MPC is verified on the five-level Si/SiC cascaded H-bridge inverter device, and the output current THD is reduced by up to 2.05% and the device loss is reduced by up to 4.53% compared with the fixed-weight MPC.

Select

PAPERS

Lightweight Continuous Authentication Protocol for Vehicles in Vehicular Networks

ZOU Guang-nan, YOU Qi-di, JIN Xing-hu, MA Yong-chun, LI Jie-yu

ACTA ELECTRONICA SINICA. 2024, 52(6): 1903-1910. https://doi.org/10.12263/DZXB.20230661

Abstract (1565) Download PDF (815) HTML (1477)

Knowledge map

Save

Cloud-edge computing for the Internet of vehicle (CEIoV) can support real-time access and service requests of large-scale vehicles. In order to ensure the security of its internal resources, vehicle identity usually needs to be validated before it can access CEIoV. However, because the vehicle itself is in the running state and moreover its computing, storage and communication resources are limited, the existing identity authentication protocol cannot be directly applied to authenticate a running vehicle in CEIoV. Therefore, this paper proposes a lightweight continuous authentication (LCA) protocol to realize vehicle authentication and guarantee the security of CEIoV internal resources. LCA is designed based on chameleon Hash function, whose implementation requires simple cryptographic operations and is easy to be deployed on the resource-limited devices. By using random oracle model, the semantic security of LCA is proved strictly. At the same time, the experimental results show that LCA has lower computational and communication costs in the continuous authentication process compared with prior schemes.

Select

PAPERS

Dual-Layer Federated Learning Based Edge Collaborative Computing Mechanism for High Dynamic Internet of Vehicle Businesses

XU Si-ya, GUO Jia-hui

ACTA ELECTRONICA SINICA. 2024, 52(7): 2228-2241. https://doi.org/10.12263/DZXB.20230065

Abstract (1558) Download PDF (1696) HTML (1483)

Knowledge map

Save

As an emerging distributed machine learning architecture, federated learning (FL) allows multiple users to train local models and achieve global aggregation of models with data privacy protection, thus providing reliable Internet of Vehicle (IoV) services. However, in the training process of FL, many training terminals may switch among domains due to the high mobility, resulting in low accuracy of the global model. Besides, malicious terminals may frequently upload invalid or incorrect model data which leads to low service reliability. Therefore, we build the dual-layer FL based edge collaborative computing mechanism for high dynamic IoV businesses. Firstly, we comprehensively consider the mobility, computing ability and reliability to construct the service capability model for the terminal, and then propose the edge collaborative computing domain (ECCD) construction algorithm based on deep reinforcement learning. By clustering the vehicle terminals covered by multiple edge nodes, the switching probability of the terminal local model will be reduced, and the sustainability of the FL model training can be guaranteed. Furthermore, we design a dual-layer FL framework including the inter-ECCD aggregation layer and cross-ECCD aggregation layer, respectively. It adopts the semi-asynchronous aggregation mechanism for local models based on the adaptive aggregation factor in the inter-ECCD aggregation layer, and the asynchronous aggregation mechanism for domain’s regional model based on data volume in the cross-ECCD aggregation layer, which jointly improve the aggregation efficiency of the FL system. In particular, considering that the high speed terminals may cause the cross-domain problem, we introduce the partial conditional update mechanism for the local model to avoid the situation that the high-quality models are covered by the low-quality models, which further improves the accuracy of the global model and the utilization of FL system resources. The simulation results verify that the proposed framework outperforms the local computing and asynchronous/synchronous FL algorithms in terms of model accuracy and service reliability.

Select

PAPERS

Detection of Alzheimer's Disease Based on dVAE-BERT Model

CHEN Xu-chu, PU Yu, ZHANG Wei-qiang

ACTA ELECTRONICA SINICA. 2024, 52(9): 2971-2978. https://doi.org/10.12263/DZXB.20230050

Abstract (1550) Download PDF (1217) HTML (1468)

Knowledge map

Save

Alzheimer's disease (AD) is a neurodegenerative disease that causes symptoms such as aphasia and decreased speech fluency. Researchers have used articulatory features, paralinguistic features such as fluency and pauses, or features extracted from transcribed text to detect Alzheimer's disease. However, traditional acoustic feature detection methods are difficult to obtain semantic information, while transcribing speech into text is time-consuming and laborious, and the quality of transcription is significantly degraded due to the effects of accent and disease in the elderly. In this paper, we propose a dVAE-BERT (discrete Variational Autoencoders-Bidirectional Encoder Representations from Transformers) model, which uses discrete Variational Autoencoders (dVAE) to convert speech into pseudo-phoneme sequences, and then uses the Bidirectional Encoder Representations from Transformers (BERT) model to model the connection relations of the pseudo-phoneme sequences to extract the representation of audio in the language dimension. The accuracy of the model on the ADReSSo (Alzheimer's Dementia Recognition through Spontaneous Speech only) dataset is 70.42%, which is 5.63% better than the baseline system, and its accuracy is 76.06% and 71.83% after fusion with Wav2vec2.0 and Hidden-unit BERT (HuBERT) models, respectively.

Select

PAPERS

Image Classification Network of Gating Mechanism

JIANG Wen-tao, GAO Yuan, YUAN Heng, LIU Wan-jun

ACTA ELECTRONICA SINICA. 2024, 52(7): 2393-2406. https://doi.org/10.12263/DZXB.20240104

Abstract (1541) Download PDF (960) HTML (1502)

Knowledge map

Save

To extract more expressive and discriminative key features, reduce the loss of key features during network transmission, and improve the image classification ability of neural networks, a new image classification network of gating mechanism (GMNet) is proposed. Firstly, the shallow features are extracted using gated convolution, and the convolution operation is selectively performed through the gating mechanism to improve the network's ability to extract key features of the original image. Secondly, an interpolation gated convolution (IGC) module is designed, which combines Lanczos interpolation with gated convolution to enhance shallow features while extracting more discriminative features, improving the non-linear expression ability of features. Then, a large kernel gated attention mechanism (LGAM) module is designed, which combines large kernel attention with gated convolution to achieve selective enhancement and fusion of features, and improve the contribution of key region features. Finally, the large kernel gated attention mechanism module is embedded into the residual branch to enable the model to learn input data's features and contextual information more effectively, reduce the loss of key features during network information transmission, and improve the network's classification ability. The method achieved classification accuracy of 97.05%, 83.68%, 97.68%, 90.60%, and 83.05% on image datasets CIFAR-10, CIFAR-100, SVHN, Imagenette, and Imagewoof, respectively, and improved on average by 3.26%, 7.08%, 3.44%, 2.65%, and 5.02% compared to current advanced methods. Compared with existing mainstream network models, the gated mechanism image classification network proposed in this paper can enhance the non-linear expression ability of features, extract more expressive and discriminative vital features, the loss of key features, improve the contribution of key region features, and effectively improve the image classification ability of neural networks.

Select

PAPERS

YOLO-POD: High-Precision PCB Tiny-Defect Detection Algorithm Based on Multi-Dimensional Attention Mechanism

GUO Yan, WANG Zhi-wen, ZHAO Run-xing

ACTA ELECTRONICA SINICA. 2024, 52(7): 2515-2528. https://doi.org/10.12263/DZXB.20230772

Abstract (1532) Download PDF (753) HTML (1391)

Knowledge map

Save

With the widespread application of electronic devices, printed circuit boards (PCB) hold significant importance in the electronics manufacturing industry. However, due to imperfections in the manufacturing process and interference from environmental factors, tiny defects may in PCB. Therefore, the development of efficient and accurate defect detection algorithms is crucial in ensuring product quality. To address the challenge of detecting tiny defects on PCB, this paper proposes a high-precision PCB tiny defect detection algorithm based on multi-dimensional attention mechanism. To reduce model parameters and computational complexity, partial convolution (PConv) is introduced, and the ELAN module is redesigned as the more efficient P-ELAN. Additionally, to enhance the network’s feature extraction capability for tiny defects, the omni-dimensional dynamic convolution (ODConv) based on the multi-dimensional attention mechanism (MDAM) is introduced. By combining partial convolution, the POD-CSP (Partial ODConv-Cross Stage Partial) and POD-MP (Partial ODConv-Max Pooling) cross-stage partial network modules are designed, along with the OD-Neck structure. Finally, based on YOLOv7, a more efficient YOLO-POD model for small object detection is proposed, and the network is optimized during the training phase using a novel loss function called Alpha-SIoU. Experimental results demonstrate that YOLO-POD achieves a detection precision of 98.31% and recall rate of 97.09%, exhibiting substantial advantages across multiple metrics. Notably, it achieves a 28% improvement over the original YOLOv7 model, as to more stringent mAP75 metric. These results validate the high accuracy and robustness of YOLO-POD in PCB defect detection, fulfilling the requirements for high-precision detection and providing an effective detection solution for the PCB manufacturing industry.

Select

PAPER

GAT-IL: A Service Function Chain Deployment Method Based on Graph Attention Network and Imitation Learning

FAN Qi-lin, NIU Yue, YIN Hao, WANG Tian-fu, LI Xiu-hua, HAO Jin-long

ACTA ELECTRONICA SINICA. 2024, 52(8): 2811-2823. https://doi.org/10.12263/DZXB.20221180

Abstract (1523) Download PDF (323) HTML (1460)

Knowledge map

Save

Network function virtualization simplifies the configuration and management of network services by migrating network functions from dedicated hardware devices to software middleboxes running on commercial servers. Under the environment of network function virtualization, the service function chain (SFC) composed of a series of ordered virtual network functions is becoming a mainstream alternative to host network services. The SFC deployment problem is to allocate the underlying physical network resources to the requirements of service function chains. It is challenging for infrastructure providers to obtain long-term high returns under limited resources. In this paper, we formally define the problem of SFC deployment and propose a novel method named graph attention network and imitation learning (GAT-IL) based on graph attention (GAT) network and imitation learning for SFC deployment. This method utilizes GAT to evaluate the potentials of each physical server, provides expert demonstrations through the Monte Carlo tree search algorithm, applies imitation learning to train the agent, and integrates the beam search strategy to optimize the solution space. Extensive experimental results show that the GAT-IL method proposed in this paper outperforms existing representative algorithms on performance metrics of average revenue-to-cost ratio and acceptance rate.

Select

PAPERS

Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance

HUANG Jun-yang, CHEN Hong-hui, WANG Jia-bao, CHEN Ping-ping, LIN Zhi-jian

ACTA ELECTRONICA SINICA. 2024, 52(7): 2262-2270. https://doi.org/10.12263/DZXB.20240090

Abstract (1516) Download PDF (1032) HTML (1462)

Knowledge map

Save

CSCD(1)

Scene text image super-resolution (STISR) aims to enhance the resolution and legibility of text in low-resolution images. In cases of spatial deformation or low-resolution text images, the lack of details in text regions and the difficulty in aligning semantic cues and visual features with character position make it difficult to recognize text effectively. In order to address these challenges, this paper proposes a perceiving multi-domain character distance for scene text image super-resolution method (PMDC), which improves the image text region and edge texture details. Firsly, the visual and semantic features are extracted by using the asymmetric convolution module along with the semantic prior module. Then the enhanced position coding is obtained by the character distance perception module to perceive the distance change and semantic similarity between characters. Finally, the guiding cues and visual features are combined to restructure the pixels and generate a super-resolution text image. In comparison to TATT, experimental results from the public dataset TextZoom showed an increase of 0.11 dB in the fidelity of the peak signal-to-noise ratio index. This improvement effectively enhances the clarity of the text area and the detailed edge texture. Additionally, the recognition accuracy was improved by 1.4%, which effectively enhances the readability of the text image.

Select

PAPERS

A Low Profile, Wideband and Dual-Polarized Base Station Antenna Based on the Dipole with High Input Impedance

HUANG He, MA Rui-hua

ACTA ELECTRONICA SINICA. 2024, 52(7): 2300-2306. https://doi.org/10.12263/DZXB.20240018

Abstract (1512) Download PDF (726) HTML (1461)

Knowledge map

Save

In this paper, a wideband, dual-polarized antenna with extremely low profile is developed for base station application. The antenna evolved from two fan-shaped dipoles that crossed each other. By adding annular branches and metallized through holes at the end of the dipole, its port input impedance increases when the antenna occupies a lower height. Besides, the flare angle of the fan-shaped arm is increased so that a second resonant point can be generated to achieve the purpose of expanding the bandwidth. The dual-polarized antenna can provide a bandwidth of 22% in the 2.17~2.7 GHz band. Because the two dipoles is highly symmetrical about the geometric center, the isolation degree and cross polarization discrimination are high in the working frequency band, among which the simulation value of the isolation degree can reach 51 dB, and the simulation value of the cross polarization discrimination in the 0° can reach 48 dB. In addition, the simulated peak gain of the antenna is as high as 9.6 dBi. The antenna has the advantages of high isolation, high cross-polarization discrimination and high gain, and has a good application prospect in the base station system.

Most accessed

Please choose a citation manager

Content to export

模态框（Modal）标题

Most accessed

Please choose a citation manager

Content to export