1.重庆大学微电子与通信工程学院,重庆 400030
2.重庆广播电视大学,重庆 400052
[ "张小恒 男,1980年生,四川达州人.博士研究生,副教授.主要研究领域为医学信号处理、机器学习. E-mail:7818320@qq.com" ]
[ "张馨月 女,1996年生,四川泸县人.硕士研究生.主要研究领域为医学信号处理. E-mail:1029323666@qq.com" ]
[ "李勇明(通信作者) 男,1976年生,四川绵阳人.博士,教授、博士生导师.主要研究领域为医学信号处理、机器学习. E-mail:yongmingli@cqu.edu.cn" ]
收稿:2020-09-10,
修回:2021-02-24,
纸质出版:2022-01-25
移动端阅览
张小恒,张馨月,李勇明等.面向帕金森病语音诊断的非监督两步式卷积稀疏迁移学习算法[J].电子学报,2022,50(01):177-184.
ZHANG Xiao-heng,ZHANG Xin-yue,LI Yong-ming,et al.An Unsupervised Two-Step Convolution Sparse Transfer Learning Algorithm for Parkinson’s Disease Speech Diagnosis[J].ACTA ELECTRONICA SINICA,2022,50(01):177-184.
张小恒,张馨月,李勇明等.面向帕金森病语音诊断的非监督两步式卷积稀疏迁移学习算法[J].电子学报,2022,50(01):177-184. DOI: 10.12263/DZXB.20201003.
ZHANG Xiao-heng,ZHANG Xin-yue,LI Yong-ming,et al.An Unsupervised Two-Step Convolution Sparse Transfer Learning Algorithm for Parkinson’s Disease Speech Diagnosis[J].ACTA ELECTRONICA SINICA,2022,50(01):177-184. DOI: 10.12263/DZXB.20201003.
帕金森病(Parkinson’s Disease,PD)语音诊断存在小样本问题,如果借助相关语音数据集进行迁移学习,容易加重训练集和测试集之间的分布差异,影响分类准确率.为了解决上述矛盾问题,本文提出了两步式稀疏迁移学习算法.该算法分为两大步:第一步算法为语音段特征同时优选的快速卷积稀疏编码算法,构造卷积稀疏编码算子用于快速学习公共语音数据集的结构信息,然后将其迁移到PD语音目标集以弥补后者样本信息的不足,接着再同时对语音段和特征进行同时优选以获得更有价值的信息;第二步算法为联合局部结构信息分布对齐算法,对训练集和测试集进行域适应,在保持各自样本结构信息的同时,最小化分布误差.实验结果表明:本文算法中每一步迁移学习算法均有效;与相关算法相比,本文算法准确率显著较高,达97.5%.
Parkinson's disease(PD) speech diagnosis has a small sample problem. Although it is possible to transfer learning with the help of relevant speech datasets. The introduction of other samples will lead to the distribution difference between samples of different subjects
so the classification accuracy is greatly affected. Therefore
in this paper
to solve the problems above
we propose a novel unsupervised two-step convolutional sparse transfer leaning algorithm. The algorithm is divided into two steps: fast convolutional sparse coding with coordinate selection of samples and features(FCSC&SF)
joint local structure distribution alignment(JLSDA). In the FCSC&SF
speech structure among public speech dataset is quickly learned by fast convolution sparse coding(FCSC)
and transferred into the target dataset
after that
the more valuable information is obtained by coordinate selection of samples and features. JLSDA is designed to maintain the local structure information in the two domains
and reduce the distribution difference between the two domains at the same time. The experimental results showed that each step of the proposed algorithm has a positive effect on the classification results; compared with the representative relevant algorithms
the accuracy of the proposed method is significantly higher at 97.5%.
MIRARCHI D , VIZZA P , TRADIGO G , et al . Signal analysis for voice evaluation in Parkinson's disease [C]// 2017 IEEE International Conference on Healthcare Informatics (ICHI) . Park City : IEEE , 2017 : 530 - 535 .
GILLIVAN-MURPHY P , MILLER N , CARDING P . Voice tremor in Parkinson's disease: An acoustic study [J]. Journal of Voice , 2019 , 33 ( 4 ): 526 - 535 .
ZOU N , HUANG X . Empirical bayes transfer learning for uncertainty characterization in predicting Parkinson's disease severity [J]. IISE Transactions on Healthcare Systems Engineering , 2018 , 8 ( 3 ): 209 - 219 .
SAKAR B E , ISENKUL M E , SAKAR C O , et al . Collection and analysis of a Parkinson speech dataset with multiple types of sound recordings [J]. IEEE Journal of Biomedical and Health Informatics , 2013 , 17 ( 4 ): 828 - 834 .
NASEER A , RANI M , NAZ S , et al . Refining Parkinson's neurological disorder identification through deep transfer learning [J]. Neural Computing and Applications , 2020 , 32 ( 3 ): 839 - 854 .
AL-FATLAWI A H , JABARDI M H , LING S H . Efficient diagnosis system for Parkinson's disease using deep belief network [C]// 2016 IEEE Congress on Evolutionary Computation (CEC) . Vancouver : IEEE , 2016 : 1324 - 1330 .
AVCI D , DOGANTEKIN A . An expert diagnosis system for Parkinson disease based on genetic algorithm-wavelet kernel-extreme learning machine [J]. Parkinson's Disease , 2016 , 2016 : 5264743 .
CAI Z N , GU J H , CHEN H L . A new hybrid intelligent framework for predicting Parkinson's disease [J]. IEEE Access , 2017 , 5 : 17188 - 17200 .
CAESARENDRA W , ARIYANTO M , SETIAWAN J D , et al . A pattern recognition method for stage classification of Parkinson's disease utilizing voice features [C]// 2014 IEEE Conference on Biomedical Engineering and Sciences (IECBES) . Kuala Lumpur : IEEE , 2014 : 87 - 92 .
OZKAN H . A comparison of classification methods for telediagnosis of Parkinson's disease [J]. Entropy , 2016 , 18 ( 4 ): 115 .
BENBA A , JILBAB A , HAMMOUCH A . Hybridization of best acoustic cues for detecting persons with Parkinson's disease [C]// 2014 Second World Conference on Complex Systems (WCCS) . Agadir : IEEE , 2014 : 622 - 625 .
NARANJO L , PÉREZ C J , CAMPOS-ROCA Y , et al . Addressing voice recording replications for Parkinson's disease detection [J]. Expert Systems with Applications , 2016 , 46 : 286 - 292 .
HIRSCHAUER T J , ADELI H , BUFORD J A . Computer-aided diagnosis of Parkinson's disease using enhanced probabilistic neural network [J]. Journal of Medical Systems , 2015 , 39 ( 11 ): 179 .
DAS D , LEE C S G . Sample-to-sample correspondence for unsupervised domain adaptation [J]. Engineering Applications of Artificial Intelligence , 2018 , 73 : 80 - 91 .
ZHANG H , PATEL V M . Convolutional sparse and low-rank coding-based image decomposition [J]. IEEE Transactions on Image Processing , 2018 , 27 ( 5 ): 2121 - 2133 .
KONONENKO I . Estimating attributes: Analysis and extensions of RELIEF [C]// European conference on machine learning . Catania : Springer , 1994 : 171 - 182 .
PAN S J , TSANG I W , KWOK J T , et al . Domain adaptation via transfer component analysis [J]. IEEE Transactions on Neural Networks , 2011 , 22 ( 2 ): 199 - 210 .
赵鹏 , 王美玉 , 纪霞 , 等 . 基于张量表示的域适配的迁移学习中特征表示方法 [J]. 电子学报 , 2020 , 48 ( 2 ): 359 - 368 .
ZHAO P , WANG M Y , JI X , et al . A novel feature representation based on tensor and domain adaption for transfer learning [J]. Acta Electronica Sinica , 2020 , 48 ( 2 ): 359 - 368 . (in Chinese)
BOYD S , PARIKH N . Distributed optimization and statistical learning via the alternating direction method of multipliers [J]. Foundations and Trends in Machine Learning , 2010 , 3 ( 1 ): 1 - 122 .
SOREL M , SROUBEK F . Fast convolutional sparse coding using matrix inversion lemma [J]. Digital Signal Processing , 2016 , 55 ( 1 ): 44 - 51 .
CAI X J , GU G Y , HE B S , et al . A proximal point algorithm revisit on the alternating direction method of multipliers [J]. Science China Mathematics , 2013 , 56 ( 10 ): 2179 - 2186 .
HE X , NIYOGI P . Locality preserving projections [C]// Proceedings of Conference on Advances in Neural Information Processing Systems(NIPS) . Vancouver and Whistler : NIPS foundation , 2004 : 153 - 160 .
CANTURK I , KARABIBER F . A machine learning system for the diagnosis of Parkinson's disease from speech signals and its application to multiple speech signal types [J]. Arabian Journal for Science and Engineering , 2016 , 41 ( 12 ): 5049 - 5059 .
ZHANG H H , YANG L , LIU Y , et al . Classification of Parkinson's disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples [J]. Biomedical Engineering Online , 2016 , 15 ( 1 ): 122 - 143 .
LI Y M , ZHANG C , JIA Y J , et al . Simultaneous learning of speech feature and segment for classification of Parkinson disease [C]// 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom) . Dalian : IEEE , 2017 : 1 - 6 .
BENBA A , JILBAB A , HAMMOUCH A . Using human factor cepstral coefficient on multiple types of voice recordings for detecting patients with Parkinson's disease [J]. IRBM , 2017 , 38 ( 6 ): 346 - 351 .
BENBA A , JILBAB A , HAMMOUCH A . Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson's disease and healthy people [J]. International Journal of Speech Technology , 2016 , 19 ( 3 ): 449 - 456 .
ALI L , ZHU C , ZHANG Z H , et al . Automated detection of Parkinson's disease based on multiple types of sustained phonations using linear discriminant analysis and genetically optimized neural network [J]. IEEE Journal of Translational Engineering in Health and Medicine , 2019 , 7 : 1 - 10 .
0
浏览量
8
下载量
4
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621