

浏览全部资源
扫码关注微信
重庆师范大学计算机与信息科学学院,重庆 401331
Received:03 August 2023,
Revised:2024-05-14,
Published:25 July 2024
移动端阅览
项秋艳, 訾玲玲, 丛鑫. 改进自适应模型池的在线异常检测算法[J]. 电子学报, 2024, 52(07): 2503-2514.
XIANG Qiu-yan, ZI Ling-ling, CONG Xin. Improved Adaptive Model Pools for Online Anomaly Detection Algorithms[J]. Acta Electronica Sinica, 2024, 52(07): 2503-2514.
项秋艳, 訾玲玲, 丛鑫. 改进自适应模型池的在线异常检测算法[J]. 电子学报, 2024, 52(07): 2503-2514. DOI:10.12263/DZXB.20230731
XIANG Qiu-yan, ZI Ling-ling, CONG Xin. Improved Adaptive Model Pools for Online Anomaly Detection Algorithms[J]. Acta Electronica Sinica, 2024, 52(07): 2503-2514. DOI:10.12263/DZXB.20230731
精确的在线异常检测方法是物联网行业发展的核心,其中,以复杂和动态数据流为对象的在线异常识别是研究热点.现有在线异常检测方法存在处理复杂性负载过重问题,离线深度异常检测方法则存在因数据分布变化导致概念漂移问题.针对上述问题,本文提出了改进自适应模型池的在线异常检测框架,该框架可以与基于自动编码器的异常检测方法协作实现在线异常检测.首先,利用基于自动编码器的异常检测模型进行基本异常识别;其次,以自适应模型池为基础,融合概念漂移检测算法准确识别概念漂移,适应动态变化的数据流,解决概念漂移现象;最后,优化自适应模型池的模型合并方法,提升在线异常识别能力.实验结果表明,相比自动编码器模型的流变体和原自适应模型池算法,提出的算法在异常检测精度指标上分别提升了20.2%和5.83%,同时,最佳精度指标高于现有在线异常检测算法约16.7%.
Accurate online anomaly detection methods are at the core of the development of IoT-related industries
in which online anomaly identification targeting complex and dynamic data streams is one of the important research hotspots. Existing online anomaly detection methods suffer from the problem of processing complexity overload
while offline deep anomaly detection methods suffer from the problem of concept drift due to the change of data distribution. To address the above problems
this paper proposes an online anomaly detection framework with improved adaptive model pooling
which can collaborate with autoencoder-based anomaly detection methods to achieve online anomaly detection. Firstly
the basic anomaly identification is carried out using the autoencoder-based anomaly detection model. Secondly
based on the adaptive model pool
the concept drift detection algorithm is integrated to accurately identify concept drift
adapt to the dynamically changing data flow
and solve the concept drift phenomenon. Finally
the model merging method of the optimised adaptive model pool is optimised
which enhances the capability of online anomaly identification. The experimental results show that compared with the flow variant of autoencoder model and the original adaptive model pool algorithm
the proposed algorithm improves the anomaly detection accuracy indexes by 20.2% and 5.83% respectively
and meanwhile is higher than the existing online anomaly detection algorithms in the best accuracy indexes by about 16.7%.
NASSIF A BOU , TALIB M ABU , NASIR Q , et al . Machine learning for anomaly detection: A systematic review [J ] . IEEE Access , 2021 , 9 : 78658 - 78700 .
孙海丽 , 龙翔 , 韩兰胜 , 等 . 工业物联网异常检测技术综述 [J ] . 通信学报 , 2022 , 43 ( 3 ): 196 - 210 .
SUN H L , LONG X , HAN L S , et al . Overview of anomaly detection techniques for industrial Internet of Things [J ] . Journal on Communications , 2022 , 43 ( 3 ): 196 - 210 . (in Chinese)
金明 , 丁蓉 . 一种联合时域和空域残差的网络异常检测与节点定位方法 [J ] . 电子学报 , 2023 , 51 ( 5 ): 1172 - 1178 .
JIN M , DING R . Detection and localization of outlier nodes in wireless sensor networks via jointing temporal and spatial residuals [J ] . Acta Electronica Sinica , 2023 , 51 ( 5 ): 1172 - 1178 . (in Chinese)
LU T Y , WANG L , ZHAO X Y . Review of anomaly detection algorithms for data streams [J ] . Applied Sciences , 2023 , 13 ( 10 ): 6353 .
刘帅 , 乔颖 , 罗雄飞 , 等 . 时序数据库关键技术综述 [J ] . 计算机研究与发展 , 2024 , 61 ( 3 ): 614 - 638 .
LIU S , QIAO Y , LUO X F , et al . Key techniques of time series databases: A survey [J ] . Journal of Computer Research and Development , 2024 , 61 ( 3 ): 614 - 638 . (in Chinese)
陆克中 , 陈超凡 , 蔡桓 , 等 . 面向概念漂移和类不平衡数据流的在线分类算法 [J ] . 电子学报 , 2022 , 50 ( 3 ): 585 - 597 .
LU K Z , CHEN C F , CAI H , et al . Online classification algorithm for concept drift and class imbalance data stream [J ] . Acta Electronica Sinica , 2022 , 50 ( 3 ): 585 - 597 . (in Chinese)
PANG G S , SHEN C H , CAO L B , et al . Deep learning for anomaly detection: A review [J ] . ACM Computing Surveys , 2021 , 54 ( 2 ): 38 .
WANG R Y , NIE K X , WANG T , et al . Deep learning for anomaly detection [C ] // Proceedings of the 13th International Conference on Web Search and Data Mining . New York : ACM , 2020 : 894 - 896 .
蒋洪迅 , 江俊毅 , 梁循 . 基于机器学习的信用卡交易欺诈检测研究综述 [J ] . 计算机工程与应用 , 2023 , 59 ( 21 ): 1 - 25 .
JIANG H X , JIANG J Y , LIANG X . Survey on credit card transaction fraud detection based on machine learning [J ] . Computer Engineering and Applications , 2023 , 59 ( 21 ): 1 - 25 . (in Chinese)
CHOI K , YI J H , PARK C , et al . Deep learning for anomaly detection in time-series data: Review, analysis, and guidelines [J ] . IEEE Access , 2021 , 9 : 120043 - 120065 .
SCHLIMMER J C , GRANGER R H . Incremental learning from noisy data [J ] . Machine Learning , 1986 , 1 ( 3 ): 317 - 354 .
文益民 , 刘帅 , 缪裕青 , 等 . 概念漂移数据流半监督分类综述 [J ] . 软件学报 , 2022 , 33 ( 4 ): 1287 - 1314 .
WEN Y M , LIU S , MIAO Y Q , et al . Survey on semi-supervised classification of data streams with concept drifts [J ] . Journal of Software , 2022 , 33 ( 4 ): 1287 - 1314 . (in Chinese)
SOUZA V M A , DOS REIS D M , MALETZKE A G , et al . Challenges in benchmarking stream learning algorithms with real-world data [J ] . Data Mining and Knowledge Discovery , 2020 , 34 ( 6 ): 1805 - 1858 .
韩光洁 , 赵腾飞 , 刘立 , 等 . 基于多元区域集划分的工业数据流概念漂移检测 [J ] . 电子学报 , 2023 , 51 ( 7 ): 1906 - 1916 .
HAN G J , ZHAO T F , LIU L , et al . Concept drift detection of industrial data flow based on multivariate region set partition [J ] . Acta Electronica Sinica , 2023 , 51 ( 7 ): 1906 - 1916 . (in Chinese)
BHATIA S , JAIN A , LI P , et al . Mstream: Fast anomaly detection in multi-aspect streams [C ] // Proceedings of the Web Conference 2021 . New York : ACM , 2021 : 3371 - 3382 .
YOON S , SHIN Y , LEE J G , et al . Multiple dynamic outlier-detection from a data stream by exploiting duality of data and queries [C ] // Proceedings of the 2021 International Conference on Management of Data . New York : ACM , 2021 : 2063 - 2075 .
YOON S , LEE Y , LEE J G , et al . Adaptive model pooling for online deep anomaly detection from a complex evolving data stream [C ] // Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining . New York : ACM , 2022 : 2347 - 2357 .
GUO T , XU Z , YAO X , et al . Robust online time series prediction with recurrent neural networks [C ] // 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA) . Piscataway : IEEE , 2016 : 816 - 825 .
MALHOTRA P , RAMAKRISHNAN A , ANAND G , et al . LSTM-based encoder-decoder for multi-sensor anomaly detection [EB/OL ] . ( 2016-07-01 )[ 2023-08-01 ] . http://arxiv.org/abs/1607.00148 http://arxiv.org/abs/1607.00148 .
LI G , JUNG J J . Deep learning for anomaly detection in multivariate time series: Approaches, applications, and challenges [J ] . Information Fusion , 2023 , 91 ( C ): 93 - 102 .
WANG H Z , BAH M J , HAMMAD M . Progress in outlier detection techniques: A survey [J ] . IEEE Access , 2019 , 7 : 107964 - 108000 .
QIN X , CAO L , RUNDENSTEINER E A , et al . Scalable kernel density estimation-based local outlier detection over large data streams [C ] // 22nd International Conference on Extending Database Technology (EDBT) . Spain : OpenProceedings , 2019 : 421 - 432 .
YOON S , LEE J G , LEE B S . Ultrafast local outlier detection from a data stream with stationary region skipping [C ] // Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining . New York : ACM , 2020 : 1181 - 1191 .
YOON S , LEE J G , LEE B S . NETS [J ] . Proceedings of the VLDB Endowment , 2019 , 12 ( 11 ): 1303 - 1315 .
SALEHI M , LECKIE C , BEZDEK J C , et al . Fast memory efficient local outlier detection in data streams [J ] . IEEE Transactions on Knowledge and Data Engineering , 2016 , 28 ( 12 ): 3246 - 3260 .
BLÁZQUEZ-GARCÍA A , CONDE A , MORI U , et al . A review on outlier/anomaly detection in time series data [J ] . ACM Computing Surveys , 54 ( 3 ): 56 .
HILAL W , GADSDEN S A , YAWNEY J . Financial fraud: A review of anomaly detection techniques and recent advances [J ] . Expert Systems with Applications , 2022 , 193 : 116429 .
QI S . Deep Autoencoding gaussian mixture model for unsupervised anomaly detection [C ] // International Conference on Learning Representations . Vancouver : OpenReview , 2018 : 1 - 19 .
LAI C H , ZOU D M , LERMAN G . Robust subspace recovery layer for unsupervised anomaly detection [EB/OL ] . ( 2019-03-30 )[ 2023-08-01 ] . http://arxiv.org/abs/1904.00152 http://arxiv.org/abs/1904.00152 .
KIM K H , SHIM S , LIM Y , et al . Rapp: Novelty detection with reconstruction along projection pathway [C ] // International Conference on Learning Representations . Ethiopia : OpenReview , 2020 : 1 - 10 .
LU W N , CHENG Y , XIAO C , et al . Unsupervised sequential outlier detection with deep architectures [J ] . IEEE Transactions on Image Processing , 2017 , 26 ( 9 ): 4321 - 4330 .
XIANG Q Y , ZI L L , CONG X , et al . Concept drift adaptation methods under the deep learning framework: A literature review [J ] . Applied Sciences , 2023 , 13 ( 11 ): 6515 .
SUÁREZ-CETRULO A L , QUINTANA D , CERVANTES A . A survey on machine learning for recurring concept drifting data streams [J ] . Expert Systems with Applications , 2023 , 213 : 118934 .
穆栋梁 , 韩萌 , 李昂 , 等 . 概念漂移复杂数据流分类方法综述 [J ] . 计算机应用 , 2023 , 43 ( 6 ): 1664 - 1675 .
MU D L , HAN M , LI A , et al . Overview of classification methods for complex data streams with concept drift [J ] . Journal of Computer Applications , 2023 , 43 ( 6 ): 1664 - 1675 . (in Chinese)
BANK D , KOENIGSTEIN N , GIRYES R . Autoencoders [EB/OL ] . ( 2020-03-12 )[ 2023-08-01 ] . http://arxiv.org/abs/2003.05991 http://arxiv.org/abs/2003.05991 .
FRIEDRICH B , SAWABE T , HEIN A . Unsupervised statistical concept drift detection for behaviour abnormality detection [J ] . Applied Intelligence , 2023 , 53 ( 3 ): 2527 - 2537 .
ASHFAHANI A , PRATAMA M , LUGHOFER E , et al . DEVDAN: Deep evolving denoising autoencoder [J ] . Neurocomputing , 2020 , 390 : 297 - 314 .
BHATIA S , JAIN A , SRIVASTAVA S , et al . MemStream: Memory-based streaming anomaly detection [C ] // Proceedings of the ACM Web Conference 2022 . New York : ACM , 2022 : 610 - 621 .
GAMA J , SEBASTIÃO R , RODRIGUES P P . On evaluating stream learning algorithms [J ] . Machine Learning , 2013 , 90 ( 3 ): 317 - 346 .
GAMA J , MEDAS P , CASTILLO G , et al . Learning with drift detection [C ] // Brazilian Symposium on Artificial Intelligence . Berlin : Springer , 2004 : 286 - 295 .
WANG X M , CHEN W , XIA J Z , et al . ConceptExplorer: Visual analysis of concept drifts in multi-source time-series data [C ] // 2020 IEEE Conference on Visual Analytics Science and Technology (VAST) . Piscataway : IEEE , 2020 : 1 - 11 .
XIONG W L , HE D F , MU J B , et al . Adaptive stochastic model predictive control via network ensemble learning [J ] . International Journal of Systems Science , 2023 , 54 ( 16 ): 3013 - 3026 .
FRÍAS-BLANCO I , DEL CAMPO-ÁVILA J , RAMOS-JIMÉNEZ G , et al . Online and non-parametric drift detection methods based on hoeffding’s bounds [J ] . IEEE Transactions on Knowledge and Data Engineering , 2015 , 27 ( 3 ): 810 - 823 .
BIFET A , GAVALDÀ R . Learning from time-changing data with adaptive windowing [C ] // Proceedings of the 2007 SIAM International Conference on Data Mining . Philadelphia : Society for Industrial and Applied Mathematics , 2007 : 443 - 448 .
KORNBLITH S , NOROUZI M , LEE H , et al . Similarity of neural network representations revisited [EB/OL ] . ( 2019-05-01 )[ 2023-08-01 ] . http://arxiv.org/abs/1905.00414 http://arxiv.org/abs/1905.00414 .
LECUN Y , BOTTOU L , BENGIO Y , et al . Gradient-based learning applied to document recognition [J ] . Proceedings of the IEEE , 1998 , 86 ( 11 ): 2278 - 2324 .
SHIN S Y , JO G , WANG G X . A novel method for fashion clothing image classification based on deep learning [J ] . Journal of Information and Communication Technology , 2023 , 22 ( 1 ): 127 - 148 .
YANG Y J , WANG W , FU H Y , et al . On supervised feature selection from high dimensional feature spaces [EB/OL ] . ( 2022-03-22 )[ 2023-08-01 ] . http://arxiv.org/abs/2203.11924 http://arxiv.org/abs/2203.11924 .
VERGARA A , VEMBU S , AYHAN T , et al . Chemical gas sensor drift compensation using classifier ensembles [J ] . Sensors and Actuators B: Chemical , 2012 , 166/167 : 320 - 329 .
LOSING V , HAMMER B , WERSING H . KNN classifier with self adjusting memory for heterogeneous concept drift [C ] // 2016 IEEE 16th International Conference on Data Mining (ICDM) . Piscataway : IEEE , 2016 : 291 - 300 .
CAMPOS G O , ZIMEK A , SANDER J , et al . On the evaluation of unsupervised outlier detection: Measures, datasets, and an empirical study [J ] . Data Mining and Knowledge Discovery , 2016 , 30 ( 4 ): 891 - 927 .
0
Views
14
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621