To overcome the drawbacks of current traffic network flow prediction methods
such as the low capability of capturing highly dynamic spatio-temporal correlation and long-term spatial dependence
this paper constructs a novel traffic flow prediction model based on multi-head self-attention network. The model takes the data tensor at daily period and weekly period scales as model inputs to express the temporal similarity of traffic flow data
and obtains its static spatio-temporal characteristics by encoding the spatio-temporal position embedding of the input data. The main model designs temporal multi-head attention module and spatial multi-head attention module respectively based on multi-head self-attention mechanism for considering the dynamic spatio-temporal characteristics of traffic flow and the long-range spatial dependences. The temporal multi-head attention module obtains the local attention using a graph masking matrix and fuses it into a multi-head self-attention to extract the dynamic temporal characteristics of traffic flow. The spatial multi-head attention module obtains the local attention and global attention using two graph masking matrices and fuses them into a multi-head self-attention to extract the dynamic spatial characteristics and long-range spatial dependences of road network nodes. Finally
a gated fusion module is designed to adaptively fuse the spatio-temporal correlation characteristics of traffic flow data. The effectiveness of the proposed model is verified on three real traffic flow benchmark datasets PEMS04
PEMS07 and PEMS08
and the results show that the three prediction accuracy metrics of the proposed model on the three data sets improved by 4.437%
2.930%
and 4.275% on average compared with the other models with the highest accuracy.
LI C L , WANG S , YUE W W , et al . Virtual lanes for air-ground integrated transportation systems: Evolution and key techniques [J]. Acta Electronica Sinica , 2022 , 50 ( 5 ): 1255 - 1265 . (in Chinese)
LV Y S , DUAN Y J , KANG W W , et al . Traffic flow prediction with big data: A deep learning approach [J]. IEEE Transactions on Intelligent Transportation Systems , 2015 , 16 ( 2 ): 865 - 873 .
GORI M , MONFARDINI G , SCARSELLI F . A new model for learning in graph domains [C]// Proceedings of 2005 IEEE International Joint Conference on Neural Networks . Piscataway : IEEE , 2005 : 729 - 734 .
JIANG W W , LUO J Y , HE M , et al . Graph neural network for traffic forecasting: The research progress [J]. ISPRS International Journal of Geo-Information , 2023 , 12 ( 3 ): 100 .
YANG J X , YU C S , LI R , et al . Traffic network speed prediction via multi-periodic-component spatial-temporal neural network [J]. Journal of Transportation Systems Engineering and Information Technology , 2021 , 21 ( 3 ): 112 - 119, 139 . (in Chinese)
GUO S N , LIN Y F , WAN H Y , et al . Learning dynamics and heterogeneity of spatial-temporal graph data for traffic forecasting [J]. IEEE Transactions on Knowledge and Data Engineering , 2022 , 34 ( 11 ): 5415 - 5428 .
VASWANI A , SHAZEER N , PARMAR N , et al . Attention is all you need [C]// Proceedings of the 31st International Conference on Neural Information Processing Systems . New York : ACM , 2017 : 6000 - 6010 .
LIN Z Q , FENG J , LU Z Y , et al . DeepSTN+: Context-aware spatial-temporal neural network for crowd flow prediction in metropolis [J]. Proceedings of the AAAI Conference on Artificial Intelligence , 2019 , 33 ( 1 ): 1020 - 1027 .
ZONOOZI A , KIM J-J , LI X L , et al . Periodic-CRN: A convolutional recurrent model for crowd density prediction with recurring periodic patterns [C]// Proceedings of the 27th International Joint Conference on Artificial Intelligence . Stockholm : AAAI Press , 2018 : 3732 - 3738 .
HOANG V T , JEON H J , YOU E S , et al . Graph representation learning and its applications: A survey [J]. Sensors , 2023 , 23 ( 8 ): 4168 .
JEONG Y S , JEONG M K , OMITAOMU O A . Weighted dynamic time warping for time series classification [J]. Pattern Recognition , 2011 , 44 ( 9 ): 2231 - 2240 .
DRUCKER H , BURGES C J C , KAUFMAN L , et al . Support vector regression machines [C]// Proceedings of the 9th International Conference on Neural Information Processing Systems . New York : ACM , 1996 : 155 - 161 .
LI Y G , YU R , SHAHABI C , et al . Diffusion convolutional recurrent neural network: Data-driven traffic forecasting [EB/OL]. ( 2017 )[2023]. http://arxiv.org/abs/1707.01926.pdf http://arxiv.org/abs/1707.01926.pdf .
YU B , YIN H T , ZHU Z X . Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting [C]// Proceedings of the 27th International Joint Conference on Artificial Intelligence . New York : ACM , 2018 : 3634 - 3640 .
WU Z H , PAN S R , LONG G D , et al . Graph WaveNet for deep spatial-temporal graph modeling [EB/OL]. ( 2019 )[2023]. https://arxiv.org/abs/1906.00121 https://arxiv.org/abs/1906.00121 .
GUO S N , LIN Y F , FENG N , et al . Attention based spatial-temporal graph convolutional networks for traffic flow forecasting [J]. Proceedings of the AAAI Conference on Artificial Intelligence , 2019 , 33 ( 1 ): 922 - 929 .
JIANG J W , HAN C K , WANG J Y . BUAA_BIGSCity: Spatial-temporal graph neural network for wind power forecasting in Baidu KDD CUP 2022 [EB/OL]. ( 2022 )[2023]. https://arxiv.org/abs/2302.11159 https://arxiv.org/abs/2302.11159 .