1.昆明理工大学机电工程学院,云南昆明 650500
2.昆明理工大学国土资源工程学院,云南昆明 650500
[ "何自芬 女,1976年10月出生于河北省南宫市.现为昆明理工大学机电工程学院副教授、硕士生导师.主要研究方向为计算机视觉、图像处理.E-mail: zyhhzf1998@163.com" ]
[ "史本杰 男,1997年10月出生于四川省成都市.现在为昆明理工大学硕士研究生.主要研究方向为遥感图像处理.E-mail: 1217432073@qq.com" ]
[ "张印辉(通讯作者) 男,1977年9月出生于河北省衡水市.现为昆明理工大学机电工程学院教授、博士生导师.主要研究方向为图像处理、机器视觉和机器智能. Email: yinhui_z@163.com" ]
收稿:2022-01-17,
修回:2022-04-10,
纸质出版:2023-04-25
移动端阅览
何自芬,史本杰,张印辉等.多注意力融合的环高原湖泊遥感影像分割[J].电子学报,2023,51(04):885-895.
HE Zi-fen,SHI Ben-jie,ZHANG Yin-hui,et al.Remote Sensing Image Segmentation of Around Plateau Lakes Based on Multi-Attention Fusion[J].ACTA ELECTRONICA SINICA,2023,51(04):885-895.
何自芬,史本杰,张印辉等.多注意力融合的环高原湖泊遥感影像分割[J].电子学报,2023,51(04):885-895. DOI: 10.12263/DZXB.20220085.
HE Zi-fen,SHI Ben-jie,ZHANG Yin-hui,et al.Remote Sensing Image Segmentation of Around Plateau Lakes Based on Multi-Attention Fusion[J].ACTA ELECTRONICA SINICA,2023,51(04):885-895. DOI: 10.12263/DZXB.20220085.
环高原湖泊区域土地类别监测为湖泊生态保护和土地资源规划提供了决策依据.针对此区域遥感影像中河流、建筑物及植被目标分布零散、尺度不均导致分割精度较低的问题,设计了融合类别与多尺度注意力的遥感语义分割网络.该网络采用编码-解码的端到端结构并以深度残差神经网络为基础构建类别与多尺度注意力模块.类别注意力对网络特征层进行初步分类与空间信息过滤,有利于网络关注类别信息以降低像素分类误差;多尺度注意力将混合域注意力和多尺度特征进行融合,为不同尺度特征建立上下文联系,改善分布零散小尺度目标固有的分割消弥问题.实验结果表明,在建立的环滇池区域遥感影像语义分割数据集上,本文设计的注意力融合语义分割网络测试精度在平均交并比和平均像素精度指标下分别达到77.4%和86.3%.从整体分割效果来看,融合类别与多尺度注意力分割网络在一定程度上解决了分布零散小尺度目标区域的分割消弥问题,对环高原湖泊区域精准监测和科学规划提供了有效依据.
Land category monitoring in lake region around plateau provides decision-making basis for lake ecological protection and land resource planning. Aiming at the problem of low segmentation accuracy caused by scattered distribution and uneven scale of rivers
buildings and vegetation in remote sensing images of this region
a remote sensing semantic segmentation network integrating category and multi-scale attention is designed. The network adopts encoding-decoding end-to-end structure and constructs class and multi-scale attention modules based on depth residual neural network. Category attention makes a preliminary classification and spatial information filtering for the network feature layer
which is beneficial for the network to pay attention to category information and reduce pixel classification error; multi-scale attention combines mixed domain attention with multi-scale features
establishes context connection for different scale features
and improves the inherent segmentation and elimination problem of scattered small-scale targets. Experiments are performed on the semantic segmentation data set of remote sensing images around Dianchi Lake
and the test accuracy of the attention fusion semantic segmentation network designed in this paper reaches 77.4% and 86.3% under the average intersection ratio and average pixel accuracy index
respectively. From the overall segmentation effect
the fusion category and multi-scale attention segmentation network solve the segmentation and elimination problem of scattered small-scale target areas to a certain extent
and provide an effective basis for accurate monitoring and scientific planning of lakes around plateau.
闫立娟 , 郑绵平 , 魏乐军 . 近40年来青藏高原湖泊变迁及其对气候变化的响应 [J ] . 地学前缘 , 2016 , 23 ( 4 ): 310 - 323 .
YAN L J , ZHENG M P , WEI L J . Change of the lakes in Tibetan Plateau and its response to climate in the past forty years [J ] . Earth Science Frontiers , 2016 , 23 ( 4 ): 310 - 323 . (in Chinese)
朱立平 , 鞠建廷 , 乔宝晋 , 等 . “亚洲水塔”的近期湖泊变化及气候响应: 进展、问题与展望 [J ] . 科学通报 , 2019 , 64 ( 27 ): 2796 - 2806 .
ZHU L P , JU J T , QIAO B J , et al . Recent Lake changes of the Asia Water Tower and their climate response: Progress, problems and prospects [J ] . Chinese Science Bulletin , 2019 , 64 ( 27 ): 2796 - 2806 . (in Chinese)
董一凡 , 郑文秀 , 张晨雪 , 等 . 中国湖泊生态系统突变时空差异 [J ] . 湖泊科学 , 2021 , 33 ( 4 ): 992 - 1005 .
DONG Y F , ZHENG W X , ZHANG C X , et al . Temporal and spatial differences of lake ecosystem regime shift in China [J ] . Journal of Lake Sciences , 2021 , 33 ( 4 ): 992 - 1005 . (in Chinese)
杨蕴 , 李玉 , 赵泉华 . 高分辨率全色遥感图像多级阈值分割 [J ] . 光学精密工程 , 2020 , 28 ( 10 ): 2370 - 2383 .
YANG Y , LI Y , ZHAO Q H . Multi-level threshold segmentation of high-resolution panchromatic remote sensing imagery [J ] . Optics and Precision Engineering , 2020 , 28 ( 10 ): 2370 - 2383 . (in Chinese)
杨泽楠 , 牛海鹏 , 黄亮 , 等 . 基于MSR-cut的高空间分辨率遥感影像边缘检测分割 [J ] . 农业机械学报 , 2021 , 52 ( 8 ): 154 - 162 .
YANG Z N , NIU H P , HUANG L , et al . Edge detection segmentation method for high spatial resolution remote sensing image based on MSR-cut [J ] . Transactions of the Chinese Society for Agricultural Machinery , 2021 , 52 ( 8 ): 154 - 162 . (in Chinese)
罗会兰 , 张云 . 基于深度网络的图像语义分割综述 [J ] . 电子学报 , 2019 , 47 ( 10 ): 2211 - 2220 .
LUO H L , ZHANG Y . A survey of image semantic segmentation based on deep network [J ] . Acta Electronica Sinica , 2019 , 47 ( 10 ): 2211 - 2220 . (in Chinese)
RONNEBERGER O , FISCHER P , BROX T . U-Net: Convolutional networks for biomedical image segmentation [C ] // Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015 . Munich : Springer , 2015 : 234 - 241 .
LONG J , SHELHAMER E , DARRELL T . Fully convolutional networks for semantic segmentation [C ] // 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Boston : IEEE , 2015 : 3431 - 3440 .
ZHAO H S , SHI J P , QI X J , et al . Pyramid scene parsing network [C ] // 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Honolulu : IEEE , 2017 : 6230 - 6239 .
CHEN L C , ZHU Y K , PAPANDREOU G , et al . Encoder-decoder with atrous separable convolution for semantic image segmentation [C ] // European Conference on Computer Vision . Munich : Springer , 2018 : 833 - 851 .
张哲晗 , 方薇 , 杜丽丽 , 等 . 基于编码-解码卷积神经网络的遥感图像语义分割 [J ] . 光学学报 , 2020 , 40 ( 3 ): 46 - 55 .
ZHANG Z H , FANG W , DU L L , et al . Semantic segmentation of remote sensing image based on encoder-decoder convolutional neural network [J ] . Acta Optica Sinica , 2020 , 40 ( 3 ): 46 - 55 . (in Chinese)
BADRINARAYANAN V , KENDALL A , CIPOLLA R . SegNet: A deep convolutional encoder-decoder architecture for image segmentation [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017 , 39 ( 12 ): 2481 - 2495 .
尤洪峰 , 田生伟 , 禹龙 , 等 . 基于Word Embedding的遥感影像检测分割 [J ] . 电子学报 , 2020 , 48 ( 1 ): 75 - 83 .
YOU H F , TIAN S W , YU L , et al . Remote sensing image detection and segmentation based on word embedding [J ] . Acta Electronica Sinica , 2020 , 48 ( 1 ): 75 - 83 . (in Chinese)
肖春姣 , 李宇 , 张洪群 , 等 . 深度融合网结合条件随机场的遥感图像语义分割 [J ] . 遥感学报 , 2020 , 24 ( 3 ): 254 - 264 .
XIAO C J , LI Y , ZHANG H Q , et al . Semantic segmentation of remote sensing image based on deep fusion networks and conditional random field [J ] . Journal of Remote Sensing , 2020 , 24 ( 3 ): 254 - 264 . (in Chinese)
ROY A G , NAVAB N , WACHINGER C . Concurrent spatial and channel ‘squeeze & excitation’ in fully convolutional networks [C ] // Medical Image Computing and Computer Assisted Intervention - MICCAI 2018 . Granada : Springer , 2018 : 421 - 429 .
HU J , SHEN L , SUN G . Squeeze-and-excitation networks [C ] // 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Salt Lake City : IEEE , 2018 : 7132 - 7141 .
WOO S , PARK J , LEE J Y , et al . CBAM: Convolutional block attention module [C ] /// European Conference on Computer Vision - ECCV 2018 . Munich : Springer , 2018 : 3 - 19 .
WANG Q L , WU B G , ZHU P F , et al . ECA-net: Efficient channel attention for deep convolutional neural networks [C ] // 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . Glasgow : IEEE , 2020 : 11531 - 11539 .
WANG X L , GIRSHICK R , GUPTA A , et al . Non-local neural networks [C ] // 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Salt Lake City : IEEE , 2018 : 7794 - 7803 .
HUANG Z L , WANG X G , WEI Y C , et al . CCNet: Criss-cross attention for semantic segmentation [J/OL ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2020 . DOI: 10.1109/TPAMI.2020.3007032 http://dx.doi.org/10.1109/TPAMI.2020.3007032 .
LIU H J , LIU F Q , FAN X Y , et al . Polarized self-attention: Towards high-quality pixel-wise regression [EB/OL ] . ( 2021-07-02 )[ 2022-01 ] . https://arxiv.org/abs/2107.00782 https://arxiv.org/abs/2107.00782 .
HE K M , ZHANG X Y , REN S Q , et al . Deep residual learning for image recognition [C ] // 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Boston : IEEE , 2016 : 770 - 778 .
肖茜 , 杨昆 , 洪亮 . 近30 a云贵高原湖泊表面水体面积变化遥感监测与时空分析 [J ] . 湖泊科学 , 2018 , 30 ( 4 ): 1083 - 1096 .
XIAO Q , YANG K , HONG L . Remote sensing monitoring and temporal-spatial analysis of surface water body area changes of lakes on the Yunnan-Guizhou Plateau over the past 30 years [J ] . Journal of Lake Sciences , 2018 , 30 ( 4 ): 1083 - 1096 . (in Chinese)
XIE S N , GIRSHICK R , DOLLÁR P , et al . Aggregated residual transformations for deep neural networks [C ] // 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Honolulu : IEEE , 2017 : 5987 - 5995 .
HOWARD A , SANDLER M , CHEN B , et al . Searching for MobileNetV3 [C ] // 2019 IEEE/CVF International Conference on Computer Vision (ICCV) . Seoul : IEEE , 2020 : 1314 - 1324 .
SHAO Z F , ZHOU W X , DENG X Q , et al . Multilabel remote sensing image retrieval based on fully convolutional network [J ] . IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , 2020 , 13 : 318 - 328 .
0
浏览量
13
下载量
1
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621