
浏览全部资源
扫码关注微信
1. 海军工程大学, 湖北 武汉 430033
2. 海军参谋部军事训练中心, 北京 100080
3. 久之洋红外系统股份有限公司, 湖北 武汉 430223
Received:29 September 2024,
Published Online:28 August 2025,
Published:31 August 2025
移动端阅览
Weihao TAO, Yasong LUO, Fajun LIN, et al. Marine Ship Target Segmentation Method Based on Improved DeepLabV3+[J]. Acta Armamentarii, 2025, 46(8): 240912.
Weihao TAO, Yasong LUO, Fajun LIN, et al. Marine Ship Target Segmentation Method Based on Improved DeepLabV3+[J]. Acta Armamentarii, 2025, 46(8): 240912. DOI: 10.12382/bgxb.2024.0912.
船舶目标分割对于提高海洋监测系统的自动化程度具有重要意义。现有的分割方法往往存在误分割、易受噪声干扰、处理效率低等问题
难以灵活高效地适应实际场景的需求。针对海上船舶目标分割任务
提出一种基于改进DeepLabV3+网络的分割算法MET-DeepLabV3+。在图像预处理阶段采用线性变换、双边滤波和多尺度Retinex级联式算法
以减弱噪声、天气等不利因素的影响;模型设计中采用轻量化的MobileNetV2作为主干网络
以降低模型复杂度;引入ECA-Net注意力机制
以增强模型对多尺度特征的捕捉能力。采用迁移学习方法
将在SeaShips数据集上预训练得到的特征权重应用于模型训练中
进一步优化分割效果。实验结果表明
改进算法的平均交互比为91.62%
平均像素准确率为92.87%
相比于基础DeepLabV3+模型分别提高2.13%和1.11%
且高于HRNet、PSPNet和UNet等分割模型
较好地满足了船舶目标分割任务的实际需求
具有较高的应用价值。
Ship target segmentation plays a crucial role in enhancing the automation level of marine monitoring systems.However
the existing target segmentation methods often suffers from the issues such as mis-segmentation
susceptibility to noise interference
and low processing efficiency
making them difficult to adapt flexibly and efficiently to real-world scenarios.To address these challenges in maritime ship target segmentation tasks
a segmentation algorithm based on an improved DeepLabV3+ network
termed MET-DeepLabV3+
is proposed.During the image preprocessing stage
the bilateral filtering
multi-scale Retinex algorithm
and linear transformation are employed to mitigate the effects of noise
weather and other adverse factors.In terms of model design
the lightweight MobileNetV2 is used as the backbone network to reduce model complexity
and the ECA-Net attention mechanism is introduced to enhance the model's ability to capture the multi-scale features.Additionally
a transfer learning approach is adopted
and the feature weights pre-trained on the SeaShips dataset are applied to model training to further optimize the segmentation performance.Experimental results demonstrate that the improved algorithm achieves an average interaction-over-union (IoU) of 91.62% and an average pixel accuracy (PA) of 92.87%
which are improved by 2.13% and 1.11%
respectively
compared with those of the baseline DeepLabV3+ model.Moreover
it outperforms the segmentation models such as HRNet
PSPNet
and UNet
which effectively meets the practical demands of ship target segmentation tasks and offers significant application value.
余扬 . 基于二维Otsu的舰船目标图像分割方法 [J ] . 舰船电子工程 , 2022 , 42 ( 1 ): 36 - 39 .
YU Y . Ship target image segmentation method based on two-dimensional Otsu [J ] . Ship Electronic Engineering , 2022 , 42 ( 1 ): 36 - 39 . (in Chinese)
薛丽霞 , 孙伟 , 汪荣贵 , 等 . 基于边缘概率的层次交互式图像分割算法 [J ] . 合肥工业大学学报(自然科学版) , 2020 , 43 ( 3 ): 342 - 348 .
XUE L X , SUN W , WANG R G , et al . Hierarchical interactive image segmentation algorithm based on edge probability [J ] . Journal of Hefei University of Technology (Natural Science Edition) , 2020 , 43 ( 3 ): 342 - 348 . (in Chinese)
何翠萍 . 应用区域生长的无人船红外图像精确分割方法 [J ] . 信息与电脑(理论版) , 2022 , 34 ( 24 ): 81 - 83 .
HE C P . Precise segmentation method for infrared images of unmanned ships using region growing [J ] . Information & Computer(Theoretical Edition) , 2022 , 34 ( 24 ): 81 - 83 . (in Chinese)
黄泽贤 , 吴凡路 , 傅瑶 , 等 . 基于深度学习的遥感图像舰船目标检测算法综述 [J ] . 光学精密工程 , 2023 , 31 ( 15 ): 2295 - 2318 .
HUANG Z X , WU F L , FU Y , et al . A review of ship target detection algorithms in remote sensing images based on deep learning [J ] . Optics and Precision Engineering , 2023 , 31 ( 15 ): 2295 - 2318 . (in Chinese)
ARBELÁEZ P , MAIRE M , FOWLKES C , et al . Contour detection and hierarchical image segmentation [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2011 , 33 ( 5 ): 898 - 916 . DOI: 10.1109/TPAMI.2010.161 http://doi.org/10.1109/TPAMI.2010.161 This paper investigates two fundamental problems in computer vision: contour detection and image segmentation. We present state-of-the-art algorithms for both of these tasks. Our contour detector combines multiple local cues into a globalization framework based on spectral clustering. Our segmentation algorithm consists of generic machinery for transforming the output of any contour detector into a hierarchical region tree. In this manner, we reduce the problem of image segmentation to that of contour detection. Extensive experimental evaluation demonstrates that both our contour detection and segmentation methods significantly outperform competing algorithms. The automatically generated hierarchical segmentations can be interactively refined by user-specified annotations. Computation at multiple image resolutions provides a means of coupling our system to recognition applications.
吕亚飞 , 石敏 , 江志浩 , 等 . 基于图像分割的海上舰船目标智能感知算法 [J ] 舰船科学技术 , 2022 , 44 ( 24 ): 91 - 95 .
LÜ Y F , SHI M , JIANG Z H , et al . Intelligent perception algorithm for maritime ship targets based on image segmentation [J ] . Ship Science and Technology , 2022 , 44 ( 24 ): 91 - 95 . (in Chinese)
李琦 , 王树良 , 党迎旭 , 等 . 基于聚类分析的战场重要目标识别方法 [C ] // 第十届中国指挥控制大会论文集(上册) . 北京 : 中国指挥与控制学会 , 2022 : 472 - 478 .
LI Q , WANG S L , DANG Y X , et al . Battlefield key target recognition method based on cluster analysis [C ] // Proceedings of the 10th China Command and Control Conference (Volume I) . Beijing : Chinese Institute of Command and Control , 2022 : 472 - 478 . (in Chinese)
PARK H . α-MeanShift++:improving meanshift++ for image segmentation [J ] . IEEE Access , 2021 , 9 : 131430 - 131439 .
GANESAN P , SATHISH B S , JOSEPH L M I L , et al . Investigation of fuzzy and SOM based segmentation of high-resolution satellite images in HSL color space for information retrieval [C ] // Proceedings of the 2023 9th International Conference on Advanced Computing and Communication Systems . Coimbatore,India : IEEE , 2023 , 1 : 420 - 425 .
夏平 , 任强 , 吴涛 , 等 . 融合多尺度统计信息模糊C均值聚类与Markov随机场的小波域声纳图像分割 [J ] . 兵工学报 , 2017 , 38 ( 5 ): 940 - 948 . DOI: 10.3969/j.issn.1000-1093.2017.05.014 http://doi.org/10.3969/j.issn.1000-1093.2017.05.014 声纳图像成像质量差、特征信息弱,目标分割存在一定困难,为此提出一种融合多尺度统计信息的模糊C均值(FCM)聚类与Markov随机场(MRF)的小波域声纳图像分割算法。小波域中低频信息统计特性描述了低频不同区域像素聚类情况,高频信息反映了该方向纹理特征,依据低频子带的统计峰值选取FCM初始聚类中心,应用小波域FCM聚类算法对声纳图像进行预分割,抑制噪声的影响,提高了预分割的准确性;构建初分割后图像的多尺度MRF模型,尺度间节点标记的相关性采用1阶Markov性表征,尺度内构建2阶邻域系统描述系数间的标记联系,标记场采用双点多级逻辑模型建模,同一标记的系数特征场采用高斯模型建模,弥补了MRF算法中层次信息和轮廓信息描述的不足;应用迭代条件模型算法求其最小能量下的标记场,实现声纳图像分割。从视觉主观效果和客观评价指标两方面的实验结果验证表明,该算法分割声纳图像均优于FCM聚类算法和MRF算法,分割的声纳图像边缘与细节的清晰度、精细度均有一定程度改善。
XIA P , REN Q , WU T , et al . Sonar image segmentation fusion of multi-scale statistical information FCM clustering and MRF model in wavelet domain [J ] . Acta Armamentarii , 2017 , 38 ( 5 ): 940 - 948 . (in Chinese) DOI: 10.3969/j.issn.1000-1093.2017.05.014 http://doi.org/10.3969/j.issn.1000-1093.2017.05.014 Because of poor quality, weak feature information, and difficult target segmentation in sonar image, a sonar image segmentation algorithm based on fusing the fuzzy C-means (FCM) clustering of multi-scale statistical information and Markov random field (MRF) in the wavelet domain is proposed. In the wavelet domain, the low-frequency information depicts the clustering of pixels in different regions, and the high-frequency information reflects the texture feature in that direction. The proposed algorithm selects FCM initial cluster center from the low-frequency sub-band statistical peak value, and FCM algorithm is used for the pre-segmentation of sonar image and the suppression of the noise to improve the accuracy of pre-segmentation. The algorithm is used to construct multi-scale MRF model, and the correlation of inter-scale node marks is characterized by first order Markov. The intra-scale label connection among the description coefficients of two-order neighborhood system is constructed, double-point Multi-level Logistic (MLL) model is used in the label field, and Gauss model is used in the same-marked coefficient characteristic field, thus remeding the described shortfalls of hierarchical and silhouette information in MRF algorithm. The algorithm uses iteration condition model (ICM) algorithm to obtain its label fields for the minimum energy to realize the sonar image segmentation. The experimental results show that the proposed algorithm is better than FCM algorithm and MRF model algorithm, and the clarity and precision of the edge and details in the segmented sonar image are improved in a certain degree. Key
陈松生 . 基于模糊聚类的非监督学习研究 [D ] . 南京 : 南京师范大学 , 2007 .
CHEN S S . Research on unsupervised learning based on fuzzy clustering [D ] . Nanjing : Nanjing Normal University , 2007 . (in Chinese)
XU D X , ZHU X W , WANG Z H . Segmentation of marine targets instance based on improved mask R-CNN [C ] // Proceedings of the 2023 IEEE International Conference on Mechatronics and Automation . Harbin,China : IEEE , 2023 : 1633 - 1637 .
WEI C X , JI Z Y , WEI M S , et al . SCV-UNet:saliency-combined complex-valued U-Net for SAR ship target segmentation [C ] // Proceedings of the 2023 IEEE International Geoscience and Remote Sensing Symposium . Washington,D.C.,US : IEEE , 2023 : 6948 - 6951 .
CHEN L C , ZHU Y K , PAPANDREOU G , et al . Encoder-decoder with atrous separable convolution for semantic image segmentation: arXiv:1802.02611 [R ] . Ithaca,NY , US : Cornell University , 2018 :1802.02611.
郭开武 . 基于空间特征相关的图像去噪及分割算法研究 [D ] . 西安 : 西安电子科技大学 , 2014 .
GUO K W . Research on image denoising and segmentation algorithms based on spatial feature correlation [D ] . Xi'an : Xidian University , 2014 . (in Chinese)
钱康亮 . 基于图像局部滤波去噪增强算法的图像语义分割模型 [J ] . 机械设计与制造工程 , 2023 , 52 ( 10 ): 109 - 113 .
QIAN K L . Image semantic segmentation model based on local filtering denoising and enhancement algorithm [J ] . Journal of Mechanical Design and Manufacturing Engineering , 2023 , 52 ( 10 ): 109 - 113 . (in Chinese)
TOMASI C , MANDUCHI R . Bilateral filtering for gray and color images [C ] // Proceedings of the Sixth International Conference on Computer Vision . Bombay,India : IEEE , 1998 : 839 - 846 .
RAHMAN Z , JOBSON D J , WOODELL G A . Multi-scale retinex for color image enhancement [C ] // Proceedings of the 3rd IEEE International Conference on Image Processing . Lausanne,Switzerland : IEEE , 1996 , 3 : 1003 - 1006 .
MUNTEANU C , ROSA A . Color image enhancement using evolutionary principles and the retinex theory of color constancy [C ] // Neural Networks for Signal Processing XI:Proceedings of the 2001 IEEE Signal Processing Society Workshop . North Falmouth,MA,US : IEEE , 2001 : 393 - 402 .
CHEN L C . Rethinking atrous convolution for semantic image segmentation:arxiv:1706.05587 [R ] . Ithaca,NY,US : Cornell University , 2017 :1706.05587.
CHOLLET F . Xception:deep learning with depthwise separable convolutions [C ] // Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Honolulu,HI,US : IEEE , 2017 : 1251 - 1258 .
刘文祥 , 舒远仲 , 唐小敏 , 等 . 采用双注意力机制DeepLabv3+算法的遥感影像语义分割 [J ] . 热带地理 , 2020 , 40 ( 2 ): 303 - 313 . DOI: 10.13284/j.cnki.rddl.003229 http://doi.org/10.13284/j.cnki.rddl.003229 针对DeepLabv3+网络在遥感影像上呈现出拟合速度慢,边缘目标分割不精确,大尺度目标分割类内不一致、存在孔洞等缺陷,提出在该网络中引入双注意力机制模块(Dual Attention Mechanism Module, DAMM),设计并实现了将DAMM结构与ASPP(Atous Spatial Pyramid Pooling)层串联或并联的2种不同连接方式网络模型,串联连接方式中先将特征图送入DAMM后,再经过ASPP结构;并联连接方式中将双注意力机制层与ASPP层并行连接,网络并行处理主干网提取特征图,再融合两层处理特征信息。将改进的2种方法通过INRIA Aerial Image高分辨率遥感影像数据集验 证,结果表明,串联或并联方式2种网络都能有效改善Deeplabv3+的不足,并联方式网络性能更好,其对原网络缺陷改善效果更明显,并在测试数据集上mIoU达到85.44%,比Deeplabv3+提高了1.8%,而串联方式网络提高了1.12%。并联结构网络更符合本文需求,其形成了一种对DeepLabv3+网络上述问题进行统一改善的方案。
LIU W X , SHU Y Z , TANG X M , et al . Semantic segmentation of remote sensing images using dual attention mechanism DeepLabV3+ algorithm [J ] . Tropical Geography , 2020 , 40 ( 2 ): 303 - 313 . (in Chinese)
FU H X , GU Z Q , WANG B Y , et al . Marine target segmentation based on improved DeepLabV3+ [C ] // Proceedings of the 2022 41st Chinese Control Conference . Hefei,China : IEEE , 2022 : 7314 - 7319 .
CHENG L , XIONG R , WU J R , et al . Fast segmentation algorithm of USV accessible area based on attention fast DeepLabV3 [J ] . IEEE Sensors Journal , 2024 , 24 ( 15 ): 24168 - 24177 .
陈昌川 , 郝晓严 , 龙虹毓 , 等 . 基于改进DeepLabv3+网络的沥青道路裂缝检测方法 [J ] . 半导体光电 , 2024 , 45 ( 3 ): 493 - 500 .
CHEN C C , HAO X Y , LONG H Y , et al . Asphalt road crack detection method based on improved DeepLabV3+ network [J ] . Semiconductor Optoelectronics , 2024 , 45 ( 3 ): 493 - 500 . (in Chinese)
SANDLER M , HOWARD A , ZHU M L , et al . MobileNetV2:Inverted residuals and linear bottlenecks [C ] // Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Salt Lake City,UT,US : IEEE , 2018 : 4510 - 4520 .
LU H X , HONG Z Y , ZENG Z Q , et al . Improving semantic segmentation of switch connector images in DeepLabV3+ [C ] // Proceedings of the 2023 6th International Conference on Information Communication and Signal Processing . Xi'an,China : IEEE , 2023 : 90 - 96 .
ZHANG C Y , LI M L , WEI D Z , et al . Enhanced DeepLabV3+ for urban land use classification based on UAV-borne images [C ] // Proceedings of the 2022 7th International Conference on Image,Vision and Computing . Xi'an,China : IEEE , 2022 : 449 - 454 .
SIFRE L , MALLAT S . Rigid-motion scattering for texture classification:arXiv:1403.1687 [R ] . Ithaca,NY,US : Cornell University , 2014 :1403.1687.
WANG Y , YANG L , LIU X Z , et al . An improved semantic segmentation algorithm for high-resolution remote sensing images based on DeepLabV3+ [J ] . Scientific Reports , 2024 , 14 ( 1 ): 9716 . DOI: 10.1038/s41598-024-60375-1 http://doi.org/10.1038/s41598-024-60375-1 High-precision and high-efficiency Semantic segmentation of high-resolution remote sensing images is a challenge. Existing models typically require a significant amount of training data to achieve good classification results and have numerous training parameters. A novel model called MST-DeepLabv3+ was suggested in this paper for remote sensing image classification. It's based on the DeepLabv3+ and can produce better results with fewer train parameters. MST-DeepLabv3+ made three improvements: (1) Reducing the number of model parameters by substituting MobileNetV2 for the Xception in the DeepLabv3+'s backbone network. (2) Adding the attention mechanism module SENet to increase the precision of semantic segmentation. (3) Increasing Transfer Learning to enhance the model's capacity to recognize features, and raise the segmentation accuracy. MST-DeepLabv3+ was tested on international society for photogrammetry and remote sensing (ISPRS) dataset, Gaofen image dataset (GID), and practically applied to the Taikang cultivated land dataset. On the ISPRS dataset, the mean intersection over union (MIoU), overall accuracy (OA), Precision, Recall, and F1-score are 82.47%, 92.13%, 90.34%, 90.12%, and 90.23%, respectively. On the GID dataset, these values are 73.44%, 85.58%, 84.10%, 84.86%, and 84.48%, respectively. The results were as high as 90.77%, 95.47%, 95.28%, 95.02%, and 95.15% on the Taikang cultivated land dataset. The experimental results indicate that MST-DeepLabv3+ effectively improves the accuracy of semantic segmentation of remote sensing images, recognizes the edge information with more completeness, and significantly reduces the parameter size.© 2024. The Author(s).
NIU Z Y , ZHONG G Q , YU H . A review on the attention mechanism of deep learning [J ] . Neurocomputing , 2021 , 452 : 48 - 62 .
WANG Q L , WU B G , ZHU P F , et al . ECA-net:efficient channel attention for deep convolutional neural networks:arXiv:1910.03151 [R ] . Ithaca,NY,US : Cornell University , 2019 :1910.03151.
HU J , SHEN L , SUN G . Squeeze-and-excitation networks [C ] // Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Salt Lake City,UT,US : IEEE , 2018 : 7132 - 7141 .
PAN S J , YANG Q . A survey on transfer learning [J ] . IEEE Transactions on Knowledge and Data Engineering , 2009 , 22 ( 10 ): 1345 - 1359 .
LI T W , LEE G C . Performance analysis of fine-tune transferred deep learning [C ] // Proceedings of the 2021 IEEE 3rd Eurasia Conference on IOT,Communication and Engineering . Yunlin,Taiwan,China : IEEE , 2021 : 315 - 319 .
SHAO Z F , WU W J , WANG Z Y , et al . Seaships:a large-scale precisely annotated dataset for ship detection [J ] . IEEE Transactions on Multimedia , 2018 , 20 ( 10 ): 2593 - 2604 .
赵为平 , 陈雨 , 项松 , 等 . 基于改进的DeepLabv3+图像语义分割算法研究 [J ] . 系统仿真学报 , 2023 , 35 ( 11 ): 2333 - 2344 . DOI: 10.16182/j.issn1004731x.joss.22-0690 http://doi.org/10.16182/j.issn1004731x.joss.22-0690 目前主流图像语义分割网络往往存在误分割、分割不连续和模型复杂度高的问题,不能灵活高效地部署于实际场景中。针对这一现象,通过综合考虑网络的参数量、预测时间和准确度,设计出一种优化DeepLabv3+模型的图像语义分割网络。骨干网络改用轻量级EfficientNetv2网络提取特征,提高参数利用率;在空洞空间金字塔池化模块中使用混合条带池化模块代替全局平均池化,引入深度可分离膨胀卷积,减少参数量和提高学习多尺度信息的能力;使用注意力机制增强模型表征力,提取骨干网络多条浅层特征,丰富图像的几何细节信息。实验表明,本文算法可达到mIoU为81.19%,参数量为55.51×10<sup>6</sup>,有效优化了分割精度和模型复杂度,同时也提高了模型泛化性。
ZHAO W P , CHEN Y , XIANG S , et al . Research on an improved DeepLabV3+ image semantic segmentation algorithm [J ] . Journal of System Simulation , 2023 , 35 ( 11 ): 2333 - 2344 . (in Chinese)
NI Z L , CHEN X H , ZHAI Y J , et al . Context-guided spatial feature reconstruction for efficient semantic segmentation [C ] // Proceedings of the European Conference on Computer Vision . Cham,Germany : Springer , 2024 : 239 - 255 .
XIE B , CAO J L , XIE J , et al . SED:a simple encoder-decoder for open-vocabulary semantic segmentation [C ] // Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . Washington,D.C.,US : IEEE , 2024 : 3426 - 3436 .
0
Views
106
下载量
0
CNKI被引量
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024360号