• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过解耦主体和边缘信息改进语义分割

Improving Semantic Segmentation via Decoupled Body and Edge Information.

作者信息

Yu Lintao, Yao Anni, Duan Jin

机构信息

College of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China.

出版信息

Entropy (Basel). 2023 Jun 2;25(6):891. doi: 10.3390/e25060891.

DOI:10.3390/e25060891
PMID:37372235
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10296844/
Abstract

In this paper, we propose a method that uses the idea of decoupling and unites edge information for semantic segmentation. We build a new dual-stream CNN architecture that fully considers the interaction between the body and the edge of the object, and our method significantly improves the segmentation performance of small objects and object boundaries. The dual-stream CNN architecture mainly consists of a body-stream module and an edge-stream module, which process the feature map of the segmented object into two parts with low coupling: body features and edge features. The body stream warps the image features by learning the flow-field offset, warps the body pixels toward object inner parts, completes the generation of the body features, and enhances the object's inner consistency. In the generation of edge features, the current state-of-the-art model processes information such as color, shape, and texture under a single network, which will ignore the recognition of important information. Our method separates the edge-processing branch in the network, i.e., the edge stream. The edge stream processes information in parallel with the body stream and effectively eliminates the noise of useless information by introducing a non-edge suppression layer to emphasize the importance of edge information. We validate our method on the large-scale public dataset Cityscapes, and our method greatly improves the segmentation performance of hard-to-segment objects and achieves state-of-the-art result. Notably, the method in this paper can achieve 82.6% mIoU on the Cityscapes with only fine-annotated data.

摘要

在本文中,我们提出了一种利用解耦思想并结合边缘信息进行语义分割的方法。我们构建了一种新的双流卷积神经网络(CNN)架构,该架构充分考虑了物体主体与边缘之间的相互作用,并且我们的方法显著提高了小物体和物体边界的分割性能。双流CNN架构主要由主体流模块和边缘流模块组成,它们将分割物体的特征图处理为耦合度低的两部分:主体特征和边缘特征。主体流通过学习流场偏移来扭曲图像特征,将主体像素向物体内部扭曲,完成主体特征的生成,并增强物体的内部一致性。在边缘特征生成方面,当前最先进的模型在单个网络下处理颜色、形状和纹理等信息,这会忽略对重要信息的识别。我们的方法在网络中分离出边缘处理分支,即边缘流。边缘流与主体流并行处理信息,并通过引入非边缘抑制层来强调边缘信息的重要性,从而有效消除无用信息的噪声。我们在大规模公共数据集Cityscapes上验证了我们的方法,并且我们的方法极大地提高了难分割物体的分割性能,并取得了当前最优的结果。值得注意的是,本文的方法仅使用精细标注的数据就能在Cityscapes上达到82.6%的平均交并比(mIoU)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/84ac7e9def1a/entropy-25-00891-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/87486803495b/entropy-25-00891-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/50f4ff54c4ec/entropy-25-00891-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/5f34d6bc75b5/entropy-25-00891-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/f9ae7dba3e1d/entropy-25-00891-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/4e994368f2d0/entropy-25-00891-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/ca92c3e67ea6/entropy-25-00891-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/9a1d1802f6a7/entropy-25-00891-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/3d7743175e99/entropy-25-00891-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/84ac7e9def1a/entropy-25-00891-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/87486803495b/entropy-25-00891-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/50f4ff54c4ec/entropy-25-00891-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/5f34d6bc75b5/entropy-25-00891-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/f9ae7dba3e1d/entropy-25-00891-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/4e994368f2d0/entropy-25-00891-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/ca92c3e67ea6/entropy-25-00891-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/9a1d1802f6a7/entropy-25-00891-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/3d7743175e99/entropy-25-00891-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c2a/10296844/84ac7e9def1a/entropy-25-00891-g009.jpg

相似文献

1
Improving Semantic Segmentation via Decoupled Body and Edge Information.通过解耦主体和边缘信息改进语义分割
Entropy (Basel). 2023 Jun 2;25(6):891. doi: 10.3390/e25060891.
2
EA-Net: Edge-aware network for brain structure segmentation via decoupled high and low frequency features.EA-Net:通过解耦高低频特征实现脑结构分割的边缘感知网络。
Comput Biol Med. 2022 Nov;150:106139. doi: 10.1016/j.compbiomed.2022.106139. Epub 2022 Sep 21.
3
BASeg: Boundary aware semantic segmentation for autonomous driving.BASeg:用于自动驾驶的边界感知语义分割。
Neural Netw. 2023 Jan;157:460-470. doi: 10.1016/j.neunet.2022.10.034. Epub 2022 Nov 9.
4
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
5
Multi-task edge-recalibrated network for male pelvic multi-organ segmentation on CT images.多任务边缘再校准网络用于 CT 图像上的男性骨盆多器官分割。
Phys Med Biol. 2021 Jan 26;66(3):035001. doi: 10.1088/1361-6560/abcad9.
6
Multi-scale dense selective network based on border modeling for lung nodule segmentation.基于边界建模的多尺度密集选择网络用于肺结节分割。
Int J Comput Assist Radiol Surg. 2023 May;18(5):845-853. doi: 10.1007/s11548-022-02817-7. Epub 2023 Jan 13.
7
Rethinking 1D convolution for lightweight semantic segmentation.重新思考用于轻量级语义分割的一维卷积
Front Neurorobot. 2023 Feb 9;17:1119231. doi: 10.3389/fnbot.2023.1119231. eCollection 2023.
8
A novel deep learning model DDU-net using edge features to enhance brain tumor segmentation on MR images.一种利用边缘特征增强磁共振图像上脑肿瘤分割的新型深度学习模型 DDU-net。
Artif Intell Med. 2021 Nov;121:102180. doi: 10.1016/j.artmed.2021.102180. Epub 2021 Sep 28.
9
Lightweight semantic segmentation network with configurable context and small object attention.具有可配置上下文和小目标注意力的轻量级语义分割网络。
Front Comput Neurosci. 2023 Oct 23;17:1280640. doi: 10.3389/fncom.2023.1280640. eCollection 2023.
10
PEA-Net: A progressive edge information aggregation network for vessel segmentation.PEA-Net:一种用于血管分割的渐进边缘信息聚合网络。
Comput Biol Med. 2024 Feb;169:107766. doi: 10.1016/j.compbiomed.2023.107766. Epub 2023 Nov 23.

引用本文的文献

1
A Boundary-Enhanced Decouple Fusion Segmentation Network for Diagnosis of Adenomatous Polyps.用于腺瘤性息肉诊断的边界增强解耦融合分割网络
J Imaging Inform Med. 2025 Feb;38(1):229-244. doi: 10.1007/s10278-024-01195-7. Epub 2024 Jul 22.

本文引用的文献

1
CenterPNets: A Multi-Task Shared Network for Traffic Perception.CenterPNets:一种用于交通感知的多任务共享网络。
Sensors (Basel). 2023 Feb 23;23(5):2467. doi: 10.3390/s23052467.
2
Semantic-guided polarization image fusion method based on a dual-discriminator GAN.基于双鉴别器 GAN 的语义引导偏振图像融合方法。
Opt Express. 2022 Nov 21;30(24):43601-43621. doi: 10.1364/OE.472214.
3
VOLO: Vision Outlooker for Visual Recognition.VOLO:用于视觉识别的视觉展望器
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):6575-6586. doi: 10.1109/TPAMI.2022.3206108. Epub 2023 Apr 3.
4
CE-Net: Context Encoder Network for 2D Medical Image Segmentation.CE-Net:用于二维医学图像分割的上下文编码器网络。
IEEE Trans Med Imaging. 2019 Oct;38(10):2281-2292. doi: 10.1109/TMI.2019.2903562. Epub 2019 Mar 7.
5
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.