• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.

DOI:10.1109/TPAMI.2017.2699184
PMID:28463186
Abstract

In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks. It also allows us to effectively enlarge the field of view of filters to incorporate larger context without increasing the number of parameters or the amount of computation. Second, we propose atrous spatial pyramid pooling (ASPP) to robustly segment objects at multiple scales. ASPP probes an incoming convolutional feature layer with filters at multiple sampling rates and effective fields-of-views, thus capturing objects as well as image context at multiple scales. Third, we improve the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models. The commonly deployed combination of max-pooling and downsampling in DCNNs achieves invariance but has a toll on localization accuracy. We overcome this by combining the responses at the final DCNN layer with a fully connected Conditional Random Field (CRF), which is shown both qualitatively and quantitatively to improve localization performance. Our proposed "DeepLab" system sets the new state-of-art at the PASCAL VOC-2012 semantic image segmentation task, reaching 79.7 percent mIOU in the test set, and advances the results on three other datasets: PASCAL-Context, PASCAL-Person-Part, and Cityscapes. All of our code is made publicly available online.

摘要

在这项工作中,我们致力于使用深度学习进行语义图像分割,并做出了三个主要贡献,这些贡献在实验中被证明具有很大的实际价值。首先,我们强调了带有上采样滤波器的卷积,即“空洞卷积”,作为密集预测任务中的强大工具。空洞卷积允许我们在深度卷积神经网络中显式地控制特征响应的计算分辨率。它还允许我们有效地扩大滤波器的视野,在不增加参数数量或计算量的情况下纳入更大的上下文。其次,我们提出了空洞空间金字塔池化(ASPP),以在多个尺度上稳健地分割对象。ASPP 使用多个采样率和有效感受野的滤波器探测输入的卷积特征层,从而在多个尺度上捕获对象和图像上下文。第三,我们通过结合 DCNN 和概率图形模型的方法来提高对象边界的定位精度。DCNN 中通常采用的最大池化和下采样组合实现了不变性,但牺牲了定位精度。我们通过在最后一个 DCNN 层的响应与全连接条件随机场(CRF)相结合来克服这一问题,这在定性和定量上都显示出了提高定位性能的效果。我们提出的“DeepLab”系统在 PASCAL VOC-2012 语义图像分割任务中设定了新的技术水平,在测试集上达到了 79.7%的 mIOU,并在另外三个数据集 PASCAL-Context、PASCAL-Person-Part 和 Cityscapes 上取得了进展。我们的所有代码都在网上公开。

相似文献

1
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
2
Novel Method of Semantic Segmentation Applicable to Augmented Reality.适用于增强现实的语义分割新方法。
Sensors (Basel). 2020 Mar 20;20(6):1737. doi: 10.3390/s20061737.
3
A multiple-channel and atrous convolution network for ultrasound image segmentation.一种用于超声图像分割的多通道多孔卷积网络。
Med Phys. 2020 Dec;47(12):6270-6285. doi: 10.1002/mp.14512. Epub 2020 Oct 18.
4
A top-down manner-based DCNN architecture for semantic image segmentation.一种用于语义图像分割的基于自上而下方式的深度卷积神经网络(DCNN)架构。
PLoS One. 2017 Mar 24;12(3):e0174508. doi: 10.1371/journal.pone.0174508. eCollection 2017.
5
A Novel Upsampling and Context Convolution for Image Semantic Segmentation.一种用于图像语义分割的新型上采样与上下文卷积
Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.
6
Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation.用于高效语义分割的瀑布空洞空间池化架构。
Sensors (Basel). 2019 Dec 5;19(24):5361. doi: 10.3390/s19245361.
7
ADR-Net: Context extraction network based on M-Net for medical image segmentation.ADR-Net:基于M-Net的医学图像分割上下文提取网络。
Med Phys. 2020 Sep;47(9):4254-4264. doi: 10.1002/mp.14364. Epub 2020 Aug 2.
8
ACDSSNet: Atrous Convolution-Based Deep Semantic Segmentation Network for Efficient Detection of Sickle Cell Anemia.ACDSSNet:基于空洞卷积的深度语义分割网络,用于高效检测镰状细胞贫血。
IEEE J Biomed Health Inform. 2024 Oct;28(10):5676-5684. doi: 10.1109/JBHI.2024.3362843. Epub 2024 Oct 3.
9
Automated segmentation of macular edema in OCT using deep neural networks.利用深度神经网络自动分割 OCT 中的黄斑水肿。
Med Image Anal. 2019 Jul;55:216-227. doi: 10.1016/j.media.2019.05.002. Epub 2019 May 10.
10
Building Corner Detection in Aerial Images with Fully Convolutional Networks.基于全卷积网络的航空影像建筑物角点检测
Sensors (Basel). 2019 Apr 23;19(8):1915. doi: 10.3390/s19081915.

引用本文的文献

1
MDWC-Net: a multi-scale dynamic-weighting context network for precise spinal X-ray segmentation.MDWC-Net:一种用于精确脊柱X光分割的多尺度动态加权上下文网络。
Front Physiol. 2025 Aug 29;16:1651296. doi: 10.3389/fphys.2025.1651296. eCollection 2025.
2
MS-UNet: A Hybrid Network with a Multi-Scale Vision Transformer and Attention Learning Confusion Regions for Soybean Rust Fungus.MS-UNet:一种结合多尺度视觉Transformer和注意力学习混淆区域的用于大豆锈菌的混合网络。
Sensors (Basel). 2025 Sep 7;25(17):5582. doi: 10.3390/s25175582.
3
RST-Net: A Semantic Segmentation Network for Remote Sensing Images Based on a Dual-Branch Encoder Structure.
RST-Net:一种基于双分支编码器结构的遥感图像语义分割网络。
Sensors (Basel). 2025 Sep 5;25(17):5531. doi: 10.3390/s25175531.
4
Multi-Scale Guided Context-Aware Transformer for Remote Sensing Building Extraction.用于遥感建筑物提取的多尺度引导上下文感知Transformer
Sensors (Basel). 2025 Aug 29;25(17):5356. doi: 10.3390/s25175356.
5
A New Classification Method for High-Volume Fly Ash: Performance Based on Coal Source and Particle Size.一种大容量粉煤灰的新分类方法:基于煤源和粒度的性能
Materials (Basel). 2025 Sep 4;18(17):4145. doi: 10.3390/ma18174145.
6
Top-k Bottom All but Loss Strategy for Medical Image Segmentation.用于医学图像分割的Top-k Bottom All but损失策略
Diagnostics (Basel). 2025 Aug 29;15(17):2189. doi: 10.3390/diagnostics15172189.
7
Enhancing Oral Health Diagnostics With Hyperspectral Imaging and Computer Vision: Clinical Dataset Study.利用高光谱成像和计算机视觉增强口腔健康诊断:临床数据集研究
JMIR Med Inform. 2025 Sep 11;13:e76148. doi: 10.2196/76148.
8
Cherry-Net: real-time segmentation algorithm of cherry maturity based on improved PIDNet.樱桃网络:基于改进型PIDNet的樱桃成熟度实时分割算法
Front Plant Sci. 2025 Sep 3;16:1607205. doi: 10.3389/fpls.2025.1607205. eCollection 2025.
9
Lightweight rice leaf spot segmentation model based on improved DeepLabv3.基于改进的DeepLabv3的轻量级水稻叶斑病分割模型。
Front Plant Sci. 2025 Aug 22;16:1635302. doi: 10.3389/fpls.2025.1635302. eCollection 2025.
10
AI-powered automated model construction for patient-specific CFD simulations of aortic flows.用于主动脉血流患者特异性计算流体动力学模拟的人工智能驱动的自动模型构建。
Sci Adv. 2025 Sep 5;11(36):eadw2825. doi: 10.1126/sciadv.adw2825.