OBELISK-Net：稀疏可变形卷积解决三维多器官分割问题，所需层数更少。

OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.

机构信息

Institute of Medical Informatics, University of Lübeck, Germany.

Biomedical Image Analysis Group, Imperial College London, UK.

出版信息

Med Image Anal. 2019 May;54:1-9. doi: 10.1016/j.media.2019.02.006. Epub 2019 Feb 13.

DOI:10.1016/j.media.2019.02.006

PMID:30807894

Abstract

Deep networks have set the state-of-the-art in most image analysis tasks by replacing handcrafted features with learned convolution filters within end-to-end trainable architectures. Still, the specifications of a convolutional network are subject to much manual design - the shape and size of the receptive field for convolutional operations is a very sensitive part that has to be tuned for different image analysis applications. 3D fully-convolutional multi-scale architectures with skip-connection that excel at semantic segmentation and landmark localisation have huge memory requirements and rely on large annotated datasets - an important limitation for wider adaptation in medical image analysis. We propose a novel and effective method based on trainable 3D convolution kernels that learns both filter coefficients and spatial filter offsets in a continuous space based on the principle of differentiable image interpolation first introduced for spatial transformer network. A deep network that incorporates this one binary extremely large and inflecting sparse kernel (OBELISK) filter requires fewer trainable parameters and less memory while achieving high quality results compared to fully-convolutional U-Net architectures on two challenging 3D CT multi-organ segmentation tasks. Extensive validation experiments indicate that the performance of sparse deformable convolutions is due to their ability to capture large spatial context with few expressive filter parameters and that network depth is not always necessary to learn complex shape and appearance features. A combination with conventional CNNs further improves the delineation of small organs with large shape variations and the fast inference time using flexible image sampling may offer new potential use cases for deep networks in computer-assisted, image-guided interventions.

摘要

深度网络通过在端到端可训练的架构中用学习到的卷积滤波器替代手工制作的特征，在大多数图像分析任务中达到了最新水平。然而，卷积网络的规格仍然需要大量的人工设计 - 卷积操作的感受野的形状和大小是一个非常敏感的部分，需要针对不同的图像分析应用进行调整。具有跳过连接的 3D 全卷积多尺度架构在语义分割和地标定位方面表现出色，但需要巨大的内存要求和大型注释数据集 - 这是医学图像分析更广泛应用的一个重要限制。我们提出了一种新颖而有效的方法，该方法基于可训练的 3D 卷积核，根据首先为空间变形网络引入的可微图像插值原理，在连续空间中学习滤波器系数和空间滤波器偏移。与完全卷积的 U-Net 架构相比，在两个具有挑战性的 3D CT 多器官分割任务上，包含这种可训练的二进制极端大且弯曲稀疏核（OBELISK）滤波器的深度网络需要更少的可训练参数和更少的内存，同时可以获得高质量的结果。广泛的验证实验表明，稀疏可变形卷积的性能归因于它们能够用少量表达性滤波器参数捕获大的空间上下文的能力，并且网络深度不一定是学习复杂形状和外观特征所必需的。与传统的 CNN 相结合，进一步提高了具有大形状变化的小器官的勾画能力，并且使用灵活的图像采样的快速推理时间可能为计算机辅助、图像引导干预中的深度网络提供新的潜在应用场景。

相似文献

OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.

Med Image Anal. 2019 May;54:1-9. doi: 10.1016/j.media.2019.02.006. Epub 2019 Feb 13.

Holistic decomposition convolution for effective semantic segmentation of medical volume images.

Med Image Anal. 2019 Oct;57:149-164. doi: 10.1016/j.media.2019.07.003. Epub 2019 Jul 8.

Mixture 2D Convolutions for 3D Medical Image Segmentation.

Int J Neural Syst. 2023 Jan;33(1):2250059. doi: 10.1142/S0129065722500599. Epub 2022 Nov 4.

DENSE-INception U-net for medical image segmentation.

Comput Methods Programs Biomed. 2020 Aug;192:105395. doi: 10.1016/j.cmpb.2020.105395. Epub 2020 Feb 15.

A new architecture combining convolutional and transformer-based networks for automatic 3D multi-organ segmentation on CT images.

Med Phys. 2023 Nov;50(11):6990-7002. doi: 10.1002/mp.16750. Epub 2023 Sep 22.

Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution.

BMC Med Inform Decis Mak. 2023 Feb 14;23(1):33. doi: 10.1186/s12911-023-02129-z.

DRINet for Medical Image Segmentation.

IEEE Trans Med Imaging. 2018 Nov;37(11):2453-2462. doi: 10.1109/TMI.2018.2835303. Epub 2018 May 10.

A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.

Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.

Abdominal multi-organ segmentation with organ-attention networks and statistical fusion.

Med Image Anal. 2019 Jul;55:88-102. doi: 10.1016/j.media.2019.04.005. Epub 2019 Apr 18.

TernaryNet: faster deep model inference without GPUs for medical 3D segmentation using sparse and binary convolutions.

Int J Comput Assist Radiol Surg. 2018 Sep;13(9):1311-1320. doi: 10.1007/s11548-018-1797-4. Epub 2018 May 30.

引用本文的文献

Enhanced abdominal multi-organ segmentation with 3D UNet and UNet + + deep neural networks utilizing the MONAI framework.

Abdom Radiol (NY). 2025 Jun 30. doi: 10.1007/s00261-025-05041-4.

A survey on deep learning in medical image registration: New technologies, uncertainty, evaluation metrics, and beyond.

Med Image Anal. 2025 Feb;100:103385. doi: 10.1016/j.media.2024.103385. Epub 2024 Nov 10.

Towards more precise automatic analysis: a systematic review of deep learning-based multi-organ segmentation.

Biomed Eng Online. 2024 Jun 8;23(1):52. doi: 10.1186/s12938-024-01238-8.

Dimensionality Reduction Hybrid U-Net for Brain Extraction in Magnetic Resonance Imaging.

Brain Sci. 2023 Nov 4;13(11):1549. doi: 10.3390/brainsci13111549.

Supervised Deep Generation of High-Resolution Arterial Phase Computed Tomography Kidney Substructure Atlas.

Proc SPIE Int Soc Opt Eng. 2022 Feb-Mar;12032. doi: 10.1117/12.2608290. Epub 2022 Apr 4.

BV-GAN: 3D time-of-flight magnetic resonance angiography cerebrovascular vessel segmentation using adversarial CNNs.

J Med Imaging (Bellingham). 2022 Jul;9(4):044503. doi: 10.1117/1.JMI.9.4.044503. Epub 2022 Aug 31.

Generating novel pituitary datasets from open-source imaging data and deep volumetric segmentation.

Pituitary. 2022 Dec;25(6):842-853. doi: 10.1007/s11102-022-01255-7. Epub 2022 Aug 9.

Fast four-dimensional cone-beam computed tomography reconstruction using deformable convolutional networks.

Med Phys. 2022 Oct;49(10):6461-6476. doi: 10.1002/mp.15806. Epub 2022 Jun 22.

Multi-contrast computed tomography healthy kidney atlas.

Comput Biol Med. 2022 Jul;146:105555. doi: 10.1016/j.compbiomed.2022.105555. Epub 2022 Apr 26.

Local Style Preservation in Improved GAN-Driven Synthetic Image Generation for Endoscopic Tool Segmentation.

Sensors (Basel). 2021 Jul 30;21(15):5163. doi: 10.3390/s21155163.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

OBELISK-Net：稀疏可变形卷积解决三维多器官分割问题，所需层数更少。

OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献