• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于3D医学分割的面向数据的八叉树逆层次顺序聚合混合Transformer-CNN

Data-Oriented Octree Inverse Hierarchical Order Aggregation Hybrid Transformer-CNN for 3D Medical Segmentation.

作者信息

Li Yuhua, Jiang Shan, Yang Zhiyong, Wang Lixiang, Wang Liwen, Zhou Zeyang

机构信息

Mechanical Engineering Department, Tianjin University, No. 135, Yaguan Road, Haihe Education Park, Jinnan District, Tianjin City, 300350, China.

出版信息

J Imaging Inform Med. 2025 Jan 7. doi: 10.1007/s10278-024-01299-0.

DOI:10.1007/s10278-024-01299-0
PMID:39777616
Abstract

The hybrid CNN-transformer structures harness the global contextualization of transformers with the local feature acuity of CNNs, propelling medical image segmentation to the next level. However, the majority of research has focused on the design and composition of hybrid structures, neglecting the data structure, which enhance segmentation performance, optimize resource efficiency, and bolster model generalization and interpretability. In this work, we propose a data-oriented octree inverse hierarchical order aggregation hybrid transformer-CNN (nnU-OctTN), which focuses on delving deeply into the data itself to identify and harness potential. The nnU-OctTN employs the U-Net as a foundational framework, with the node aggregation transformer serving as the encoder. Data features are stored within an octree data structure with each node computed autonomously yet interconnected through a block-to-block local information exchange mechanism. Oriented towards multi-resolution feature data map learning, a cross-fusion module has been designed that associates the encoder and decoder in a staggered vertical and horizontal approach. Inspired by nnUNet, our framework automatically adapts network parameters to the dataset instead of using pre-trained weights for initialization. The nnU-OctTN method was evaluated on the BTCV, ACDC, and BraTS datasets and achieved excellent performance with dice score coefficient (DSC) 86.95, 92.82, and 90.61, respectively, demonstrating its generalizability and effectiveness. Cross-fusion module effectiveness and model scalability are validated through ablation experiments on BTCV and Kidney. Extensive qualitative and quantitative experimental results demonstrate that nnU-OctTN achieves high-quality 3D medical segmentation that has competitive performance against current state-of-the-art methods, providing a promising idea for clinical applications.

摘要

混合卷积神经网络-Transformer结构利用了Transformer的全局上下文信息和卷积神经网络的局部特征敏锐度,将医学图像分割提升到了一个新的水平。然而,大多数研究都集中在混合结构的设计和组成上,而忽略了数据结构,数据结构可以提高分割性能、优化资源效率,并增强模型的泛化能力和可解释性。在这项工作中,我们提出了一种面向数据的八叉树逆层次顺序聚合混合Transformer-卷积神经网络(nnU-OctTN),该方法专注于深入研究数据本身以识别和利用潜在信息。nnU-OctTN采用U-Net作为基础框架,节点聚合Transformer作为编码器。数据特征存储在八叉树数据结构中,每个节点自主计算,但通过块到块的局部信息交换机制相互连接。针对多分辨率特征数据图学习,设计了一个交叉融合模块,该模块以交错的垂直和水平方式关联编码器和解码器。受nnUNet的启发,我们的框架自动使网络参数适应数据集,而不是使用预训练权重进行初始化。nnU-OctTN方法在BTCV、ACDC和BraTS数据集上进行了评估,分别以86.95、92.82和90.61的骰子相似系数(DSC)取得了优异的性能,证明了其泛化能力和有效性。通过在BTCV和肾脏数据集上的消融实验验证了交叉融合模块的有效性和模型的可扩展性。广泛的定性和定量实验结果表明,nnU-OctTN实现了高质量的3D医学分割,与当前的先进方法相比具有竞争力,为临床应用提供了一个有前景的思路。

相似文献

1
Data-Oriented Octree Inverse Hierarchical Order Aggregation Hybrid Transformer-CNN for 3D Medical Segmentation.用于3D医学分割的面向数据的八叉树逆层次顺序聚合混合Transformer-CNN
J Imaging Inform Med. 2025 Jan 7. doi: 10.1007/s10278-024-01299-0.
2
Dual encoder network with transformer-CNN for multi-organ segmentation.基于 Transformer-CNN 的双编码器网络的多器官分割。
Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.
3
A 3D hierarchical cross-modality interaction network using transformers and convolutions for brain glioma segmentation in MR images.一种使用变换和卷积的 3D 层次跨模态交互网络,用于磁共振图像中的脑胶质瘤分割。
Med Phys. 2024 Nov;51(11):8371-8389. doi: 10.1002/mp.17354. Epub 2024 Aug 13.
4
MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation.MSCT-UNET:U 形网络中的多尺度对比变换用于医学图像分割。
Phys Med Biol. 2023 Dec 28;69(1). doi: 10.1088/1361-6560/ad135d.
5
HmsU-Net: A hybrid multi-scale U-net based on a CNN and transformer for medical image segmentation.HmsU-Net:一种基于 CNN 和 Transformer 的混合多尺度 U-Net 模型,用于医学图像分割。
Comput Biol Med. 2024 Mar;170:108013. doi: 10.1016/j.compbiomed.2024.108013. Epub 2024 Jan 22.
6
VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.VSmTrans:一种融合自注意力机制和卷积的 3D 医学图像分割混合范式。
Med Image Anal. 2024 Dec;98:103295. doi: 10.1016/j.media.2024.103295. Epub 2024 Aug 24.
7
LW-CTrans: A lightweight hybrid network of CNN and Transformer for 3D medical image segmentation.LW-CTrans:一种用于3D医学图像分割的轻量级卷积神经网络(CNN)与Transformer混合网络。
Med Image Anal. 2025 May;102:103545. doi: 10.1016/j.media.2025.103545. Epub 2025 Mar 17.
8
EMCAH-Net: an effective multi-scale context aggregation hybrid network for medical image segmentation.EMCAH-Net:一种用于医学图像分割的高效多尺度上下文聚合混合网络。
Quant Imaging Med Surg. 2025 Apr 1;15(4):3064-3083. doi: 10.21037/qims-24-1983. Epub 2025 Mar 28.
9
TAC-UNet: transformer-assisted convolutional neural network for medical image segmentation.TAC-UNet:用于医学图像分割的Transformer辅助卷积神经网络。
Quant Imaging Med Surg. 2024 Dec 5;14(12):8824-8839. doi: 10.21037/qims-24-1229. Epub 2024 Nov 5.
10
MC-DC: An MLP-CNN Based Dual-path Complementary Network for Medical Image Segmentation.MC-DC:一种基于多层感知器-卷积神经网络的医学图像分割双路径互补网络
Comput Methods Programs Biomed. 2023 Dec;242:107846. doi: 10.1016/j.cmpb.2023.107846. Epub 2023 Oct 5.

本文引用的文献

1
HmsU-Net: A hybrid multi-scale U-net based on a CNN and transformer for medical image segmentation.HmsU-Net:一种基于 CNN 和 Transformer 的混合多尺度 U-Net 模型,用于医学图像分割。
Comput Biol Med. 2024 Mar;170:108013. doi: 10.1016/j.compbiomed.2024.108013. Epub 2024 Jan 22.
2
UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation.UNesT:用于高效医学分割的分层转换器的局部空间表示学习。
Med Image Anal. 2023 Dec;90:102939. doi: 10.1016/j.media.2023.102939. Epub 2023 Aug 25.
3
The Big Bang of Deep Learning in Ultrasound-Guided Surgery: A Review.
深度学习在超声引导手术中的大爆炸:综述。
IEEE Trans Ultrason Ferroelectr Freq Control. 2023 Sep;70(9):909-919. doi: 10.1109/TUFFC.2023.3255843. Epub 2023 Aug 29.
4
MedViT: A robust vision transformer for generalized medical image classification.MedViT:一种用于广义医学图像分类的鲁棒视觉Transformer。
Comput Biol Med. 2023 May;157:106791. doi: 10.1016/j.compbiomed.2023.106791. Epub 2023 Mar 14.
5
Intelligent Assistant Diagnosis System of Osteosarcoma MRI Image Based on Transformer and Convolution in Developing Countries.基于Transformer和卷积的发展中国家骨肉瘤MRI图像智能辅助诊断系统
IEEE J Biomed Health Inform. 2022 Nov;26(11):5563-5574. doi: 10.1109/JBHI.2022.3196043. Epub 2022 Nov 10.
6
The Medical Segmentation Decathlon.医学分割十项全能
Nat Commun. 2022 Jul 15;13(1):4128. doi: 10.1038/s41467-022-30695-9.
7
MNet: A multi-scale multi-view framework for multi-phase pancreas segmentation based on cross-phase non-local attention.MNet:一种基于跨阶段非局部注意力的多阶段胰腺分割多尺度多视图框架。
Med Image Anal. 2022 Jan;75:102232. doi: 10.1016/j.media.2021.102232. Epub 2021 Oct 13.
8
Automatic segmentation of organs at risk and tumors in CT images of lung cancer from partially labelled datasets with a semi-supervised conditional nnU-Net.使用半监督条件 nnU-Net 对部分标记数据集的肺癌 CT 图像中的危及器官和肿瘤进行自动分割。
Comput Methods Programs Biomed. 2021 Nov;211:106419. doi: 10.1016/j.cmpb.2021.106419. Epub 2021 Sep 15.
9
Deep learning for segmentation in radiation therapy planning: a review.深度学习在放射治疗计划中的分割应用:综述
J Med Imaging Radiat Oncol. 2021 Aug;65(5):578-595. doi: 10.1111/1754-9485.13286. Epub 2021 Jul 26.
10
Predicting gastric cancer outcome from resected lymph node histopathology images using deep learning.基于深度学习的胃淋巴结病理图像预测胃癌预后。
Nat Commun. 2021 Mar 12;12(1):1637. doi: 10.1038/s41467-021-21674-7.