• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

训练期间的随机效应:对基于深度学习的医学图像分割的影响。

Random effects during training: Implications for deep learning-based medical image segmentation.

机构信息

Clinical Physiology, Department of Clinical Sciences Lund, Lund University, Lund, Sweden; Department of Biomedical Engineering, Faculty of Engineering, Lund University, Lund, Sweden.

Clinical Physiology, Department of Clinical Sciences Lund, Lund University, Lund, Sweden; Department of Biomedical Engineering, Faculty of Engineering, Lund University, Lund, Sweden; Wallenberg Centre for Molecular Medicine, Lund University, Lund, Sweden.

出版信息

Comput Biol Med. 2024 Sep;180:108944. doi: 10.1016/j.compbiomed.2024.108944. Epub 2024 Aug 2.

DOI:10.1016/j.compbiomed.2024.108944
PMID:39096609
Abstract

BACKGROUND

A single learning algorithm can produce deep learning-based image segmentation models that vary in performance purely due to random effects during training. This study assessed the effect of these random performance fluctuations on the reliability of standard methods of comparing segmentation models.

METHODS

The influence of random effects during training was assessed by running a single learning algorithm (nnU-Net) with 50 different random seeds for three multiclass 3D medical image segmentation problems, including brain tumour, hippocampus, and cardiac segmentation. Recent literature was sampled to find the most common methods for estimating and comparing the performance of deep learning segmentation models. Based on this, segmentation performance was assessed using both hold-out validation and 5-fold cross-validation and the statistical significance of performance differences was measured using the Paired t-test and the Wilcoxon signed rank test on Dice scores.

RESULTS

For the different segmentation problems, the seed producing the highest mean Dice score statistically significantly outperformed between 0 % and 76 % of the remaining seeds when estimating performance using hold-out validation, and between 10 % and 38 % when estimating performance using 5-fold cross-validation.

CONCLUSION

Random effects during training can cause high rates of statistically-significant performance differences between segmentation models from the same learning algorithm. Whilst statistical testing is widely used in contemporary literature, our results indicate that a statistically-significant difference in segmentation performance is a weak and unreliable indicator of a true performance difference between two learning algorithms.

摘要

背景

由于训练过程中的随机效应,单一学习算法可以产生性能差异很大的深度学习图像分割模型。本研究评估了这些随机性能波动对比较分割模型的标准方法可靠性的影响。

方法

通过对三个多类 3D 医学图像分割问题(脑肿瘤、海马体和心脏分割)运行 50 个不同随机种子的单个学习算法(nnU-Net),评估训练过程中随机效应的影响。从最近的文献中采样,以找到评估和比较深度学习分割模型性能的最常见方法。在此基础上,使用留一验证和 5 倍交叉验证评估分割性能,并使用 Paired t 检验和 Wilcoxon 符号秩检验测量 Dice 得分上的性能差异的统计学显著性。

结果

对于不同的分割问题,使用留一验证估计性能时,产生最高平均 Dice 得分的种子在统计学上显著优于剩余种子的 0%至 76%,而使用 5 倍交叉验证估计性能时,优于 10%至 38%。

结论

训练过程中的随机效应可能导致来自同一学习算法的分割模型之间出现高比例的统计学显著性能差异。虽然统计检验在当代文献中被广泛使用,但我们的结果表明,分割性能的统计学显著差异是两个学习算法之间真实性能差异的一个弱且不可靠的指标。

相似文献

1
Random effects during training: Implications for deep learning-based medical image segmentation.训练期间的随机效应:对基于深度学习的医学图像分割的影响。
Comput Biol Med. 2024 Sep;180:108944. doi: 10.1016/j.compbiomed.2024.108944. Epub 2024 Aug 2.
2
Semi-Supervised Learning Allows for Improved Segmentation With Reduced Annotations of Brain Metastases Using Multicenter MRI Data.半监督学习可利用多中心MRI数据,通过减少脑转移瘤的标注来改进分割。
J Magn Reson Imaging. 2025 Jun;61(6):2469-2479. doi: 10.1002/jmri.29686. Epub 2025 Jan 10.
3
Transformers for Neuroimage Segmentation: Scoping Review.用于神经图像分割的变压器:范围综述。
J Med Internet Res. 2025 Jan 29;27:e57723. doi: 10.2196/57723.
4
Predicting cognitive decline: Deep-learning reveals subtle brain changes in pre-MCI stage.预测认知衰退:深度学习揭示轻度认知障碍前阶段大脑的细微变化。
J Prev Alzheimers Dis. 2025 May;12(5):100079. doi: 10.1016/j.tjpad.2025.100079. Epub 2025 Feb 6.
5
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
6
Implementation of biomedical segmentation for brain tumor utilizing an adapted U-net model.利用改进的U-net模型实现脑肿瘤的生物医学分割。
Comput Biol Med. 2025 Aug;194:110531. doi: 10.1016/j.compbiomed.2025.110531. Epub 2025 Jun 11.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
9
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
10
Deep learning-based automated detection and segmentation of bone and traumatic bone marrow lesions from MRI following an acute ACL tear.基于深度学习的急性 ACL 撕裂后 MRI 中骨和外伤性骨髓病变的自动检测和分割。
Comput Biol Med. 2024 Aug;178:108791. doi: 10.1016/j.compbiomed.2024.108791. Epub 2024 Jun 20.

引用本文的文献

1
A Lightweight Dual-Output Vision Transformer for Enhanced Lung Nodule Classification Using CT Images.一种用于使用CT图像增强肺结节分类的轻量级双输出视觉Transformer
Technol Cancer Res Treat. 2025 Jan-Dec;24:15330338251370439. doi: 10.1177/15330338251370439. Epub 2025 Aug 21.
2
Research on Motion Transfer Method from Human Arm to Bionic Robot Arm Based on PSO-RF Algorithm.基于粒子群优化-随机森林算法的人体手臂到仿生机器人手臂运动传递方法研究
Biomimetics (Basel). 2025 Jun 11;10(6):392. doi: 10.3390/biomimetics10060392.