Suppr超能文献

肺叶分割:与本地内部模型相比,开源MOOSE、TotalSegmentator和LungMask模型的性能

Lung lobe segmentation: performance of open-source MOOSE, TotalSegmentator, and LungMask models compared to a local in-house model.

作者信息

Amini Elaheh, Klein Ran

机构信息

Systems and Computer Engineering, Carleton University, Ottawa, ON, Canada.

Division of Nuclear Medicine and Molecular Imaging, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada.

出版信息

Eur Radiol Exp. 2025 Sep 4;9(1):86. doi: 10.1186/s41747-025-00623-9.

Abstract

BACKGROUND

Lung lobe segmentation is required to assess lobar function with nuclear imaging before surgical interventions. We evaluated the performance of open-source deep learning-based lung lobe segmentation tools, compared to a similar nnU-Net model trained on a smaller but more representative clinical dataset.

MATERIALS AND METHODS

We collated and semi-automatically segmented an internal dataset of 164 computed tomography scans and classified them for task difficulty as easy, moderate, or hard. The performance of three open-source models-multi-organ objective segmentation (MOOSE), TotalSegmentator, and LungMask-was assessed using Dice similarity coefficient (DSC), robust Hausdorff distance (rHd95), and normalized surface distance (NSD). Additionally, we trained, validated, and tested an nnU-Net model using our local dataset and compared its performance with that of the other software on the test subset. All models were evaluated for generalizability using an external competition (LOLA11, n = 55).

RESULTS

TotalSegmentator outperformed MOOSE in DSC and NSD across all difficulty levels (p < 0.001), but not in rHd95 (p = 1.000). MOOSE and TotalSegmentator surpassed LungMask across metrics and difficulty classes (p < 0.001). Our model exceeded all other models on the internal dataset (n = 33) in all metrics, across all difficulty classes (p < 0.001), and on the external dataset. Missing lobes were correctly identified only by our model and LungMask in 3 and 1 of 7 cases, respectively.

CONCLUSION

Open-source segmentation tools perform well in straightforward cases but struggle in unfamiliar, complex cases. Training on diverse, specialized datasets can improve generalizability, emphasizing representative data over sheer quantity.

RELEVANCE STATEMENT

Training lung lobe segmentation models on a local variety of cases improves accuracy, thus enhancing presurgical planning, ventilation-perfusion analysis, and disease localization, potentially impacting treatment decisions and patient outcomes in respiratory and thoracic care.

KEY POINTS

Deep learning models trained on non-specialized datasets struggle with complex lung anomalies, yet their real-world limitations are insufficiently assessed. Training an identical model on a smaller yet clinically diverse and representative cohort improved performance in challenging cases. Data diversity outweighs the quantity in deep learning-based segmentation models. Accurate lung lobe segmentation may enhance presurgical assessment of lung lobar ventilation and perfusion function, optimizing clinical decision-making and patient outcomes.

摘要

背景

在手术干预前,需要进行肺叶分割以通过核成像评估肺叶功能。我们评估了基于深度学习的开源肺叶分割工具的性能,并与在较小但更具代表性的临床数据集上训练的类似nnU-Net模型进行了比较。

材料与方法

我们整理并半自动分割了一个包含164例计算机断层扫描的内部数据集,并将其根据任务难度分为简单、中等或困难。使用骰子相似系数(DSC)、稳健豪斯多夫距离(rHd95)和归一化表面距离(NSD)评估了三种开源模型——多器官目标分割(MOOSE)、TotalSegmentator和LungMask的性能。此外,我们使用本地数据集训练、验证并测试了一个nnU-Net模型,并在测试子集中将其性能与其他软件的性能进行了比较。使用一个外部竞赛数据集(LOLA11,n = 55)评估了所有模型的泛化能力。

结果

在所有难度级别上,TotalSegmentator在DSC和NSD方面均优于MOOSE(p < 0.001),但在rHd95方面并非如此(p = 1.000)。MOOSE和TotalSegmentator在各项指标和难度类别上均超过了LungMask(p < 0.001)。在内部数据集(n = 33)上,我们的模型在所有指标、所有难度类别以及外部数据集上均超过了所有其他模型(p < 0.001)。在7例病例中,只有我们的模型和LungMask分别正确识别出了3例和1例缺失的肺叶。

结论

开源分割工具在简单病例中表现良好,但在不熟悉、复杂的病例中存在困难。在多样的、专门的数据集上进行训练可以提高泛化能力,强调代表性数据而非单纯的数量。

相关性声明

在本地各种病例上训练肺叶分割模型可提高准确性,从而加强术前规划、通气灌注分析和疾病定位,可能会影响呼吸和胸科护理中的治疗决策和患者预后。

关键点

在非专门数据集上训练的深度学习模型在处理复杂肺异常时存在困难,但其在现实世界中的局限性尚未得到充分评估。在较小但临床多样且具代表性的队列上训练相同模型可提高在具有挑战性病例中的性能。在基于深度学习的分割模型中,数据多样性比数量更重要。准确的肺叶分割可加强对肺叶通气和灌注功能的术前评估,优化临床决策和患者预后。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fada/12411369/1c8909aad201/41747_2025_623_Fig1_HTML.jpg

相似文献

5
Semi-Supervised Learning Allows for Improved Segmentation With Reduced Annotations of Brain Metastases Using Multicenter MRI Data.
J Magn Reson Imaging. 2025 Jun;61(6):2469-2479. doi: 10.1002/jmri.29686. Epub 2025 Jan 10.
7
Brain tumor segmentation using deep learning: high performance with minimized MRI data.
Front Radiol. 2025 Jul 8;5:1616293. doi: 10.3389/fradi.2025.1616293. eCollection 2025.
10
Artificial intelligence for diagnosing exudative age-related macular degeneration.
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

本文引用的文献

1
TotalSegmentator: Robust Segmentation of 104 Anatomic Structures in CT Images.
Radiol Artif Intell. 2023 Jul 5;5(5):e230024. doi: 10.1148/ryai.230024. eCollection 2023 Sep.
3
A whole-body FDG-PET/CT Dataset with manually annotated Tumor Lesions.
Sci Data. 2022 Oct 4;9(1):601. doi: 10.1038/s41597-022-01718-3.
4
Fully Automated, Semantic Segmentation of Whole-Body F-FDG PET/CT Images Based on Data-Centric Artificial Intelligence.
J Nucl Med. 2022 Dec;63(12):1941-1948. doi: 10.2967/jnumed.122.264063. Epub 2022 Jun 30.
5
Study on Anatomical Variations in Fissures of Lung by CT Scan.
Indian J Radiol Imaging. 2022 Jan 11;31(4):797-804. doi: 10.1055/s-0041-1741045. eCollection 2021 Oct.
7
nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation.
Nat Methods. 2021 Feb;18(2):203-211. doi: 10.1038/s41592-020-01008-z. Epub 2020 Dec 7.
9
Development of the lung.
Cell Tissue Res. 2017 Mar;367(3):427-444. doi: 10.1007/s00441-016-2545-0. Epub 2017 Jan 31.
10
Pulmonary Fissure Detection in CT Images Using a Derivative of Stick Filter.
IEEE Trans Med Imaging. 2016 Jun;35(6):1488-500. doi: 10.1109/TMI.2016.2517680. Epub 2016 Jan 13.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验