在CT扫描的肺结节检测中深度学习需要多少隐私数据？一项回顾性多中心研究。

How Many Private Data Are Needed for Deep Learning in Lung Nodule Detection on CT Scans? A Retrospective Multicenter Study.

作者信息

Son Jeong Woo, Hong Ji Young, Kim Yoon, Kim Woo Jin, Shin Dae-Yong, Choi Hyun-Soo, Bak So Hyeon, Moon Kyoung Min

机构信息

ZIOVISION, Chuncheon 24341, Korea.

Division of Pulmonary and Critical Care Medicine, Department of Medicine, Chuncheon Sacred Heart Hospital, Hallym University Medical Center, Chuncheon 24253, Korea.

出版信息

Cancers (Basel). 2022 Jun 28;14(13):3174. doi: 10.3390/cancers14133174.

DOI:10.3390/cancers14133174

PMID:35804946

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9265117/

Abstract

Early detection of lung nodules is essential for preventing lung cancer. However, the number of radiologists who can diagnose lung nodules is limited, and considerable effort and time are required. To address this problem, researchers are investigating the automation of deep-learning-based lung nodule detection. However, deep learning requires large amounts of data, which can be difficult to collect. Therefore, data collection should be optimized to facilitate experiments at the beginning of lung nodule detection studies. We collected chest computed tomography scans from 515 patients with lung nodules from three hospitals and high-quality lung nodule annotations reviewed by radiologists. We conducted several experiments using the collected datasets and publicly available data from LUNA16. The object detection model, YOLOX was used in the lung nodule detection experiment. Similar or better performance was obtained when training the model with the collected data rather than LUNA16 with large amounts of data. We also show that weight transfer learning from pre-trained open data is very useful when it is difficult to collect large amounts of data. Good performance can otherwise be expected when reaching more than 100 patients. This study offers valuable insights for guiding data collection in lung nodules studies in the future.

摘要

早期发现肺结节对于预防肺癌至关重要。然而，能够诊断肺结节的放射科医生数量有限，且需要付出大量努力和时间。为了解决这一问题，研究人员正在研究基于深度学习的肺结节检测自动化。然而，深度学习需要大量数据，而这些数据可能难以收集。因此，在肺结节检测研究开始时，应优化数据收集以促进实验。我们从三家医院收集了515例肺结节患者的胸部计算机断层扫描图像以及经放射科医生审核的高质量肺结节标注。我们使用收集到的数据集和来自LUNA16的公开可用数据进行了多项实验。在肺结节检测实验中使用了目标检测模型YOLOX。用收集到的数据训练模型时，获得了与使用大量数据的LUNA16相似或更好的性能。我们还表明，当难以收集大量数据时，从预训练的开放数据进行权重迁移学习非常有用。否则，当患者数量超过100例时，可以预期会有良好的性能。本研究为未来肺结节研究中的数据收集提供了有价值的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22ed/9265117/0b4bdf4baeb4/cancers-14-03174-g0A1.jpg

相似文献

How Many Private Data Are Needed for Deep Learning in Lung Nodule Detection on CT Scans? A Retrospective Multicenter Study.在CT扫描的肺结节检测中深度学习需要多少隐私数据？一项回顾性多中心研究。

Cancers (Basel). 2022 Jun 28;14(13):3174. doi: 10.3390/cancers14133174.

A survey of computer-aided diagnosis of lung nodules from CT scans using deep learning.基于深度学习的 CT 扫描肺结节计算机辅助诊断研究综述。

Comput Biol Med. 2021 Oct;137:104806. doi: 10.1016/j.compbiomed.2021.104806. Epub 2021 Aug 25.

Validation of a Deep Learning Algorithm for the Detection of Malignant Pulmonary Nodules in Chest Radiographs.深度学习算法在胸部 X 光片中检测恶性肺结节的验证。

JAMA Netw Open. 2020 Sep 1;3(9):e2017135. doi: 10.1001/jamanetworkopen.2020.17135.

Deep Learning Reconstruction Shows Better Lung Nodule Detection for Ultra-Low-Dose Chest CT.深度学习重建对超低剂量胸部 CT 显示出更好的肺结节检测效果。

Radiology. 2022 Apr;303(1):202-212. doi: 10.1148/radiol.210551. Epub 2022 Jan 18.

Discrimination between transient and persistent subsolid pulmonary nodules on baseline CT using deep transfer learning.基于深度迁移学习的基线 CT 鉴别亚实性肺结节的良恶性。

Eur Radiol. 2020 Dec;30(12):6913-6923. doi: 10.1007/s00330-020-07071-6. Epub 2020 Jul 21.

Development of a deep learning-based method to diagnose pulmonary ground-glass nodules by sequential computed tomography imaging.基于深度学习的方法通过连续 CT 成像诊断肺磨玻璃结节的研究进展。

Thorac Cancer. 2022 Feb;13(4):602-612. doi: 10.1111/1759-7714.14305. Epub 2022 Jan 6.

A simplified cluster model and a tool adapted for collaborative labeling of lung cancer CT scans.简化的聚类模型和工具，适用于肺癌 CT 扫描的协作标注。

Comput Methods Programs Biomed. 2021 Jul;206:106111. doi: 10.1016/j.cmpb.2021.106111. Epub 2021 Apr 18.

LNDb challenge on automatic lung cancer patient management.LNDb 挑战赛：自动肺癌患者管理。

Med Image Anal. 2021 May;70:102027. doi: 10.1016/j.media.2021.102027. Epub 2021 Mar 5.

Preparing CT imaging datasets for deep learning in lung nodule analysis: Insights from four well-known datasets.为肺部结节分析中的深度学习准备CT成像数据集：来自四个知名数据集的见解。

Heliyon. 2023 Jun 16;9(6):e17104. doi: 10.1016/j.heliyon.2023.e17104. eCollection 2023 Jun.

Development and Validation of a Modified Three-Dimensional U-Net Deep-Learning Model for Automated Detection of Lung Nodules on Chest CT Images From the Lung Image Database Consortium and Japanese Datasets.基于肺部影像数据库联盟和日本数据集的胸部CT图像自动检测肺结节的改进三维U-Net深度学习模型的开发与验证

Acad Radiol. 2022 Feb;29 Suppl 2:S11-S17. doi: 10.1016/j.acra.2020.07.030. Epub 2020 Aug 21.

引用本文的文献

Integration of Single-Cell Analysis and Bulk RNA Sequencing Data Using Multi-Level Attention Graph Neural Network for Precise Prognostic Stratification in Thyroid Cancer.使用多级注意力图神经网络整合单细胞分析和批量RNA测序数据以实现甲状腺癌的精确预后分层

Cancers (Basel). 2025 Jul 21;17(14):2411. doi: 10.3390/cancers17142411.

Clinical efficacy of DSA-based features in predicting outcomes of acupuncture intervention on upper limb dysfunction following ischemic stroke.基于数字减影血管造影（DSA）特征预测缺血性中风后上肢功能障碍针刺干预疗效的研究

Chin Med. 2024 Nov 9;19(1):155. doi: 10.1186/s13020-024-01026-5.

Deep learning in bioinformatics.生物信息学中的深度学习。

Turk J Biol. 2023 Dec 18;47(6):366-382. doi: 10.55730/1300-0152.2671. eCollection 2023.

Imaging features and deep learning for prediction of pulmonary epithelioid hemangioendothelioma in CT images.CT图像中肺上皮样血管内皮瘤预测的影像学特征与深度学习

J Thorac Dis. 2024 Feb 29;16(2):935-947. doi: 10.21037/jtd-23-455. Epub 2024 Feb 23.

A robust model training strategy using hard negative mining in a weakly labeled dataset for lymphatic invasion in gastric cancer.基于弱标记数据集的硬负挖掘进行稳健模型训练策略在胃癌淋巴管浸润中的应用。

J Pathol Clin Res. 2024 Jan;10(1):e355. doi: 10.1002/cjp2.355. Epub 2023 Dec 20.

Artificial Intelligence in Oncology: A Topical Collection in 2022.肿瘤学中的人工智能：2022年专题文集

Cancers (Basel). 2023 Feb 7;15(4):1065. doi: 10.3390/cancers15041065.

本文引用的文献

Deep Learning Applications in Computed Tomography Images for Pulmonary Nodule Detection and Diagnosis: A Review.深度学习在计算机断层扫描图像中用于肺结节检测与诊断的应用综述

Diagnostics (Basel). 2022 Jan 25;12(2):298. doi: 10.3390/diagnostics12020298.

Radiomics-guided deep neural networks stratify lung adenocarcinoma prognosis from CT scans.基于放射组学的深度神经网络从 CT 扫描中分层肺腺癌预后。

Commun Biol. 2021 Nov 12;4(1):1286. doi: 10.1038/s42003-021-02814-7.

Identification of Benign and Malignant Lung Nodules in CT Images Based on Ensemble Learning Method.基于集成学习方法的 CT 图像中肺结节良恶性的识别。

Interdiscip Sci. 2022 Mar;14(1):130-140. doi: 10.1007/s12539-021-00472-1. Epub 2021 Nov 2.

TEM virus images: Benchmark dataset and deep learning classification.TEM 病毒图像：基准数据集和深度学习分类。

Comput Methods Programs Biomed. 2021 Sep;209:106318. doi: 10.1016/j.cmpb.2021.106318. Epub 2021 Jul 29.

Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强

J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.

Deep Learning-Based Stage-Wise Risk Stratification for Early Lung Adenocarcinoma in CT Images: A Multi-Center Study.基于深度学习的CT图像早期肺腺癌分期风险分层：一项多中心研究

Cancers (Basel). 2021 Jun 30;13(13):3300. doi: 10.3390/cancers13133300.

Deep Learning Enables Accurate Diagnosis of Novel Coronavirus (COVID-19) With CT Images.深度学习利用 CT 图像准确诊断新型冠状病毒（COVID-19）。

IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2775-2780. doi: 10.1109/TCBB.2021.3065361. Epub 2021 Dec 8.

Use of a Commercially Available Deep Learning Algorithm to Measure the Solid Portions of Lung Cancer Manifesting as Subsolid Lesions at CT: Comparisons with Radiologists and Invasive Component Size at Pathologic Examination.应用商用深度学习算法测量 CT 显示的亚实性肺病变中肺癌实性部分的大小：与放射科医生和病理检查的侵袭性成分大小的比较。

Radiology. 2021 Apr;299(1):202-210. doi: 10.1148/radiol.2021202803. Epub 2021 Feb 2.

Preoperative CT-based Deep Learning Model for Predicting Disease-Free Survival in Patients with Lung Adenocarcinomas.基于术前 CT 的深度学习模型预测肺腺癌患者无病生存。

Radiology. 2020 Jul;296(1):216-224. doi: 10.1148/radiol.2020192764. Epub 2020 May 12.

Current cancer situation in China: good or bad news from the 2018 Global Cancer Statistics?中国当前癌症形势：2018 年全球癌症统计数据带来的是好消息还是坏消息？

Cancer Commun (Lond). 2019 Apr 29;39(1):22. doi: 10.1186/s40880-019-0368-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在CT扫描的肺结节检测中深度学习需要多少隐私数据？一项回顾性多中心研究。

How Many Private Data Are Needed for Deep Learning in Lung Nodule Detection on CT Scans? A Retrospective Multicenter Study.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献