使用深度卷积神经网络在CT图像上检测尿路结石：模型性能与泛化能力评估

Urinary Stone Detection on CT Images Using Deep Convolutional Neural Networks: Evaluation of Model Performance and Generalization.

作者信息

Parakh Anushri, Lee Hyunkwang, Lee Jeong Hyun, Eisner Brian H, Sahani Dushyant V, Do Synho

机构信息

Departments of Radiology (A.P., H.L., D.V.S., S.D.) and Urology (B.H.E.), Massachusetts General Hospital, 55 Fruit St, White 270, Boston, MA 02114; John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Mass (H.L.): and Department of Radiology and Center for Imaging Science, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea (J.H.L.).

出版信息

Radiol Artif Intell. 2019 Jul 24;1(4):e180066. doi: 10.1148/ryai.2019180066. eCollection 2019 Jul.

DOI:10.1148/ryai.2019180066

PMID:33937795

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8017404/

Abstract

PURPOSE

To investigate the diagnostic accuracy of cascading convolutional neural network (CNN) for urinary stone detection on unenhanced CT images and to evaluate the performance of pretrained models enriched with labeled CT images across different scanners.

MATERIALS AND METHODS

This HIPAA-compliant, institutional review board-approved, retrospective clinical study used unenhanced abdominopelvic CT scans from 535 adults suspected of having urolithiasis. The scans were obtained on two scanners (scanner 1 [hereafter S1] and scanner 2 [hereafter S2]). A radiologist reviewed clinical reports and labeled cases for determination of reference standard. Stones were present on 279 (S1, 131; S2, 148) and absent on 256 (S1, 158; S2, 98) scans. One hundred scans (50 from each scanner) were randomly reserved as the test dataset, and the rest were used for developing a cascade of two CNNs: The first CNN identified the extent of the urinary tract, and the second CNN detected presence of stone. Nine variations of models were developed through the combination of different training data sources (S1, S2, or both [hereafter SB]) with (ImageNet, GrayNet) and without (Random) pretrained CNNs. First, models were compared for generalizability at the section level. Second, models were assessed by using area under the receiver operating characteristic curve (AUC) and accuracy at the patient level with test dataset from both scanners ( = 100).

RESULTS

The GrayNet-pretrained model showed higher classifier exactness than did ImageNet-pretrained or Random-initialized models when tested by using data from the same or different scanners at section level. At the patient level, the AUC for stone detection was 0.92-0.95, depending on the model. Accuracy of GrayNet-SB (95%) was higher than that of ImageNet-SB (91%) and Random-SB (88%). For stones larger than 4 mm, all models showed similar performance (false-negative results: two of 34). For stones smaller than 4 mm, the number of false-negative results for GrayNet-SB, ImageNet-SB, and Random-SB were one of 16, three of 16, and five of 16, respectively. GrayNet-SB identified stones in all 22 test cases that had obstructive uropathy.

CONCLUSION

A cascading model of CNNs can detect urinary tract stones on unenhanced CT scans with a high accuracy (AUC, 0.954). Performance and generalization of CNNs across scanners can be enhanced by using transfer learning with datasets enriched with labeled medical images.© RSNA, 2019

摘要

目的

研究级联卷积神经网络（CNN）在未增强CT图像上检测尿路结石的诊断准确性，并评估在不同扫描仪上通过标注CT图像增强的预训练模型的性能。

材料与方法

本符合健康保险流通与责任法案（HIPAA）、经机构审查委员会批准的回顾性临床研究，使用了535例疑似患有尿石症的成人的未增强腹部盆腔CT扫描。扫描在两台扫描仪（扫描仪1[以下简称S1]和扫描仪2[以下简称S2]）上进行。一名放射科医生查阅临床报告并标注病例以确定参考标准。279例扫描发现结石（S1，131例；S2，148例），256例扫描未发现结石（S1，158例；S2，98例）。随机保留100例扫描（每台扫描仪50例）作为测试数据集，其余用于开发由两个CNN组成的级联模型：第一个CNN识别尿路范围，第二个CNN检测结石的存在。通过将不同的训练数据源（S1、S2或两者[以下简称SB]）与（ImageNet、GrayNet）预训练的CNN以及未预训练的CNN（随机）组合，开发了9种模型变体。首先，在层面水平上比较模型的通用性。其次，使用来自两台扫描仪的测试数据集（n = 100），通过受试者操作特征曲线下面积（AUC）和患者水平的准确性来评估模型。

结果

在层面水平上，当使用来自相同或不同扫描仪的数据进行测试时，GrayNet预训练模型显示出比ImageNet预训练或随机初始化模型更高的分类器准确性。在患者水平上，根据模型不同，结石检测的AUC为0.92 - 0.95。GrayNet - SB（95%）的准确性高于ImageNet - SB（91%）和随机 - SB（88%）。对于直径大于4 mm的结石，所有模型表现相似（假阴性结果：34例中的2例）。对于直径小于4 mm的结石，GrayNet - SB、ImageNet - SB和随机 - SB的假阴性结果数分别为16例中的1例、16例中的3例和16例中的5例。GrayNet - SB在所有22例患有梗阻性尿路病的测试病例中均识别出结石。

结论

CNN级联模型能够在未增强CT扫描上高精度地检测尿路结石（AUC，0.954）。通过对富含标注医学图像的数据集进行迁移学习，可以提高CNN在不同扫描仪之间的性能和通用性。©RSNA，2019

相似文献

Urinary Stone Detection on CT Images Using Deep Convolutional Neural Networks: Evaluation of Model Performance and Generalization.使用深度卷积神经网络在CT图像上检测尿路结石：模型性能与泛化能力评估

Radiol Artif Intell. 2019 Jul 24;1(4):e180066. doi: 10.1148/ryai.2019180066. eCollection 2019 Jul.

Visual Transformers and Convolutional Neural Networks for Disease Classification on Radiographs: A Comparison of Performance, Sample Efficiency, and Hidden Stratification.用于X光片疾病分类的视觉Transformer和卷积神经网络：性能、样本效率及隐藏分层的比较

Radiol Artif Intell. 2022 Sep 21;4(6):e220012. doi: 10.1148/ryai.220012. eCollection 2022 Nov.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

Feasibility of a generalized convolutional neural network for automated identification of vertebral compression fractures: The Manitoba Bone Mineral Density Registry.基于广义卷积神经网络的椎体压缩性骨折自动识别的可行性：曼尼托巴骨密度登记处研究。

Bone. 2021 Sep;150:116017. doi: 10.1016/j.bone.2021.116017. Epub 2021 May 19.

Rapid kVp switching dual-energy CT in the assessment of urolithiasis in patients with large body habitus: preliminary observations on image quality and stone characterization.大体积体型患者尿路结石评估中的快速 kVp 切换双能 CT：初步观察图像质量和结石特征。

Abdom Radiol (NY). 2019 Mar;44(3):1019-1026. doi: 10.1007/s00261-018-1808-5.

Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks.胸部放射摄影中的深度学习：使用卷积神经网络自动分类肺结核。

Radiology. 2017 Aug;284(2):574-582. doi: 10.1148/radiol.2017162326. Epub 2017 Apr 24.

Detection and characterization of urinary stones using material-specific images derived from contrast-enhanced dual-energy CT urography.利用对比增强双能 CT 尿路造影术获得的物质特异性图像检测和描述尿路结石。

Br J Radiol. 2023 Dec;96(1152):20230337. doi: 10.1259/bjr.20230337. Epub 2023 Oct 24.

Strategies to Improve Convolutional Neural Network Generalizability and Reference Standards for Glaucoma Detection From OCT Scans.提高卷积神经网络泛化能力的策略以及从 OCT 扫描中检测青光眼的参考标准。

Transl Vis Sci Technol. 2021 Apr 1;10(4):16. doi: 10.1167/tvst.10.4.16.

Virtual Unenhanced Dual-Energy CT Images Obtained with a Multimaterial Decomposition Algorithm: Diagnostic Value for Renal Mass and Urinary Stone Evaluation.基于多物质分解算法的虚拟非增强双能 CT 图像：用于评估肾肿块和尿路结石的诊断价值。

Radiology. 2021 Mar;298(3):611-619. doi: 10.1148/radiol.2021192448. Epub 2021 Jan 19.

Evaluation of a multiview architecture for automatic vertebral labeling of palliative radiotherapy simulation CT images.评估一种多视图架构，用于自动标记姑息性放疗模拟 CT 图像的椎体。

Med Phys. 2020 Nov;47(11):5592-5608. doi: 10.1002/mp.14415. Epub 2020 Sep 15.

引用本文的文献

Deep Learning Radiomics Model Based on Computed Tomography Image for Predicting the Classification of Osteoporotic Vertebral Fractures: Algorithm Development and Validation.基于计算机断层扫描图像的深度学习放射组学模型用于预测骨质疏松性椎体骨折的分类：算法开发与验证

JMIR Med Inform. 2025 Aug 29;13:e75665. doi: 10.2196/75665.

Artificial intelligence (AI) and CT in abdominal imaging: image reconstruction and beyond.人工智能（AI）与腹部成像中的CT：图像重建及其他

Abdom Radiol (NY). 2025 Jun 16. doi: 10.1007/s00261-025-05031-6.

Developing a Deep Learning Radiomics Model Combining Lumbar CT, Multi-Sequence MRI, and Clinical Data to Predict High-Risk Adjacent Segment Degeneration Following Lumbar Fusion: A Retrospective Multicenter Study.开发一种结合腰椎CT、多序列MRI和临床数据的深度学习放射组学模型，以预测腰椎融合术后的高风险相邻节段退变：一项回顾性多中心研究。

Global Spine J. 2025 Jun 9:21925682251342531. doi: 10.1177/21925682251342531.

Fine-tuned deep learning models for early detection and classification of kidney conditions in CT imaging.用于CT成像中肾脏疾病早期检测和分类的微调深度学习模型。

Sci Rep. 2025 Mar 28;15(1):10741. doi: 10.1038/s41598-025-94905-2.

Development of a deep learning radiomics model combining lumbar CT, multi-sequence MRI, and clinical data to predict high-risk cage subsidence after lumbar fusion: a retrospective multicenter study.一种结合腰椎CT、多序列MRI和临床数据的深度学习放射组学模型的开发，用于预测腰椎融合术后高风险椎间融合器下沉：一项回顾性多中心研究。

Biomed Eng Online. 2025 Mar 2;24(1):27. doi: 10.1186/s12938-025-01355-y.

Artificial intelligence in urolithiasis: a systematic review of utilization and effectiveness.人工智能在尿石症中的应用：利用和有效性的系统评价。

World J Urol. 2024 Oct 17;42(1):579. doi: 10.1007/s00345-024-05268-8.

Construction and Validation of a General Medical Image Dataset for Pretraining.用于预训练的通用医学图像数据集的构建与验证

J Imaging Inform Med. 2025 Apr;38(2):1051-1061. doi: 10.1007/s10278-024-01226-3. Epub 2024 Aug 15.

CT-based AI model for predicting therapeutic outcomes in ureteral stones after single extracorporeal shock wave lithotripsy through a cohort study.通过一项队列研究建立基于CT的人工智能模型，用于预测单次体外冲击波碎石术后输尿管结石的治疗效果。

Int J Surg. 2024 Oct 1;110(10):6601-6609. doi: 10.1097/JS9.0000000000001820.

Exploring deep learning radiomics for classifying osteoporotic vertebral fractures in X-ray images.探索深度学习放射组学在 X 射线图像中分类骨质疏松性椎体骨折。

Front Endocrinol (Lausanne). 2024 Mar 28;15:1370838. doi: 10.3389/fendo.2024.1370838. eCollection 2024.

Automatic Urinary Stone Detection System for Abdominal Non-Enhanced CT Images Reduces the Burden on Radiologists.用于腹部非增强 CT 图像的自动尿路结石检测系统可减轻放射科医生的负担。

J Imaging Inform Med. 2024 Apr;37(2):444-454. doi: 10.1007/s10278-023-00946-2. Epub 2024 Jan 10.

本文引用的文献

A systematic study of the class imbalance problem in convolutional neural networks.卷积神经网络中类不平衡问题的系统研究。

Neural Netw. 2018 Oct;106:249-259. doi: 10.1016/j.neunet.2018.07.011. Epub 2018 Jul 29.

Deep learning for segmentation of brain tumors: Impact of cross-institutional training and testing.深度学习在脑肿瘤分割中的应用：跨机构训练和测试的影响。

Med Phys. 2018 Mar;45(3):1150-1158. doi: 10.1002/mp.12752. Epub 2018 Feb 8.

How artificial intelligence could transform emergency department operations.人工智能如何改变急诊科的运营。

Am J Emerg Med. 2018 Aug;36(8):1515-1517. doi: 10.1016/j.ajem.2018.01.017. Epub 2018 Jan 4.

Machine-Learning-Based Electronic Triage More Accurately Differentiates Patients With Respect to Clinical Outcomes Compared With the Emergency Severity Index.基于机器学习的电子分诊在区分患者临床结局方面比急诊严重指数更准确。

Ann Emerg Med. 2018 May;71(5):565-574.e2. doi: 10.1016/j.annemergmed.2017.08.005. Epub 2017 Sep 6.

A renal colic fast track pathway to improve waiting times and outcomes for patients presenting to the emergency department.一条肾绞痛快速通道，以改善急诊科患者的等待时间和治疗结果。

Open Access Emerg Med. 2017 Jul 24;9:53-55. doi: 10.2147/OAEM.S138470. eCollection 2017.

Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker.利用深度学习从原始成像数据预测大脑年龄，可得到可靠的可遗传生物标志物。

Neuroimage. 2017 Dec;163:115-124. doi: 10.1016/j.neuroimage.2017.07.059. Epub 2017 Jul 29.

Automated Critical Test Findings Identification and Online Notification System Using Artificial Intelligence in Imaging.基于人工智能的医学影像学危急值自动识别与在线通知系统

Radiology. 2017 Dec;285(3):923-931. doi: 10.1148/radiol.2017162664. Epub 2017 Jul 3.

Medical Image Data and Datasets in the Era of Machine Learning-Whitepaper from the 2016 C-MIMI Meeting Dataset Session.机器学习时代的医学图像数据与数据集——2016年C-MIMI会议数据集研讨会白皮书

J Digit Imaging. 2017 Aug;30(4):392-399. doi: 10.1007/s10278-017-9976-3.

Radiology. 2017 Aug;284(2):574-582. doi: 10.1148/radiol.2017162326. Epub 2017 Apr 24.

Fully Automated Deep Learning System for Bone Age Assessment.用于骨龄评估的全自动深度学习系统。

J Digit Imaging. 2017 Aug;30(4):427-441. doi: 10.1007/s10278-017-9955-8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。