• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于非增强脑CT报告分类的深度学习算法的诊断精度

Diagnostic precision of a deep learning algorithm for the classification of non-contrast brain CT reports.

作者信息

Güzel Hamza Eren, Aşcı Göktuğ, Demirbilek Oytun, Özdemir Tuğçe Doğa, Erekli Pelin Berfin

机构信息

Department of Radiology, İzmir City Hospital, İzmir, Türkiye.

School of Science and Technology, IE University, Segovia, Spain.

出版信息

Front Radiol. 2025 May 9;5:1509377. doi: 10.3389/fradi.2025.1509377. eCollection 2025.

DOI:10.3389/fradi.2025.1509377
PMID:40417183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12098364/
Abstract

OBJECTIVE

This study aimed to determine the diagnostic precision of a deep learning algorithm for the classificaiton of non-contrast brain CT reports.

METHODS

A total of 1,861 non-contrast brain CT reports were randomly selected, anonymized, and annotated for urgency level by two radiologists, with review by a senior radiologist. The data, encrypted and stored in Excel format, were securely maintained on a university cloud system. Using Python 3.8.16, the reports were classified into four urgency categories: emergency, not emergency but needs timely attention, clinically non-significant and normal. The dataset was split, with 800 reports used for training and 200 for validation. The DistilBERT model, featuring six transformer layers and 66 million trainable parameters, was employed for text classification. Training utilized the Adam optimizer with a learning rate of 2e-5, a batch size of 32, and a dropout rate of 0.1 to prevent overfitting. The model achieved a mean F1 score of 0.85 through 5-fold cross-validation, demonstrating strong performance in categorizing radiology reports.

RESULTS

Of the 1,861 scans, 861 cases were identified as fit for study through the senior radiologist and self-hosted Label Studio interpretations. It was observed that the algorithm achieved a sensitivity of 91% and a specificity of 90% in the measurements made on the test data. The F1 score was measured as 0.89 for the best fold. The algorithm most successfully distinguished emergency results with positive predictive values that were unexpectedly lower than in previously reported studies. Beam hardening artifacts and excessive noise, compromising the quality of CT scan images, were significantly associated with decreased model performance.

CONCLUSION

This study revealed decreased diagnostic accuracy of an AI decision support system (DSS) at our institution. Despite extensive evaluation, we were unable to identify the source of this discrepancy, raising concerns about the generalizability of these tools with indeterminate failure modes. These results further highlight the need for standardized study design to allow for rigorous and reproducible site-to-site comparison of emerging deep learning technologies.

摘要

目的

本研究旨在确定一种深度学习算法对非增强脑CT报告进行分类的诊断精度。

方法

总共随机选择了1861份非增强脑CT报告,进行匿名处理,并由两名放射科医生标注紧急程度,再由一名资深放射科医生进行审核。这些数据以加密形式存储在Excel格式中,并安全地保存在大学云系统上。使用Python 3.8.16,将报告分为四个紧急类别:紧急、非紧急但需及时关注、临床无意义和正常。数据集被拆分,800份报告用于训练,200份用于验证。采用具有六个Transformer层和6600万个可训练参数的DistilBERT模型进行文本分类。训练使用Adam优化器,学习率为2e-5,批量大小为32,丢弃率为0.1以防止过拟合。该模型通过5折交叉验证获得了0.85的平均F1分数,在对放射学报告进行分类方面表现出强大性能。

结果

在186处扫描中,通过资深放射科医生和自托管的Label Studio解释,确定有861例适合研究。观察到该算法在对测试数据的测量中灵敏度达到91%,特异性达到90%。最佳折的F1分数为0.89。该算法最成功地区分了紧急结果,但其阳性预测值意外低于先前报道的研究。影响CT扫描图像质量的线束硬化伪影和过多噪声与模型性能下降显著相关。

结论

本研究揭示了我们机构中人工智能决策支持系统(DSS)的诊断准确性有所下降。尽管进行了广泛评估,但我们无法确定这种差异的来源,这引发了对这些具有不确定故障模式的工具的通用性的担忧。这些结果进一步凸显了标准化研究设计的必要性,以便对新兴深度学习技术进行严格且可重复的站点间比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/13577012e3af/fradi-05-1509377-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/44b98f9f6a09/fradi-05-1509377-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/8607f79d9682/fradi-05-1509377-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/13577012e3af/fradi-05-1509377-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/44b98f9f6a09/fradi-05-1509377-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/8607f79d9682/fradi-05-1509377-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d410/12098364/13577012e3af/fradi-05-1509377-g002.jpg

相似文献

1
Diagnostic precision of a deep learning algorithm for the classification of non-contrast brain CT reports.一种用于非增强脑CT报告分类的深度学习算法的诊断精度
Front Radiol. 2025 May 9;5:1509377. doi: 10.3389/fradi.2025.1509377. eCollection 2025.
2
Diagnostic Accuracy and Failure Mode Analysis of a Deep Learning Algorithm for the Detection of Intracranial Hemorrhage.深度学习算法检测颅内出血的诊断准确性和失效模式分析。
J Am Coll Radiol. 2021 Aug;18(8):1143-1152. doi: 10.1016/j.jacr.2021.03.005. Epub 2021 Apr 3.
3
Comparison of Chest Radiograph Interpretations by Artificial Intelligence Algorithm vs Radiology Residents.人工智能算法与放射科住院医师对胸部 X 线片解读的比较。
JAMA Netw Open. 2020 Oct 1;3(10):e2022779. doi: 10.1001/jamanetworkopen.2020.22779.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Automated abdominal CT contrast phase detection using an interpretable and open-source artificial intelligence algorithm.使用可解释和开源人工智能算法进行自动腹部 CT 对比期检测。
Eur Radiol. 2024 Oct;34(10):6680-6687. doi: 10.1007/s00330-024-10769-6. Epub 2024 Apr 29.
6
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
7
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.MABAL:一种用于机器辅助骨龄标注的新型深度学习架构。
J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.
8
Deep Learning-Based Detection and Classification of Bone Lesions on Staging Computed Tomography in Prostate Cancer: A Development Study.基于深度学习的前列腺癌分期 CT 骨病变检测与分类:一项开发研究。
Acad Radiol. 2024 Jun;31(6):2424-2433. doi: 10.1016/j.acra.2024.01.009. Epub 2024 Jan 22.
9
A New Deep Learning Algorithm for Detecting Spinal Metastases on Computed Tomography Images.一种用于在 CT 图像上检测脊柱转移瘤的新型深度学习算法。
Spine (Phila Pa 1976). 2024 Mar 15;49(6):390-397. doi: 10.1097/BRS.0000000000004889. Epub 2023 Dec 12.
10
Development and External Validation of a Deep Learning Algorithm to Identify and Localize Subarachnoid Hemorrhage on CT Scans.深度学习算法在 CT 扫描中识别和定位蛛网膜下腔出血的开发和外部验证。
Neurology. 2023 Mar 21;100(12):e1257-e1266. doi: 10.1212/WNL.0000000000201710. Epub 2023 Jan 13.

本文引用的文献

1
Machine learning and deep learning for classifying the justification of brain CT referrals.机器学习和深度学习在大脑 CT 转诊分类中的应用。
Eur Radiol. 2024 Dec;34(12):7944-7952. doi: 10.1007/s00330-024-10851-z. Epub 2024 Jun 24.
2
Classification of Diagnostic Certainty in Radiology Reports with Deep Learning.深度学习在放射学报告中的诊断确定性分类。
Stud Health Technol Inform. 2024 Jan 25;310:569-573. doi: 10.3233/SHTI231029.
3
Measuring appropriateness of diagnostic imaging: a scoping review.测量诊断性影像学的适宜性:一项范围综述
Insights Imaging. 2023 Apr 13;14(1):62. doi: 10.1186/s13244-023-01409-6.
4
Justification of CT practices across Europe: results of a survey of national competent authorities and radiology societies.欧洲CT实践的合理性:对国家主管当局和放射学会的调查结果
Insights Imaging. 2022 Nov 22;13(1):177. doi: 10.1186/s13244-022-01325-1.
5
A Systematic Review of Interventions to Reduce Computed Tomography Usage in the Emergency Department.一项关于减少急诊科计算机断层扫描使用的干预措施的系统评价。
Ann Emerg Med. 2022 Dec;80(6):548-560. doi: 10.1016/j.annemergmed.2022.06.001. Epub 2022 Aug 1.
6
Automated vetting of radiology referrals: exploring natural language processing and traditional machine learning approaches.放射学转诊的自动审核:探索自然语言处理和传统机器学习方法。
Insights Imaging. 2022 Aug 4;13(1):127. doi: 10.1186/s13244-022-01267-8.
7
Characterizing and quantifying low-value diagnostic imaging internationally: a scoping review.国际上低价值诊断成像的特征描述和量化:范围综述。
BMC Med Imaging. 2022 Apr 21;22(1):73. doi: 10.1186/s12880-022-00798-2.
8
Why Not? Persuading Clinicians to Reduce Overuse.为何不呢?说服临床医生减少过度医疗。
Mayo Clin Proc Innov Qual Outcomes. 2020 Jun 5;4(3):266-275. doi: 10.1016/j.mayocpiqo.2020.01.007. eCollection 2020 Jun.
9
Trends in Use of Medical Imaging in US Health Care Systems and in Ontario, Canada, 2000-2016.2000-2016 年美国医疗保健系统和加拿大安大略省医疗成像使用趋势。
JAMA. 2019 Sep 3;322(9):843-856. doi: 10.1001/jama.2019.11456.
10
National audit on the appropriateness of CT and MRI examinations in Luxembourg.卢森堡CT和MRI检查适宜性的全国性审计。
Insights Imaging. 2019 May 20;10(1):54. doi: 10.1186/s13244-019-0731-9.