Statistical considerations for testing an AI algorithm used for prescreening lung CT images.

Authors

Obuchowski Nancy A, Bullen Jennifer A

Affiliation

Quantitative Health Sciences /JJN3, Cleveland Clinic Foundation, 9500 Euclid Ave, Cleveland, OH, 44195, USA.

Publication information

Contemp Clin Trials Commun. 2019 Aug 22;16:100434. doi: 10.1016/j.conctc.2019.100434. eCollection 2019 Dec.

DOI:10.1016/j.conctc.2019.100434

PMID:31485545

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6717063/

Abstract

Artificial intelligence, as applied to medical images to detect, rule out, diagnose, and stage disease, has seen enormous growth over the last few years. There are multiple use cases of AI algorithms in medical imaging: first-reader (or concurrent) mode, second-reader mode, triage mode, and more recently prescreening mode as when an AI algorithm is applied to the worklist of images to identify obvious negative cases so that human readers do not need to review them and can focus on interpreting the remaining cases. In this paper we describe the statistical considerations for designing a study to test a new AI prescreening algorithm for identifying normal lung cancer screening CTs. We contrast agreement vs. accuracy studies, and retrospective vs. prospective designs. We evaluate various test performance metrics with respect to their sensitivity to changes in the AI algorithm's performance, as well as to shifts in reader behavior to a revised worklist. We consider sample size requirements for testing the AI prescreening algorithm.
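The abstract names performance metrics and sample size requirements but gives no formulas. The following is a minimal illustrative sketch, not the authors' method. It assumes an accuracy study with a reference standard, a 2x2 table in which "positive" means the AI kept the case on the worklist, and a simple Wald (normal-approximation) confidence interval for sample size. The function names and all counts are hypothetical.

```python
import math
from statistics import NormalDist


def prescreen_metrics(tp, fp, fn, tn):
    """Summarize a 2x2 table for a prescreening algorithm.

    Assumed convention (not from the paper):
      tp: abnormal cases the AI kept on the worklist
      fn: abnormal cases the AI removed as "normal" (the costly error)
      tn: normal cases the AI removed
      fp: normal cases the AI kept on the worklist
    """
    total = tp + fp + fn + tn
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        # NPV: of the cases removed from the worklist, the share truly normal
        "npv": tn / (tn + fn),
        # Fraction of the worklist that readers no longer review
        "workload_removed": (tn + fn) / total,
    }


def n_for_proportion(p, half_width, conf_level=0.95):
    """Cases needed so a Wald CI around proportion p has the given half-width."""
    z = NormalDist().inv_cdf(1 - (1 - conf_level) / 2)
    return math.ceil(z ** 2 * p * (1 - p) / half_width ** 2)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only
    for name, value in prescreen_metrics(tp=95, fp=300, fn=5, tn=600).items():
        print(f"{name}: {value:.3f}")
    # Abnormal cases needed to estimate a sensitivity near 0.95 to within +/- 0.03
    print(n_for_proportion(p=0.95, half_width=0.03))
```

For sensitivity, the count returned by `n_for_proportion` refers to abnormal cases only, so the total number of screening CTs needed is larger and depends on disease prevalence. A Wald interval is a rough approximation when the proportion is close to 1; an exact or score interval may be more appropriate in practice.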

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/92b7/6717063/450421c80365/gr1.jpg
