Suppr超能文献

一项针对常规放射学病例(胸部X光、荧光透视和乳房X光检查)人工智能基准测试的国际非劣效性研究。

An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography.

作者信息

Arzamasov Kirill, Vasilev Yuriy, Vladzymyrskyy Anton, Omelyanskaya Olga, Shulkin Igor, Kozikhina Darya, Goncharova Inna, Gelezhe Pavel, Kirpichev Yury, Bobrovskaya Tatiana, Andreychenko Anna

机构信息

State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia.

Federal State Budgetary Institution "National Medical and Surgical Center Named after N.I. Pirogov" of the Ministry of Health of the Russian Federation, Nizhnyaya Pervomayskaya Street, 70, 105203 Moscow, Russia.

出版信息

Healthcare (Basel). 2023 Jun 8;11(12):1684. doi: 10.3390/healthcare11121684.

Abstract

An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studies in the datasets were labelled as containing or not containing target pathological findings based on a consensus of two experienced radiologists, and the results of a laboratory test and follow-up examination, where applicable. A total of 204 radiologists from 11 countries with various experience performed an assessment of the dataset with a 5-point Likert scale via a web platform. Eight commercial radiological AI models analyzed the same dataset. The AI AUROC was 0.87 (95% CI:0.83-0.9) versus 0.96 (95% CI 0.94-0.97) for radiologists. The sensitivity and specificity of AI versus radiologists were 0.71 (95% CI 0.64-0.78) versus 0.91 (95% CI 0.86-0.95) and 0.93 (95% CI 0.89-0.96) versus 0.9 (95% CI 0.85-0.94) for AI. The overall diagnostic accuracy of radiologists was superior to AI for chest X-ray and mammography. However, the accuracy of AI was noninferior to the least experienced radiologists for mammography and fluorography, and to all radiologists for chest X-ray. Therefore, an AI-based first reading could be recommended to reduce the workload burden of radiologists for the most common radiological studies such as chest X-ray and mammography.

摘要

开展了一项国际读者研究,以评估放射科医生解读胸部X光图像(包括荧光屏透视和乳房X光摄影图像)的平均诊断准确性,并确定独立放射学人工智能(AI)模型的要求。数据集中的回顾性研究根据两位经验丰富的放射科医生的共识,以及适用情况下的实验室检查和随访检查结果,标记为包含或不包含目标病理结果。来自11个国家的204名经验各异的放射科医生通过网络平台使用5点李克特量表对数据集进行了评估。八个商业放射学AI模型分析了相同的数据集。AI的受试者工作特征曲线下面积(AUROC)为0.87(95%CI:0.83 - 0.9),而放射科医生为0.96(95%CI:0.94 - 0.97)。AI与放射科医生相比的灵敏度和特异度分别为0.71(95%CI:0.64 - 0.78)对0.91(95%CI:0.86 - 0.95)以及0.93(95%CI:0.89 - 0.96)对0.9(95%CI:0.85 - 0.94)。对于胸部X光和乳房X光摄影,放射科医生的总体诊断准确性优于AI。然而,对于乳房X光摄影和荧光屏透视,AI的准确性不低于经验最少的放射科医生,对于胸部X光不低于所有放射科医生。因此,对于胸部X光和乳房X光摄影等最常见的放射学检查,建议采用基于AI的初读以减轻放射科医生的工作量负担。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f12e/10298418/abf5efdc38f0/healthcare-11-01684-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验