Potipimpanon Pimrada, Charakorn Natamon, Hirunwiwatkul Prakobkiat
Department of Otolaryngology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
Eur Arch Otorhinolaryngol. 2022 Nov;279(11):5363-5373. doi: 10.1007/s00405-022-07436-1. Epub 2022 Jun 29.
Thyroid nodules are common. Ultrasonography (US) is the first investigation for thyroid nodules. Artificial Intelligence (AI) is widely integrated into medical diagnosis to provide additional information. The primary objective of this study was to accumulate the pooled sensitivity and specificity between all available AI and radiologists using thyroid US imaging. The secondary objective was to compare AI's diagnostic performance to that of radiologists.
A systematic review meta-analysis. PubMed, Scopus, Web of Science, and Cochrane Library data were searched for studies from inception until June 11, 2020.
Twenty five studies were included in this meta-analysis. The pooled sensitivity and specificity of AI were 0.86 (95% CI 0.81-0.91) and 0.78 (95% CI 0.73-0.83), respectively. The pooled sensitivity and specificity of radiologists were 0.85 (95% CI 0.80-0.89) and 0.82 (95% CI 0.77-0.86), respectively. The accuracy of AI and radiologists is equivalent in terms of AUC [AI 0.89 (95% CI 0.86-0.92), radiologist 0.91 (95% CI 0.88-0.93)]. The diagnostic odd ratio (DOR) between AI 23.10 (95% CI 14.20-37.58) and radiologists 27.12 (95% CI 17.45-42.16) had no statistically significant difference (P = 0.56). Meta-regression analysis revealed that Deep Learning AI had significantly greater sensitivity and specificity than classic machine learning AI (P < 0.001).
AI demonstrated comparable performance to radiologists in diagnosing benign and malignant thyroid nodules using ultrasonography. Additional research to establish its equivalency should be conducted.
甲状腺结节很常见。超声检查(US)是甲状腺结节的首选检查方法。人工智能(AI)已广泛应用于医学诊断以提供更多信息。本研究的主要目的是汇总所有可用的人工智能与放射科医生使用甲状腺超声成像的合并敏感度和特异度。次要目的是比较人工智能与放射科医生的诊断性能。
一项系统评价荟萃分析。检索了PubMed、Scopus、Web of Science和Cochrane图书馆数据库,纳入从数据库建立至2020年6月11日的研究。
本荟萃分析纳入了25项研究。人工智能的合并敏感度和特异度分别为0.86(95%CI 0.81 - 0.91)和0.78(95%CI 0.73 - 0.83)。放射科医生的合并敏感度和特异度分别为0.85(95%CI 0.80 - 0.89)和0.82(95%CI 0.77 - 0.86)。就曲线下面积而言,人工智能和放射科医生的准确性相当[人工智能0.89(95%CI 0.86 - 0.92),放射科医生0.91(95%CI 0.88 - 0.93)]。人工智能的诊断比值比(DOR)为23.10(95%CI 14.20 - 37.58),放射科医生的诊断比值比为27.12(95%CI 17.45 - 42.16),两者无统计学显著差异(P = 0.56)。Meta回归分析显示,深度学习人工智能的敏感度和特异度显著高于经典机器学习人工智能(P < 0.001)。
在使用超声诊断甲状腺良恶性结节方面,人工智能与放射科医生的表现相当。应开展进一步研究以确定其等效性。