Suppr超能文献

儿童和青年成人甲状腺结节的超声检查:放射科医生印象、美国放射学会甲状腺影像报告和数据系统(ACR TI-RADS)与深度学习算法诊断性能的比较

Thyroid Nodules on Ultrasound in Children and Young Adults: Comparison of Diagnostic Performance of Radiologists' Impressions, ACR TI-RADS, and a Deep Learning Algorithm.

作者信息

Yang Jichen, Page Laura C, Wagner Lars, Wildman-Tobriner Benjamin, Bisset Logan, Frush Donald, Mazurowski Maciej A

机构信息

Department of Electrical and Computer Engineering, Edmund T. Pratt Jr. School of Engineering, Duke University, Box 90291, Durham, NC 27708.

Department of Pediatrics, Division of Pediatric Endocrinology and Diabetes, Duke University School of Medicine, Durham, NC.

出版信息

AJR Am J Roentgenol. 2023 Mar;220(3):408-417. doi: 10.2214/AJR.22.28231. Epub 2022 Oct 19.

Abstract

In current clinical practice, thyroid nodules in children are generally evaluated on the basis of radiologists' overall impressions of ultrasound images. The purpose of this article is to compare the diagnostic performance of radiologists' overall impression, the American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS), and a deep learning algorithm in differentiating benign and malignant thyroid nodules on ultrasound in children and young adults. This retrospective study included 139 patients (median age 17.5 years; 119 female patients, 20 male patients) evaluated from January 1, 2004, to September 18, 2020, who were 21 years old and younger with a thyroid nodule on ultrasound with definitive pathologic results from fine-needle aspiration and/or surgical excision to serve as the reference standard. A single nodule per patient was selected, and one transverse and one longitudinal image each of the nodules were extracted for further evaluation. Three radiologists independently characterized nodules on the basis of their overall impression (benign vs malignant) and ACR TI-RADS. A previously developed deep learning algorithm determined for each nodule a likelihood of malignancy, which was used to derive a risk level. Sensitivities and specificities for malignancy were calculated. Agreement was assessed using Cohen kappa coefficients. For radiologists' overall impression, sensitivity ranged from 32.1% to 75.0% (mean, 58.3%; 95% CI, 49.2-67.3%), and specificity ranged from 63.8% to 93.9% (mean, 79.9%; 95% CI, 73.8-85.7%). For ACR TI-RADS, sensitivity ranged from 82.1% to 87.5% (mean, 85.1%; 95% CI, 77.3-92.1%), and specificity ranged from 47.0% to 54.2% (mean, 50.6%; 95% CI, 41.4-59.8%). The deep learning algorithm had a sensitivity of 87.5% (95% CI, 78.3-95.5%) and specificity of 36.1% (95% CI, 25.6-46.8%). Interobserver agreement among pairwise combinations of readers, expressed as kappa, for overall impression was 0.227-0.472 and for ACR TI-RADS was 0.597-0.643. Both ACR TI-RADS and the deep learning algorithm had higher sensitivity albeit lower specificity compared with overall impressions. The deep learning algorithm had similar sensitivity but lower specificity than ACR TI-RADS. Interobserver agreement was higher for ACR TI-RADS than for overall impressions. ACR TI-RADS and the deep learning algorithm may serve as potential alternative strategies for guiding decisions to perform fine-needle aspiration of thyroid nodules in children.

摘要

在当前临床实践中,儿童甲状腺结节通常是根据放射科医生对超声图像的总体印象进行评估的。本文旨在比较放射科医生的总体印象、美国放射学会甲状腺影像报告和数据系统(ACR TI-RADS)以及一种深度学习算法在鉴别儿童及青年成人甲状腺结节超声图像中良恶性方面的诊断性能。这项回顾性研究纳入了139例患者(中位年龄17.5岁;119例女性患者,20例男性患者),这些患者在2004年1月1日至2020年9月18日期间接受评估,年龄在21岁及以下,超声检查发现甲状腺结节,且有细针穿刺和/或手术切除的明确病理结果作为参考标准。每位患者选取一个结节,并提取该结节的一张横向图像和一张纵向图像用于进一步评估。三名放射科医生根据总体印象(良性与恶性)和ACR TI-RADS对结节进行独立特征描述。一种先前开发的深度学习算法确定每个结节的恶性可能性,并据此得出风险水平。计算恶性的敏感性和特异性。使用Cohen kappa系数评估一致性。对于放射科医生的总体印象,敏感性范围为32.1%至75.0%(平均58.3%;95%CI,49.2 - 67.3%),特异性范围为63.8%至93.9%(平均79.9%;95%CI,73.8 - 85.7%)。对于ACR TI-RADS,敏感性范围为82.1%至87.5%(平均85.1%;95%CI,77.3 - 92.1%),特异性范围为47.0%至54.2%(平均50.6%;95%CI,41.4 - 59.8%)。深度学习算法的敏感性为87.5%(95%CI,78.3 - 95.5%),特异性为36.1%(95%CI,25.6 - 46.8%)。读者两两组合之间的观察者间一致性,以kappa表示,总体印象为0.227 - 0.472,ACR TI-RADS为0.597 - 0.643。与总体印象相比,ACR TI-RADS和深度学习算法均具有更高的敏感性,尽管特异性较低。深度学习算法与ACR TI-RADS相比具有相似的敏感性,但特异性较低。ACR TI-RADS的观察者间一致性高于总体印象。ACR TI-RADS和深度学习算法可能作为指导儿童甲状腺结节细针穿刺决策的潜在替代策略。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验