Guo Fangqi, Chang Wanru, Zhao Jiaqi, Xu Lei, Zheng Xuan, Guo Jia
Department of Ultrasound, Second Affiliated Hospital (Changzheng Hospital) of Naval Medical University, Shanghai, China.
Department of Ultrasound, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai, China.
Quant Imaging Med Surg. 2023 Feb 1;13(2):695-706. doi: 10.21037/qims-22-85. Epub 2023 Jan 2.
Thyroid cancer is the most common endocrine cancer in the world. Accurately distinguishing between benign and malignant thyroid nodules is particularly important for the early diagnosis and treatment of thyroid cancer. This study aimed to investigate the best possible optimization strategies for an already-trained artificial intelligence (AI)-based automated diagnostic system for thyroid nodule screening and, in addition, to scrutinize the clinically relevant limitations using stratified analysis to better standardize the application in clinical workflows.
We retrospectively reviewed a total of 1,092 ultrasound images associated with 397 thyroid nodules collected from 287 patients between April 2019 and January 2021, applying postoperative pathology as the gold standard. We applied different statistical approaches, including averages, maximums, and percentiles, to estimate per-nodule-based malignancy scores from the malignancy scores per image predicted by AI-SONIC Thyroid v. 5.3.0.2 (Demetics Medical Technology Ltd., Hangzhou, China) system, and we assessed its diagnostic efficacy on nodules of different sizes or tumor types with per-nodule analysis using performance metrics.
Of the 397 thyroid nodules, 272 thyroid nodules were overrepresented by malignant nodules according to the results of the surgical pathological examinations. Taking the median of the malignancy scores per image to estimate the nodule-based score with a cutoff value of 0.56 optimized for the means of sensitivity and specificity, the AI-based automated detection system demonstrated slightly lower sensitivity, significantly higher specificity (almost independent of nodule size), and similar accuracy to that of the senior radiologist. Both the AI system and the senior radiologist demonstrated higher sensitivity in diagnosing smaller nodules (≤25 mm) and comparable diagnostic performances for larger nodules. The mean diagnostic time per nodule of the AI system was 0.146 s, which was in sharp contrast to the 2.8 to 4.5 min of the radiologists.
Using our optimization strategy to achieve nodule-based diagnosis, the AI-SONIC Thyroid automated diagnostic system demonstrated an overall diagnostic accuracy equivalent to that of senior radiologists. Thus, it is expected that it can be used as a reliable auxiliary diagnostic method by radiologists for the screening and preoperative evaluation of malignant thyroid nodules.
甲状腺癌是全球最常见的内分泌癌。准确区分甲状腺结节的良恶性对于甲状腺癌的早期诊断和治疗尤为重要。本研究旨在探讨针对已训练的基于人工智能(AI)的甲状腺结节筛查自动诊断系统的最佳优化策略,并通过分层分析审视其临床相关局限性,以更好地规范其在临床工作流程中的应用。
我们回顾性分析了2019年4月至2021年1月期间从287例患者中收集的与397个甲状腺结节相关的1092张超声图像,将术后病理结果作为金标准。我们应用了不同的统计方法,包括平均值、最大值和百分位数,根据AI-SONIC甲状腺v. 5.3.0.2(德美特医疗科技有限公司,中国杭州)系统预测的每张图像的恶性肿瘤评分来估计基于结节的恶性肿瘤评分,并使用性能指标通过基于结节的分析评估其对不同大小或肿瘤类型结节的诊断效能。
根据手术病理检查结果,397个甲状腺结节中,272个甲状腺结节恶性结节占比过高。以每张图像恶性肿瘤评分的中位数来估计基于结节的评分,以灵敏度和特异度均值优化后的截断值为0.56,基于AI的自动检测系统显示出略低的灵敏度、显著更高的特异度(几乎与结节大小无关)以及与资深放射科医生相似的准确率。AI系统和资深放射科医生在诊断较小结节(≤25mm)时均表现出更高的灵敏度,而对于较大结节的诊断性能相当。AI系统每个结节的平均诊断时间为0.146秒,这与放射科医生的2.8至4.5分钟形成鲜明对比。
使用我们的优化策略实现基于结节的诊断,AI-SONIC甲状腺自动诊断系统显示出与资深放射科医生相当的总体诊断准确率。因此,预计它可作为放射科医生用于甲状腺恶性结节筛查和术前评估的可靠辅助诊断方法。