Suppr超能文献

Thy-Wise:一种用于评估甲状腺结节的可解释机器学习模型。

Thy-Wise: An interpretable machine learning model for the evaluation of thyroid nodules.

作者信息

Jin Zhe, Pei Shufang, Ouyang Lizhu, Zhang Lu, Mo Xiaokai, Chen Qiuying, You Jingjing, Chen Luyan, Zhang Bin, Zhang Shuixing

机构信息

Department of Radiology, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China.

Department of Ultrasound, Guangdong Provincial People's Hospital/Guangdong Academy of Medical Sciences, Guangzhou, Guangdong, China.

出版信息

Int J Cancer. 2022 Dec 15;151(12):2229-2243. doi: 10.1002/ijc.34248. Epub 2022 Sep 12.

Abstract

Current risk stratification systems for thyroid nodules suffer from low specificity and high biopsy rates. Recently, machine learning (ML) is introduced to assist thyroid nodule diagnosis but lacks interpretability. Here, we developed and validated ML models on 3965 thyroid nodules, as compared to the American College of Radiology Thyroid Imaging, Reporting and Data System (ACR TI-RADS). Subsequently, a SHapley Additive exPlanation (SHAP) algorithm was leveraged to interpret the results of the best-performing ML model. Clinical characteristics including thyroid-function tests were collected from medical records. Five ACR TI-RADS ultrasonography (US) categories plus nodule size were assessed by experienced radiologists. Random forest (RF), support vector machine (SVM) and extreme gradient boosting (XGBoost) were used to build US-only and US-clinical ML models. The ML models and ACR TI-RADS were compared in terms of diagnostic performance and unnecessary biopsy rate. Among the ML models, the US-only RF model (hereafter, Thy-Wise) achieved the optimal performance. Compared to ACR TI-RADS, Thy-Wise showed higher accuracy (82.4% vs 74.8% for the internal validation; 82.1% vs 73.4% for external validation) and specificity (78.7% vs 68.3% for internal validation; 78.5% vs 66.9% for external validation) while maintaining sensitivity (91.7% vs 91.2% for internal validation; 91.9% vs 91.1% for external validation), as well as reduced unnecessary biopsies (15.3% vs 32.3% for internal validation; 15.7% vs 47.3% for external validation). The SHAP-based interpretation of Thy-Wise enables clinicians to better understand the reasoning behind the diagnosis, which may facilitate the clinical translation of this model.

摘要

当前用于甲状腺结节的风险分层系统存在特异性低和活检率高的问题。最近,机器学习(ML)被引入以辅助甲状腺结节诊断,但缺乏可解释性。在此,我们在3965个甲状腺结节上开发并验证了ML模型,并与美国放射学会甲状腺影像报告和数据系统(ACR TI-RADS)进行了比较。随后,利用SHapley加性解释(SHAP)算法来解释表现最佳的ML模型的结果。从病历中收集包括甲状腺功能测试在内的临床特征。由经验丰富的放射科医生评估五个ACR TI-RADS超声(US)类别以及结节大小。使用随机森林(RF)、支持向量机(SVM)和极端梯度提升(XGBoost)构建仅基于US的ML模型和US-临床ML模型。在诊断性能和不必要活检率方面对ML模型和ACR TI-RADS进行了比较。在ML模型中,仅基于US的RF模型(以下简称Thy-Wise)表现最佳。与ACR TI-RADS相比,Thy-Wise显示出更高的准确率(内部验证中为82.4%对74.8%;外部验证中为82.1%对73.4%)和特异性(内部验证中为78.7%对68.3%;外部验证中为78.5%对66.9%),同时保持敏感性(内部验证中为91.7%对91.2%;外部验证中为91.9%对91.1%),并且减少了不必要的活检(内部验证中为15.3%对32.3%;外部验证中为15.7%对47.3%)。基于SHAP对Thy-Wise的解释使临床医生能够更好地理解诊断背后的推理,这可能有助于该模型的临床转化。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验