Department of Otorhinolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China.
Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, U.S.A.
Laryngoscope. 2024 Oct;134(10):4329-4337. doi: 10.1002/lary.31555. Epub 2024 Jun 3.
To extract texture features from vocal cord leukoplakia (VCL) images and establish a VCL risk stratification prediction model using machine learning (ML) techniques.
A total of 462 patients with pathologically confirmed VCL were retrospectively collected and divided into low-risk and high-risk groups. We use a 5-fold cross validation method to ensure the generalization ability of the model built using the included dataset and avoid overfitting. Totally 504 texture features were extracted from each laryngoscope image. After feature selection, 10 ML classifiers were utilized to construct the model. The SHapley Additive exPlanations (SHAP) was employed for feature analysis. To evaluate the model, accuracy, sensitivity, specificity, and the area under the receiver operating characteristic (ROC) curve (AUC) were utilized. In addition, the model was transformed into an online application for public use and further tested in an independent dataset with 52 cases of VCL.
A total of 12 features were finally selected, random forest (RF) achieved the best model performance, the mean accuracy, sensitivity, specificity, and AUC of the 5-fold cross validation were 92.2 ± 4.1%, 95.6 ± 4.0%, 85.8 ± 5.8%, and 90.7 ± 4.9%, respectively. The result is much higher than the clinicians (AUC between 63.1% and 75.2%). The SHAP algorithm ranks the importance of 12 texture features to the model. The test results of the additional independent datasets were 92.3%, 95.7%, 90.0%, and 93.3%, respectively.
The proposed VCL risk stratification prediction model, which has been developed into a public online prediction platform, may be applied in practical clinical work.
3 Laryngoscope, 134:4329-4337, 2024.
从声带白斑(VCL)图像中提取纹理特征,并使用机器学习(ML)技术建立 VCL 风险分层预测模型。
回顾性收集了 462 例经病理证实的 VCL 患者,分为低风险组和高风险组。我们使用 5 折交叉验证方法,确保使用纳入数据集构建的模型具有良好的泛化能力,避免过拟合。从每个喉镜图像中总共提取了 504 个纹理特征。在特征选择后,使用 10 种 ML 分类器构建模型。采用 SHapley Additive exPlanations(SHAP)进行特征分析。为了评估模型,使用了准确性、敏感性、特异性和接收器工作特征(ROC)曲线下面积(AUC)。此外,还将模型转化为在线应用程序,用于公众使用,并在一个包含 52 例 VCL 的独立数据集上进一步进行测试。
最终选择了 12 个特征,随机森林(RF)的模型性能最佳,5 折交叉验证的平均准确性、敏感性、特异性和 AUC 分别为 92.2±4.1%、95.6±4.0%、85.8±5.8%和 90.7±4.9%。这一结果明显高于临床医生(AUC 在 63.1%至 75.2%之间)。SHAP 算法对 12 个纹理特征对模型的重要性进行了排序。额外的独立数据集的测试结果分别为 92.3%、95.7%、90.0%和 93.3%。
本研究开发的 VCL 风险分层预测模型已经转化为一个公共在线预测平台,可能应用于实际的临床工作中。
3 级喉镜,134:4329-4337,2024。