Cao Zuwei, Chen Feifan, Grais Emad M, Yue Fengjuan, Cai Yuexin, Swanepoel De Wet, Zhao Fei
Center for Rehabilitative Auditory Research, Guizhou Provincial People's Hospital, Guiyang City, China.
Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK.
Laryngoscope. 2023 Apr;133(4):732-741. doi: 10.1002/lary.30291. Epub 2022 Jul 18.
To systematically evaluate the development of Machine Learning (ML) models and compare their diagnostic accuracy for the classification of Middle Ear Disorders (MED) using Tympanic Membrane (TM) images.
PubMed, EMBASE, CINAHL, and CENTRAL were searched up until November 30, 2021. Studies on the development of ML approaches for diagnosing MED using TM images were selected according to the inclusion criteria. PRISMA guidelines were followed with study design, analysis method, and outcomes extracted. Sensitivity, specificity, and area under the curve (AUC) were used to summarize the performance metrics of the meta-analysis. Risk of Bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool in combination with the Prediction Model Risk of Bias Assessment Tool.
Sixteen studies were included, encompassing 20254 TM images (7025 normal TM and 13229 MED). The sample size ranged from 45 to 6066 per study. The accuracy of the 25 included ML approaches ranged from 76.00% to 98.26%. Eleven studies (68.8%) were rated as having a low risk of bias, with the reference standard as the major domain of high risk of bias (37.5%). Sensitivity and specificity were 93% (95% CI, 90%-95%) and 85% (95% CI, 82%-88%), respectively. The AUC of total TM images was 94% (95% CI, 91%-96%). The greater AUC was found using otoendoscopic images than otoscopic images.
ML approaches perform robustly in distinguishing between normal ears and MED, however, it is proposed that a standardized TM image acquisition and annotation protocol should be developed.
NA Laryngoscope, 133:732-741, 2023.
系统评估机器学习(ML)模型的发展,并比较其使用鼓膜(TM)图像对中耳疾病(MED)进行分类的诊断准确性。
检索截至2021年11月30日的PubMed、EMBASE、CINAHL和CENTRAL数据库。根据纳入标准选择使用TM图像诊断MED的ML方法发展的研究。遵循PRISMA指南,提取研究设计、分析方法和结果。使用敏感性、特异性和曲线下面积(AUC)来总结荟萃分析的性能指标。使用诊断准确性研究质量评估-2工具结合预测模型偏倚风险评估工具评估偏倚风险。
纳入16项研究,涵盖20254张TM图像(7025张正常TM和13229张MED)。每项研究的样本量从45到6066不等。纳入的25种ML方法的准确率在76.00%至98.26%之间。11项研究(68.8%)被评为低偏倚风险,以参考标准作为高偏倚风险的主要领域(37.5%)。敏感性和特异性分别为93%(95%CI,90%-95%)和85%(95%CI,82%-88%)。总TM图像的AUC为94%(95%CI,91%-96%)。使用耳内镜图像比耳镜图像发现的AUC更大。
ML方法在区分正常耳朵和MED方面表现强劲,然而,建议制定标准化的TM图像采集和注释方案。
NA 喉镜杂志,133:732-741,2023年。