Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
The second Hospital, Medical College, Shantou University, Shantou, Guangdong Province, China.
JAMA Otolaryngol Head Neck Surg. 2022 Jul 1;148(7):612-620. doi: 10.1001/jamaoto.2022.0900.
Otitis media with effusion (OME) is one of the most common causes of acquired conductive hearing loss (CHL). Persistent hearing loss is associated with poor childhood speech and language development and other adverse consequence. However, to obtain accurate and reliable hearing thresholds largely requires a high degree of cooperation from the patients.
To predict CHL from otoscopic images using deep learning (DL) techniques and a logistic regression model based on tympanic membrane features.
DESIGN, SETTING, AND PARTICIPANTS: A retrospective diagnostic/prognostic study was conducted using 2790 otoscopic images obtained from multiple centers between January 2015 and November 2020. Participants were aged between 4 and 89 years. Of 1239 participants, there were 209 ears from children and adolescents (aged 4-18 years [16.87%]), 804 ears from adults (aged 18-60 years [64.89%]), and 226 ears from older people (aged >60 years, [18.24%]). Overall, 679 ears (54.8%) were from men. The 2790 otoscopic images were randomly assigned into a training set (2232 [80%]), and validation set (558 [20%]). The DL model was developed to predict an average air-bone gap greater than 10 dB. A logistic regression model was also developed based on otoscopic features.
The performance of the DL model in predicting CHL was measured using the area under the receiver operating curve (AUC), accuracy, and F1 score (a measure of the quality of a classifier, which is the harmonic mean of precision and recall; a higher F1 score means better performance). In addition, these evaluation parameters were compared to results obtained from the logistic regression model and predictions made by three otologists.
The performance of the DL model in predicting CHL showed the AUC of 0.74, accuracy of 81%, and F1 score of 0.89. This was better than the results from the logistic regression model (ie, AUC of 0.60, accuracy of 76%, and F1 score of 0.82), and much improved on the performance of the 3 otologists; accuracy of 16%, 30%, 39%, and F1 scores of 0.09, 0.18, and 0.25, respectively. Furthermore, the DL model took 2.5 seconds to predict from 205 otoscopic images, whereas the 3 otologists spent 633 seconds, 645 seconds, and 692 seconds, respectively.
The model in this diagnostic/prognostic study provided greater accuracy in prediction of CHL in ears with OME than those obtained from the logistic regression model and otologists. This indicates great potential for the use of artificial intelligence tools to facilitate CHL evaluation when CHL is unable to be measured.
中耳炎伴积液(OME)是获得性传导性听力损失(CHL)最常见的原因之一。持续性听力损失与儿童言语和语言发育不良以及其他不良后果有关。然而,要获得准确可靠的听力阈值,在很大程度上需要患者高度配合。
使用基于鼓膜特征的深度学习(DL)技术和逻辑回归模型,从耳镜图像预测 CHL。
设计、地点和参与者:这是一项回顾性诊断/预后研究,使用 2015 年 1 月至 2020 年 11 月期间在多个中心获得的 2790 张耳镜图像进行。参与者年龄在 4 至 89 岁之间。在 1239 名参与者中,有 209 只耳朵来自儿童和青少年(4-18 岁[16.87%]),804 只耳朵来自成年人(18-60 岁[64.89%]),226 只耳朵来自老年人(年龄>60 岁,[18.24%])。总体而言,679 只耳朵(54.8%)来自男性。2790 张耳镜图像被随机分配到训练集(2232[80%])和验证集(558[20%])。开发了 DL 模型来预测平均气骨间隙大于 10dB。还基于耳镜特征开发了逻辑回归模型。
使用接收器工作曲线下的面积(AUC)、准确性和 F1 分数(衡量分类器质量的指标,是精度和召回率的调和平均值;更高的 F1 分数意味着更好的性能)来衡量 DL 模型预测 CHL 的性能。此外,还将这些评估参数与逻辑回归模型和三位耳科医生的预测结果进行了比较。
DL 模型预测 CHL 的 AUC 为 0.74,准确性为 81%,F1 得分为 0.89。这优于逻辑回归模型的结果(即 AUC 为 0.60、准确性为 76%和 F1 得分为 0.82),并且大大优于三位耳科医生的表现;准确性分别为 16%、30%、39%和 F1 分数分别为 0.09、0.18 和 0.25。此外,DL 模型从 205 张耳镜图像中预测需要 2.5 秒,而三位耳科医生分别花费 633 秒、645 秒和 692 秒。
本研究中的诊断/预后模型在预测 OME 耳的 CHL 方面比逻辑回归模型和耳科医生提供了更高的准确性。这表明人工智能工具在无法测量 CHL 时,在促进 CHL 评估方面具有很大的潜力。