Suppr超能文献

基于视频气动耳镜和深度学习算法的传导性听力损失自动预测。

Automatic Prediction of Conductive Hearing Loss Using Video Pneumatic Otoscopy and Deep Learning Algorithm.

机构信息

Department of Otorhinolaryngology-Head and Neck Surgery, Hanyang University College of Medicine, Hanyang University Medical Center, Seoul, South Korea.

These authors contributed equally to this work.

出版信息

Ear Hear. 2022;43(5):1563-1573. doi: 10.1097/AUD.0000000000001217. Epub 2022 Mar 29.

Abstract

OBJECTIVES

Diseases of the middle ear can interfere with normal sound transmission, which results in conductive hearing loss. Since video pneumatic otoscopy (VPO) findings reveal not only the presence of middle ear effusions but also dynamic movements of the tympanic membrane and part of the ossicles, analyzing VPO images was expected to be useful in predicting the presence of middle ear transmission problems. Using a convolutional neural network (CNN), a deep neural network implementing computer vision, this preliminary study aimed to create a deep learning model that detects the presence of an air-bone gap, conductive component of hearing loss, by analyzing VPO findings.

DESIGN

The medical records of adult patients who underwent VPO tests and pure-tone audiometry (PTA) on the same day were reviewed for enrollment. Conductive hearing loss was defined as an average air-bone gap of more than 10 dB at 0.5, 1, 2, and 4 kHz on PTA. Two significant images from the original VPO videos, at the most medial position on positive pressure and the most laterally displaced position on negative pressure, were used for the analysis. Applying multi-column CNN architectures with individual backbones of pretrained CNN versions, the performance of each model was evaluated and compared for Inception-v3, VGG-16 or ResNet-50. The diagnostic accuracy predicting the presence of conductive component of hearing loss of the selected deep learning algorithm used was compared with experienced otologists.

RESULTS

The conductive hearing loss group consisted of 57 cases (mean air-bone gap = 25 ± 8 dB): 21 ears with effusion, 14 ears with malleus-incus fixation, 15 ears with stapes fixation including otosclerosis, one ear with a loose incus-stapes joint, 3 cases with adhesive otitis media, and 3 ears with middle ear masses including congenital cholesteatoma. The control group consisted of 76 cases with normal hearing thresholds without air-bone gaps. A total of 1130 original images including repeated measurements were obtained for the analysis. Of the various network architectures designed, the best was to feed each of the images into the individual backbones of Inception-v3 (three-column architecture) and concatenate the feature maps after the last convolutional layer from each column. In the selected model, the average performance of 10-fold cross-validation in predicting conductive hearing loss was 0.972 mean areas under the curve (mAUC), 91.6% sensitivity, 96.0% specificity, 94.4% positive predictive value, 93.9% negative predictive value, and 94.1% accuracy, which was superior to that of experienced otologists, whose performance had 0.773 mAUC and 79.0% accuracy on average. The algorithm detected over 85% of cases with stapes fixations or ossicular chain problems other than malleus-incus fixations. Visualization of the region of interest in the deep learning model revealed that the algorithm made decisions generally based on findings in the malleus and nearby tympanic membrane.

CONCLUSIONS

In this preliminary study, the deep learning algorithm created to analyze VPO images successfully detected the presence of conductive hearing losses caused by middle ear effusion, ossicular fixation, otosclerosis, and adhesive otitis media. Interpretation of VPO using the deep learning algorithm showed promise as a diagnostic tool to differentiate conductive hearing loss from sensorineural hearing loss, which would be especially useful for patients with poor cooperation.

摘要

目的

中耳疾病会干扰正常的声音传导,导致传导性听力损失。由于视频气动耳镜(VPO)检查不仅可以显示中耳积液的存在,还可以显示鼓膜和部分听小骨的动态运动,因此分析 VPO 图像有望有助于预测中耳传音问题的存在。本初步研究采用卷积神经网络(CNN),一种实现计算机视觉的深度神经网络,旨在创建一种深度学习模型,通过分析 VPO 结果来检测气骨导间隙的存在,即传导性听力损失的成分。

设计

对同一天接受 VPO 检查和纯音测听(PTA)的成年患者的病历进行了回顾性分析。传导性听力损失定义为 PTA 上 0.5、1、2 和 4 kHz 的平均气骨导间隙大于 10 dB。从原始 VPO 视频中选择两个具有代表性的图像,分别为正压时最内侧位置和负压时最外侧移位位置的图像,用于分析。应用具有预训练 CNN 版本的单个骨干的多列 CNN 架构,评估和比较了每个模型的性能,包括 Inception-v3、VGG-16 或 ResNet-50。比较了所选深度学习算法预测传导性听力损失成分存在的诊断准确性与有经验的耳科医生的准确性。

结果

传导性听力损失组包括 57 例(平均气骨导间隙=25±8 dB):21 耳有积液,14 耳听小骨固定,15 耳镫骨固定包括耳硬化症,1 耳听小骨-砧骨关节松弛,3 例黏连性中耳炎,3 耳中耳包括先天性胆脂瘤有肿块。对照组包括 76 例听力阈值正常、无气骨导间隙的患者。共获得包括重复测量的 1130 张原始图像进行分析。在所设计的各种网络架构中,最好的方法是将每张图像输入到 Inception-v3 的单个骨干(三列架构)中,并在每列最后一个卷积层后串联特征图。在所选择的模型中,在预测传导性听力损失方面,10 折交叉验证的平均性能为 0.972 曲线下面积(mAUC),91.6%的敏感性,96.0%的特异性,94.4%的阳性预测值,93.9%的阴性预测值和 94.1%的准确性,优于平均 mAUC 为 0.773 和准确性为 79.0%的有经验的耳科医生。该算法检测到超过 85%的镫骨固定或除锤骨-砧骨固定以外的听小骨链问题的病例。深度学习模型中感兴趣区域的可视化显示,该算法通常根据锤骨和附近鼓膜的发现做出决策。

结论

在这项初步研究中,创建的用于分析 VPO 图像的深度学习算法成功地检测到中耳积液、听小骨固定、耳硬化症和黏连性中耳炎引起的传导性听力损失的存在。使用深度学习算法解释 VPO 显示出作为一种区分传导性听力损失和感音神经性听力损失的诊断工具的潜力,这对于合作不佳的患者尤其有用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验