Kim Hyun-Il, Kim Yuna, Kim Bomin, Shin Dae Youp, Lee Seong Jae, Choi Sang-Il
Department of Computer Science and Engineering, Dankook University, Yongin 16890, Korea.
Department of Rehabilitation Medicine, Dankook University Hospital, Cheonan 31116, Korea.
Diagnostics (Basel). 2021 Jun 23;11(7):1147. doi: 10.3390/diagnostics11071147.
Kinematic analysis of the hyoid bone in a videofluorosopic swallowing study (VFSS) is important for assessing dysphagia. However, calibrating the hyoid bone movement is time-consuming, and its reliability shows wide variation. Computer-assisted analysis has been studied to improve the efficiency and accuracy of hyoid bone identification and tracking, but its performance is limited. In this study, we aimed to design a robust network that can track hyoid bone movement automatically without human intervention. Using 69,389 frames from 197 VFSS files as the data set, a deep learning model for detection and trajectory prediction was constructed and trained by the BiFPN-U-Net(T) network. The present model showed improved performance when compared with the previous models: an area under the curve (AUC) of 0.998 for pixelwise accuracy, an accuracy of object detection of 99.5%, and a Dice similarity of 90.9%. The bounding box detection performance for the hyoid bone and reference objects was superior to that of other models, with a mean average precision of 95.9%. The estimation of the distance of hyoid bone movement also showed higher accuracy. The deep learning model proposed in this study could be used to detect and track the hyoid bone more efficiently and accurately in VFSS analysis.
在视频荧光吞咽造影研究(VFSS)中,舌骨的运动学分析对于评估吞咽困难至关重要。然而,校准舌骨运动耗时且可靠性差异很大。已对计算机辅助分析进行了研究,以提高舌骨识别和跟踪的效率与准确性,但其性能有限。在本研究中,我们旨在设计一种强大的网络,能够在无需人工干预的情况下自动跟踪舌骨运动。使用来自197个VFSS文件的69389帧作为数据集,通过BiFPN-U-Net(T)网络构建并训练了用于检测和轨迹预测的深度学习模型。与先前模型相比,当前模型表现出了更好的性能:像素精度的曲线下面积(AUC)为0.998,目标检测准确率为99.5%,骰子相似度为90.9%。舌骨和参考物体的边界框检测性能优于其他模型,平均精度均值为95.9%。舌骨运动距离的估计也显示出更高的准确性。本研究中提出的深度学习模型可用于在VFSS分析中更高效、准确地检测和跟踪舌骨。