Deep learning-based video analysis for automatically detecting penetration and aspiration in videofluoroscopic swallowing study.

Authors

Kwak Soyoung, Kim Jeoung Kun, Moon Jun Sung, Lee Gun Woo, Kim Sungho, Chang Min Cheol

Affiliations

Department of Physical Medicine and Rehabilitation, College of Medicine, Yeungnam University, Daegu, Republic of Korea.

Department of Business Administration, School of Business, Yeungnam University, Gyeongsan-si, Republic of Korea.

Publication information

Sci Rep. 2025 Jul 7;15(1):24296. doi: 10.1038/s41598-025-10397-0.

DOI:10.1038/s41598-025-10397-0

PMID:40624237

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12234746/

Abstract

The videofluoroscopic swallowing study (VFSS) is the gold standard for diagnosing dysphagia, but its interpretation is time-consuming and requires expertise. This study developed a deep learning model for automatically detecting penetration and aspiration in VFSS and assessed its diagnostic accuracy. Images corresponding to the highest and lowest positions of the hyoid bone (representing the moment of upper esophageal sphincter opening during the swallow and the pre-swallow and post-swallow phases, respectively) were automatically extracted from VFSS videos, yielding 18,145 images from 1,467 patients. The model used a convolutional neural network architecture and incorporated techniques to address class imbalance and optimize performance. At the patient level, the model achieved high diagnostic accuracy, with area under the receiver operating characteristic curve (AUC) values of 0.935 (normal swallowing), 0.889 (penetration), and 0.845 (aspiration). However, despite strong performance in identifying normal swallowing, the model showed low sensitivity for detecting penetration and aspiration. The findings suggest that the proposed model may reduce interpretation time by minimizing the need for repeated video review to identify penetration or aspiration, allowing clinicians to focus on other clinically relevant VFSS findings. Future studies should address its limitations by analyzing full-frame VFSS data and incorporating multicenter datasets.
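To make the patient-level AUC figures concrete, the following is a minimal Python sketch of how per-image class probabilities could be pooled per patient and scored one class against the rest. The abstract does not say how image-level outputs were aggregated, so the averaging rule here is an assumption, and the function names (`patient_level_scores`, `one_vs_rest_auc`), class indices, and toy numbers are all hypothetical illustrations rather than the authors' method or data.

```python
from collections import defaultdict

# Assumed class indices for illustration: 0 = normal, 1 = penetration, 2 = aspiration.

def patient_level_scores(image_probs, patient_ids):
    """Average per-image class probabilities within each patient.

    Averaging is an assumed aggregation rule, not one stated in the abstract.
    """
    sums = {}
    counts = defaultdict(int)
    for pid, probs in zip(patient_ids, image_probs):
        if pid not in sums:
            sums[pid] = [0.0] * len(probs)
        for k, p in enumerate(probs):
            sums[pid][k] += p
        counts[pid] += 1
    return {pid: [s / counts[pid] for s in sums[pid]] for pid in sums}


def one_vs_rest_auc(scores, labels, positive_class):
    """One-vs-rest AUC: the fraction of (positive, negative) patient pairs in
    which the positive patient has the higher score; ties count as 0.5."""
    pos = [scores[pid][positive_class] for pid in scores if labels[pid] == positive_class]
    neg = [scores[pid][positive_class] for pid in scores if labels[pid] != positive_class]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative patient")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy, made-up data: seven images from four hypothetical patients.
image_probs = [
    [0.8, 0.1, 0.1], [0.6, 0.3, 0.1],   # patient A
    [0.3, 0.5, 0.2], [0.2, 0.6, 0.2],   # patient B
    [0.1, 0.3, 0.6],                    # patient C
    [0.5, 0.2, 0.3], [0.7, 0.2, 0.1],   # patient D
]
patient_ids = ["A", "A", "B", "B", "C", "D", "D"]
labels = {"A": 0, "B": 1, "C": 2, "D": 0}

scores = patient_level_scores(image_probs, patient_ids)
for cls, name in enumerate(["normal", "penetration", "aspiration"]):
    print(name, one_vs_rest_auc(scores, labels, cls))
```

Because AUC measures ranking and does not depend on a decision threshold, a model can reach a high AUC for penetration or aspiration and still show low sensitivity at the operating point actually used, which is consistent with the pattern the abstract reports.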

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8281/12234746/63b0e24aab4b/41598_2025_10397_Fig1_HTML.jpg
