Wong Duo Wai-Chi, Wang Jiao, Cheung Sophia Ming-Yan, Lai Derek Ka-Hei, Chiu Armstrong Tat-San, Pu Dai, Cheung James Chung-Wai, Kwok Timothy Chi-Yui
Department of Biomedical Engineering, Faculty of Engineering, Hong Kong Polytechnic University, Hong Kong, China (Hong Kong).
Department of Clinical Laboratory, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China.
J Med Internet Res. 2025 May 5;27:e65551. doi: 10.2196/65551.
Dysphagia affects more than half of older adults with dementia and is associated with a 10-fold increase in mortality. The development of accessible, objective, and reliable screening tools is crucial for early detection and management.
This systematic scoping review aimed to (1) examine the current state of the art in artificial intelligence (AI) and sensor-based technologies for dysphagia screening, (2) evaluate the performance of these AI-based screening tools, and (3) assess the methodological quality and rigor of studies on AI-based dysphagia screening tools.
We conducted a systematic literature search across CINAHL, Embase, PubMed, and Web of Science from inception to July 4, 2024, following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) framework. In total, 2 independent researchers conducted the search, screening, and data extraction. Eligibility criteria included original studies using sensor-based instruments with AI to identify individuals with dysphagia or unsafe swallow events. We excluded studies on pediatric, infant, or postextubation dysphagia, as well as those using non-sensor-based assessments or diagnostic tools. We used a modified Quality Assessment of Diagnostic Accuracy Studies-2 tool to assess methodological quality, adding a "model" domain for AI-specific evaluation. Data were synthesized narratively.
This review included 24 studies involving 2979 participants (1717 with dysphagia and 1262 controls). In total, 75% (18/24) of the studies focused solely on per-individual classification rather than per-swallow event classification. Acoustic (13/24, 54%) and vibratory (9/24, 38%) signals were the primary modality sources. In total, 25% (6/24) of the studies used multimodal approaches, whereas 75% (18/24) used a single modality. Support vector machine was the most common AI model (15/24, 62%), with deep learning approaches emerging in recent years (3/24, 12%). Performance varied widely-accuracy ranged from 71.2% to 99%, area under the receiver operating characteristic curve ranged from 0.77 to 0.977, and sensitivity ranged from 63.6% to 100%. Multimodal systems generally outperformed unimodal systems. The methodological quality assessment revealed a risk of bias, particularly in patient selection (unclear in 18/24, 75% of the studies), index test (unclear in 23/24, 96% of the studies), and modeling (high risk in 13/24, 54% of the studies). Notably, no studies conducted external validation or domain adaptation testing, raising concerns about real-world applicability.
This review provides a comprehensive overview of technological advancements in AI and sensor-based dysphagia screening. While these developments show promise for continuous long-term tele-swallowing assessments, significant methodological limitations were identified. Future studies can explore how each modality can target specific anatomical regions and manifestations of dysphagia. This detailed understanding of how different modalities address various aspects of dysphagia can significantly benefit multimodal systems, enabling them to better handle the multifaceted nature of dysphagia conditions.
吞咽困难影响超过半数的老年痴呆患者,且与死亡率升高10倍相关。开发便捷、客观且可靠的筛查工具对于早期检测和管理至关重要。
本系统综述旨在(1)审视人工智能(AI)和基于传感器的技术在吞咽困难筛查方面的当前技术水平,(2)评估这些基于AI的筛查工具的性能,以及(3)评估基于AI的吞咽困难筛查工具研究的方法学质量和严谨性。
我们按照PRISMA-ScR(系统综述和Meta分析扩展版的首选报告项目)框架,从数据库建立至2024年7月4日在CINAHL、Embase、PubMed和Web of Science上进行了系统的文献检索。总共2名独立研究人员进行检索、筛选和数据提取。纳入标准包括使用基于传感器的仪器结合AI识别吞咽困难个体或不安全吞咽事件的原始研究。我们排除了关于儿科、婴儿或拔管后吞咽困难的研究,以及使用非基于传感器的评估或诊断工具的研究。我们使用改良的诊断准确性研究质量评估-2工具评估方法学质量,增加了一个用于AI特定评估的“模型”领域。数据进行了叙述性综合分析。
本综述纳入24项研究,涉及2979名参与者(1717名吞咽困难患者和1262名对照)。总共75%(18/24)的研究仅专注于个体分类而非每次吞咽事件分类。声学信号(13/24,54%)和振动信号(9/24,38%)是主要的模态来源。总共25%(6/24)的研究使用多模态方法,而75%(18/24)使用单模态。支持向量机是最常见的AI模型(15/24,62%),近年来深度学习方法也开始出现(3/24,12%)。性能差异很大——准确率从71.2%到99%不等,受试者工作特征曲线下面积从0.77到0.977不等,灵敏度从63.6%到100%不等。多模态系统通常优于单模态系统。方法学质量评估显示存在偏倚风险,特别是在患者选择方面(18/24,75%的研究不清楚)、指标测试方面(23/24,96%的研究不清楚)和建模方面(13/24,54%的研究高风险)。值得注意的是,没有研究进行外部验证或领域适应性测试,这引发了对实际应用适用性的担忧。
本综述全面概述了AI和基于传感器的吞咽困难筛查技术的进展。虽然这些进展显示出在持续长期远程吞咽评估方面的前景,但也发现了显著的方法学局限性。未来的研究可以探索每种模态如何针对吞咽困难的特定解剖区域和表现。对不同模态如何处理吞咽困难状况的各个方面的详细理解可以显著造福多模态系统,使其能够更好地应对吞咽困难状况的多面性。