基于音节的语音分析与日常对话的深度学习方法用于吞咽困难检测。

Deep learning approach for dysphagia detection by syllable-based speech analysis with daily conversations.

机构信息

Department of Rehabilitation Medicine, Konkuk University Medical Center, 120-1 Neungdong-ro, Gwangjin-gu, Seoul, 05030, Republic of Korea.

Center for Artificial Intelligence, Korea Institute of Science and Technology, 5 Hwarangro14-gil, Seongbuk-gu, Seoul, 02792, Republic of Korea.

出版信息

Sci Rep. 2024 Aug 31;14(1):20270. doi: 10.1038/s41598-024-70774-z.

DOI:10.1038/s41598-024-70774-z

PMID:39217249

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11365951/

Abstract

Dysphagia, a disorder affecting the ability to swallow, has a high prevalence among the older adults and can lead to serious health complications. Therefore, early detection of dysphagia is important. This study evaluated the effectiveness of a newly developed deep learning model that analyzes syllable-segmented data for diagnosing dysphagia, an aspect not addressed in prior studies. The audio data of daily conversations were collected from 16 patients with dysphagia and 24 controls. The presence of dysphagia was determined by videofluoroscopic swallowing study. The data were segmented into syllables using a speech-to-text model and analyzed with a convolutional neural network to perform binary classification between the dysphagia patients and control group. The proposed model in this study was assessed in two different aspects. Firstly, with syllable-segmented analysis, it demonstrated a diagnostic accuracy of 0.794 for dysphagia, a sensitivity of 0.901, a specificity of 0.687, a positive predictive value of 0.742, and a negative predictive value of 0.874. Secondly, at the individual level, it achieved an overall accuracy of 0.900 and area under the curve of 0.953. This research highlights the potential of deep learning modal as an early, non-invasive, and simple method for detecting dysphagia in everyday environments.

摘要

吞咽困难是一种影响吞咽能力的疾病，在老年人中发病率很高，并可能导致严重的健康并发症。因此，早期发现吞咽困难非常重要。本研究评估了一种新开发的深度学习模型在诊断吞咽困难方面的有效性，这是以前的研究中没有涉及到的方面。该研究从 16 名吞咽困难患者和 24 名对照组中收集了日常对话的音频数据。通过视频透视吞咽研究确定是否存在吞咽困难。使用语音到文本模型将数据分段为音节，并使用卷积神经网络进行分析，以在吞咽困难患者和对照组之间进行二进制分类。本研究提出的模型从两个不同方面进行了评估。首先，通过音节分段分析，该模型对吞咽困难的诊断准确率为 0.794，灵敏度为 0.901，特异性为 0.687，阳性预测值为 0.742，阴性预测值为 0.874。其次，在个体水平上，它的整体准确率为 0.900，曲线下面积为 0.953。这项研究强调了深度学习模型作为一种早期、非侵入性和简单的方法在日常环境中检测吞咽困难的潜力。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于音节的语音分析与日常对话的深度学习方法用于吞咽困难检测。

Deep learning approach for dysphagia detection by syllable-based speech analysis with daily conversations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

基于音节的语音分析与日常对话的深度学习方法用于吞咽困难检测。

Deep learning approach for dysphagia detection by syllable-based speech analysis with daily conversations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献