Division of Science and Technology, Graduate School of Sciences and Technology for Innovations, Tokushima University, Tokushima 770-8506, Japan.
Division of Science and Technology, Industrial and Social Science, Graduate School of Technology, Tokushima University, Tokushima 770-8506, Japan.
Sensors (Basel). 2024 May 11;24(10):3057. doi: 10.3390/s24103057.
Cervical auscultation is a simple, noninvasive method for diagnosing dysphagia, although the reliability of the method largely depends on the subjectivity and experience of the evaluator. Recently developed methods for the automatic detection of swallowing sounds facilitate a rough automatic diagnosis of dysphagia, although a reliable method of detection specialized in the peculiar feature patterns of swallowing sounds in actual clinical conditions has not been established. We investigated a novel approach for automatically detecting swallowing sounds by a method wherein basic statistics and dynamic features were extracted based on acoustic features: Mel Frequency Cepstral Coefficients and Mel Frequency Magnitude Coefficients, and an ensemble learning model combining Support Vector Machine and Multi-Layer Perceptron were applied. The evaluation of the effectiveness of the proposed method, based on a swallowing-sounds database synchronized to a video fluorographic swallowing study compiled from 74 advanced-age patients with dysphagia, demonstrated an outstanding performance. It achieved an F1-micro average of approximately 0.92 and an accuracy of 95.20%. The method, proven effective in the current clinical recording database, suggests a significant advancement in the objectivity of cervical auscultation. However, validating its efficacy in other databases is crucial for confirming its broad applicability and potential impact.
颈听诊是一种简单、无创的吞咽困难诊断方法,但该方法的可靠性在很大程度上取决于评估者的主观性和经验。最近开发的吞咽声音自动检测方法有助于对吞咽困难进行粗略的自动诊断,但尚未建立一种专门针对实际临床条件下吞咽声音特殊特征模式的可靠检测方法。我们研究了一种通过基于声学特征(梅尔频率倒谱系数和梅尔频率幅度系数)提取基本统计和动态特征的方法来自动检测吞咽声音的新方法,并应用了结合支持向量机和多层感知机的集成学习模型。基于从 74 名吞咽困难的高龄患者的视频荧光吞咽研究中同步编译的吞咽声音数据库,对所提出方法的有效性进行了评估,结果表明该方法具有出色的性能。它实现了约 0.92 的 F1-微平均值和 95.20%的准确率。该方法在当前临床记录数据库中已被证明有效,表明颈听诊的客观性有了显著提高。然而,在其他数据库中验证其疗效对于确认其广泛适用性和潜在影响至关重要。