IEEE Trans Biomed Eng. 2019 Feb;66(2):485-495. doi: 10.1109/TBME.2018.2849502. Epub 2018 Jun 21.
Croup, a respiratory tract infection common in children, causes an inflammation of the upper airway restricting normal breathing and producing cough sounds typically described as seallike "barking cough." Physicians use the existence of barking cough as the defining characteristic of croup. This paper aims to develop automated cough sound analysis methods to objectively diagnose croup.
In automating croup diagnosis, we propose the use of mathematical features inspired by the human auditory system. In particular, we utilize the cochleagram for feature extraction, a time-frequency representation where the frequency components are based on the frequency selectivity property of the human cochlea. Speech and cough share some similarities in the generation process and physiological wetware used. As such, we also propose the use of mel-frequency cepstral coefficients which has been shown to capture the relevant aspects of the short-term power spectrum of speech signals. Feature combination and backward sequential feature selection are also experimented with. Experimentation is performed on cough sound recordings from patients presenting various clinically diagnosed respiratory tract infections divided into croup and non-croup. The dataset is divided into training and test sets of 364 and 115 patients, respectively, with automatically segmented cough sound segments.
Croup and non-croup patient classification on the test dataset with the proposed methods achieve a sensitivity and specificity of 92.31% and 85.29%, respectively.
Experimental results show the significant improvement in automatic croup diagnosis against earlier methods.
This paper has the potential to automate croup diagnosis based solely on cough sound analysis.
喉炎是一种常见于儿童的呼吸道感染,会导致上呼吸道炎症,限制正常呼吸,并产生咳嗽声,通常被描述为类似“犬吠样”的“哮吼性咳嗽”。医生将哮吼性咳嗽的存在作为喉炎的特征。本文旨在开发自动化的咳嗽声音分析方法,以客观诊断喉炎。
在自动化的喉炎诊断中,我们提出使用受人类听觉系统启发的数学特征。具体来说,我们利用 Cochleagram 进行特征提取,这是一种时频表示,其中的频率分量基于人类耳蜗的频率选择性特性。咳嗽和语音在产生过程和使用的生理硬件方面有一些相似之处。因此,我们还提出使用梅尔频率倒谱系数,该系数已被证明可以捕获语音信号的短期功率谱的相关方面。我们还尝试了特征组合和反向序贯特征选择。实验是在呈现各种临床诊断的呼吸道感染的患者的咳嗽声音记录上进行的,分为喉炎和非喉炎。该数据集分为 364 名和 115 名患者的训练集和测试集,具有自动分割的咳嗽声音段。
使用所提出的方法对测试数据集上的喉炎和非喉炎患者进行分类,灵敏度和特异性分别为 92.31%和 85.29%。
实验结果表明,与早期方法相比,自动喉炎诊断有了显著的改进。
本文有可能仅基于咳嗽声音分析实现喉炎的自动化诊断。