IEEE J Biomed Health Inform. 2022 Nov;26(11):5364-5371. doi: 10.1109/JBHI.2022.3197910. Epub 2022 Nov 10.
In recent times, speech-based automatic disease detection systems have shown several promising results in biomedical and life science applications, especially in the case of respiratory diseases. It provides a quick, cost-effective, reliable, and non-invasive potential alternative detection option for COVID-19 in the ongoing pandemic scenario since the subject's voice can be remotely recorded and sent for further analysis. The existing COVID-19 detection methods including RT-PCR, and chest X-ray tests are not only costlier but also require the involvement of a trained technician. The present paper proposes a novel speech-based respiratory disease detection scheme for COVID-19 and Asthma using the Gradient Boosting Machine-based classifier. From the recorded speech samples, the spectral, cepstral, and periodicity features, as well as spectral descriptors, are computed and then homogeneously fused to obtain relevant statistical features. These features are subsequently used as inputs to the Gradient Boosting Machine. The various performance matrices of the proposed model have been obtained using thirteen sound categories' speech data collected from more than 50 countries using five standard datasets for accurate diagnosis of respiratory diseases including COVID-19. The overall average accuracy achieved by the proposed model using the stratified k-fold cross-validation test is above 97%. The analysis of various performance matrices demonstrates that under the current pandemic scenario, the proposed COVID-19 detection scheme can be gainfully employed by physicians.
近年来,基于语音的自动疾病检测系统在生物医学和生命科学应用中显示出了许多有前途的结果,特别是在呼吸疾病方面。在当前的大流行情况下,由于可以远程记录受检者的声音并将其发送进行进一步分析,因此它为 COVID-19 提供了一种快速、具有成本效益、可靠且非侵入性的潜在替代检测选项。现有的 COVID-19 检测方法,包括 RT-PCR 和胸部 X 光检查,不仅成本更高,而且还需要经过培训的技术人员的参与。本文提出了一种使用基于梯度提升机的分类器的新型基于语音的 COVID-19 和哮喘呼吸疾病检测方案。从记录的语音样本中,计算出光谱、倒谱和周期性特征以及光谱描述符,然后均匀融合以获得相关的统计特征。这些特征随后被用作梯度提升机的输入。使用来自 50 多个国家的超过 50 个国家的 13 个声音类别的语音数据,通过五个标准数据集,获得了所提出模型的各种性能矩阵,以准确诊断包括 COVID-19 在内的呼吸疾病。使用分层 k 折交叉验证测试,所提出模型的整体平均准确率超过 97%。对各种性能矩阵的分析表明,在当前大流行的情况下,所提出的 COVID-19 检测方案可以被医生有益地采用。