Rahman Tawsifur, Ibtehaz Nabil, Khandakar Amith, Hossain Md Sakib Abrar, Mekki Yosra Magdi Salih, Ezeddin Maymouna, Bhuiyan Enamul Haque, Ayari Mohamed Arselene, Tahir Anas, Qiblawey Yazan, Mahmud Sakib, Zughaier Susu M, Abbas Tariq, Al-Maadeed Somaya, Chowdhury Muhammad E H
Electrical Engineering Department, College of Engineering, Qatar University, Doha 2713, Qatar.
College of Medicine, Qatar University, Doha 2713, Qatar.
Diagnostics (Basel). 2022 Apr 7;12(4):920. doi: 10.3390/diagnostics12040920.
Problem-Since the outbreak of the COVID-19 pandemic, mass testing has become essential to reduce the spread of the virus. Several recent studies suggest that a significant number of COVID-19 patients display no physical symptoms whatsoever. Therefore, it is unlikely that these patients will undergo COVID-19 testing, which increases their chances of unintentionally spreading the virus. Currently, the primary diagnostic tool to detect COVID-19 is a reverse-transcription polymerase chain reaction (RT-PCR) test from the respiratory specimens of the suspected patient, which is invasive and a resource-dependent technique. It is evident from recent researches that asymptomatic COVID-19 patients cough and breathe in a different way than healthy people. Aim-This paper aims to use a novel machine learning approach to detect COVID-19 (symptomatic and asymptomatic) patients from the convenience of their homes so that they do not overburden the healthcare system and also do not spread the virus unknowingly by continuously monitoring themselves. Method-A Cambridge University research group shared such a dataset of cough and breath sound samples from 582 healthy and 141 COVID-19 patients. Among the COVID-19 patients, 87 were asymptomatic while 54 were symptomatic (had a dry or wet cough). In addition to the available dataset, the proposed work deployed a real-time deep learning-based backend server with a web application to crowdsource cough and breath datasets and also screen for COVID-19 infection from the comfort of the user's home. The collected dataset includes data from 245 healthy individuals and 78 asymptomatic and 18 symptomatic COVID-19 patients. Users can simply use the application from any web browser without installation and enter their symptoms, record audio clips of their cough and breath sounds, and upload the data anonymously. Two different pipelines for screening were developed based on the symptoms reported by the users: asymptomatic and symptomatic. An innovative and novel stacking CNN model was developed using three base learners from of eight state-of-the-art deep learning CNN algorithms. The stacking CNN model is based on a logistic regression classifier meta-learner that uses the spectrograms generated from the breath and cough sounds of symptomatic and asymptomatic patients as input using the combined (Cambridge and collected) dataset. Results-The stacking model outperformed the other eight CNN networks with the best classification performance for binary classification using cough sound spectrogram images. The accuracy, sensitivity, and specificity for symptomatic and asymptomatic patients were 96.5%, 96.42%, and 95.47% and 98.85%, 97.01%, and 99.6%, respectively. For breath sound spectrogram images, the metrics for binary classification of symptomatic and asymptomatic patients were 91.03%, 88.9%, and 91.5% and 80.01%, 72.04%, and 82.67%, respectively. Conclusion-The web-application QUCoughScope records coughing and breathing sounds, converts them to a spectrogram, and applies the best-performing machine learning model to classify the COVID-19 patients and healthy subjects. The result is then reported back to the test user in the application interface. Therefore, this novel system can be used by patients in their premises as a pre-screening method to aid COVID-19 diagnosis by prioritizing the patients for RT-PCR testing and thereby reducing the risk of spreading of the disease.
问题——自新冠疫情爆发以来,大规模检测已成为减少病毒传播的关键。近期多项研究表明,相当数量的新冠患者没有任何身体症状。因此,这些患者不太可能接受新冠检测,这增加了他们无意间传播病毒的几率。目前,检测新冠的主要诊断工具是对疑似患者呼吸道样本进行逆转录聚合酶链反应(RT-PCR)检测,这是一种侵入性且依赖资源的技术。近期研究表明,无症状新冠患者咳嗽和呼吸的方式与健康人不同。
目的——本文旨在使用一种新颖的机器学习方法,让新冠(有症状和无症状)患者能在家中方便地进行检测,这样既不会给医疗系统造成过重负担,也不会因持续自我监测而在不知情的情况下传播病毒。
方法——剑桥大学的一个研究小组分享了这样一个数据集,其中包含582名健康人和141名新冠患者的咳嗽和呼吸声音样本。在这些新冠患者中,87人无症状,54人有症状(干咳或湿咳)。除了现有的数据集外,本研究还部署了一个基于深度学习的实时后端服务器以及一个网络应用程序,用于众包咳嗽和呼吸数据集,并能让用户在家中舒适地筛查新冠感染情况。收集到的数据集包括245名健康个体以及78名无症状和18名有症状新冠患者的数据。用户只需从任何网络浏览器使用该应用程序,无需安装,输入自身症状,录制咳嗽和呼吸声音的音频片段,并匿名上传数据。根据用户报告的症状开发了两种不同的筛查流程:无症状和有症状。使用八种最先进的深度学习卷积神经网络(CNN)算法中的三个基础学习器,开发了一种创新的新型堆叠CNN模型。该堆叠CNN模型基于逻辑回归分类器元学习器,它使用有症状和无症状患者的呼吸和咳嗽声音生成的频谱图作为输入,使用的是合并后的(剑桥大学和收集到的)数据集。
结果——在使用咳嗽声音频谱图图像进行二元分类时,堆叠模型的分类性能优于其他八个CNN网络。有症状和无症状患者的准确率、灵敏度和特异性分别为9�.5%、96.42%和95.47%以及98.85%、97.01%和99.6%。对于呼吸声音频谱图图像,有症状和无症状患者二元分类的指标分别为91.03%、88.9%和91.5%以及80.01%、72.04%和82.67%。
结论——网络应用程序QUCoughScope记录咳嗽和呼吸声音,将其转换为频谱图,并应用性能最佳的机器学习模型对新冠患者和健康受试者进行分类。然后结果会在应用程序界面报告给测试用户。因此,这种新颖的系统可被患者在其住所用作预筛查方法,通过对患者进行RT-PCR检测的优先级排序来辅助新冠诊断,从而降低疾病传播风险。