Suppr超能文献

利用机器学习和全球智能手机记录对 COVID-19 咳嗽进行分类。

COVID-19 cough classification using machine learning and global smartphone recordings.

机构信息

Department of Electrical and Electronic Engineering, Stellenbosch University, South Africa.

SAMRC Centre for Tuberculosis Research, DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, South Africa.

出版信息

Comput Biol Med. 2021 Aug;135:104572. doi: 10.1016/j.compbiomed.2021.104572. Epub 2021 Jun 17.

Abstract

We present a machine learning based COVID-19 cough classifier which can discriminate COVID-19 positive coughs from both COVID-19 negative and healthy coughs recorded on a smartphone. This type of screening is non-contact, easy to apply, and can reduce the workload in testing centres as well as limit transmission by recommending early self-isolation to those who have a cough suggestive of COVID-19. The datasets used in this study include subjects from all six continents and contain both forced and natural coughs, indicating that the approach is widely applicable. The publicly available Coswara dataset contains 92 COVID-19 positive and 1079 healthy subjects, while the second smaller dataset was collected mostly in South Africa and contains 18 COVID-19 positive and 26 COVID-19 negative subjects who have undergone a SARS-CoV laboratory test. Both datasets indicate that COVID-19 positive coughs are 15%-20% shorter than non-COVID coughs. Dataset skew was addressed by applying the synthetic minority oversampling technique (SMOTE). A leave-p-out cross-validation scheme was used to train and evaluate seven machine learning classifiers: logistic regression (LR), k-nearest neighbour (KNN), support vector machine (SVM), multilayer perceptron (MLP), convolutional neural network (CNN), long short-term memory (LSTM) and a residual-based neural network architecture (Resnet50). Our results show that although all classifiers were able to identify COVID-19 coughs, the best performance was exhibited by the Resnet50 classifier, which was best able to discriminate between the COVID-19 positive and the healthy coughs with an area under the ROC curve (AUC) of 0.98. An LSTM classifier was best able to discriminate between the COVID-19 positive and COVID-19 negative coughs, with an AUC of 0.94 after selecting the best 13 features from a sequential forward selection (SFS). Since this type of cough audio classification is cost-effective and easy to deploy, it is potentially a useful and viable means of non-contact COVID-19 screening.

摘要

我们提出了一种基于机器学习的 COVID-19 咳嗽分类器,它可以区分智能手机上记录的 COVID-19 阳性咳嗽与 COVID-19 阴性和健康咳嗽。这种筛查是非接触式的,易于应用,可以减少测试中心的工作量,并通过建议那些有 COVID-19 咳嗽症状的人早期自我隔离来限制传播。本研究中使用的数据集包括来自六大洲的受试者,包含强制和自然咳嗽,表明该方法具有广泛的适用性。公开的 Coswara 数据集包含 92 例 COVID-19 阳性和 1079 例健康受试者,而第二个较小的数据集主要在南非收集,包含 18 例 COVID-19 阳性和 26 例 COVID-19 阴性经 SARS-CoV 实验室检测的受试者。两个数据集都表明,COVID-19 阳性咳嗽比非 COVID 咳嗽短 15%-20%。通过应用合成少数过采样技术 (SMOTE) 解决了数据集偏斜问题。采用留一法交叉验证方案对 7 种机器学习分类器进行训练和评估:逻辑回归 (LR)、k-近邻 (KNN)、支持向量机 (SVM)、多层感知器 (MLP)、卷积神经网络 (CNN)、长短期记忆 (LSTM) 和基于残差的神经网络架构 (Resnet50)。我们的结果表明,尽管所有分类器都能够识别 COVID-19 咳嗽,但 Resnet50 分类器的性能最佳,能够最好地区分 COVID-19 阳性和健康咳嗽,ROC 曲线下面积 (AUC) 为 0.98。LSTM 分类器能够最好地区分 COVID-19 阳性和 COVID-19 阴性咳嗽,在从顺序前向选择 (SFS) 中选择最佳的 13 个特征后,AUC 为 0.94。由于这种咳嗽音频分类具有成本效益且易于部署,因此它是一种潜在的有用且可行的非接触式 COVID-19 筛查方法。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验