Suppr超能文献

一种新颖的联合动态集成选择模型,用于从全血细胞计数中检测 COVID-19 ,以解决数据不平衡问题。

A novel combined dynamic ensemble selection model for imbalanced data to detect COVID-19 from complete blood count.

机构信息

College of Management and Economics, Tianjin University, Tianjin, 300072, China.

Business School, Nankai University, Tianjin, 300071, China.

出版信息

Comput Methods Programs Biomed. 2021 Nov;211:106444. doi: 10.1016/j.cmpb.2021.106444. Epub 2021 Sep 29.

Abstract

BACKGROUND

As blood testing is radiation-free, low-cost and simple to operate, some researchers use machine learning to detect COVID-19 from blood test data. However, few studies take into consideration the imbalanced data distribution, which can impair the performance of a classifier.

METHOD

A novel combined dynamic ensemble selection (DES) method is proposed for imbalanced data to detect COVID-19 from complete blood count. This method combines data preprocessing and improved DES. Firstly, we use the hybrid synthetic minority over-sampling technique and edited nearest neighbor (SMOTE-ENN) to balance data and remove noise. Secondly, in order to improve the performance of DES, a novel hybrid multiple clustering and bagging classifier generation (HMCBCG) method is proposed to reinforce the diversity and local regional competence of candidate classifiers.

RESULTS

The experimental results based on three popular DES methods show that the performance of HMCBCG is better than only use bagging. HMCBCG+KNE obtains the best performance for COVID-19 screening with 99.81% accuracy, 99.86% F1, 99.78% G-mean and 99.81% AUC.

CONCLUSION

Compared to other advanced methods, our combined DES model can improve accuracy, G-mean, F1 and AUC of COVID-19 screening.

摘要

背景

由于血液检测无辐射、成本低且操作简单,一些研究人员利用机器学习从血液检测数据中检测 COVID-19。然而,很少有研究考虑到数据分布不平衡的问题,这可能会影响分类器的性能。

方法

针对不平衡数据,我们提出了一种新的组合动态集成选择 (DES) 方法,用于从全血细胞计数中检测 COVID-19。该方法结合了数据预处理和改进的 DES。首先,我们使用混合合成少数过采样技术和编辑最近邻 (SMOTE-ENN) 来平衡数据并消除噪声。其次,为了提高 DES 的性能,我们提出了一种新的混合多聚类和袋装分类器生成 (HMCBCG) 方法,以增强候选分类器的多样性和局部区域竞争力。

结果

基于三种流行的 DES 方法的实验结果表明,HMCBCG 的性能优于仅使用袋装。HMCBCG+KNE 对 COVID-19 筛查的性能最佳,准确率为 99.81%,F1 值为 99.86%,G-mean 为 99.78%,AUC 为 99.81%。

结论

与其他先进方法相比,我们的组合 DES 模型可以提高 COVID-19 筛查的准确率、G-mean、F1 和 AUC。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/060c/8479386/fa05ec3e8ecc/gr1_lrg.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验