• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于代价敏感朴素贝叶斯堆叠集成的癌症分类。

Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble.

机构信息

School of Medical Information, Wannan Medical College, Wuhu 241002, China.

School of Computer and Information, Anhui Normal University, Wuhu 241002, China.

出版信息

Comput Math Methods Med. 2021 Apr 24;2021:5556992. doi: 10.1155/2021/5556992. eCollection 2021.

DOI:10.1155/2021/5556992
PMID:33986823
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8093037/
Abstract

Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to preprocess the data to eliminate irrelevant and redundant features. Then, the classification was carried out in the stacking ensemble learner. A library for support vector machine (LIBSVM), -nearest neighbor (KNN), decision tree C4.5 (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Given the imbalanced characteristics of cancer gene expression data, the embedding cost-sensitive naive Bayes was used as the metalearner of the stacking ensemble, which was represented as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to further verify the classification performance of the model. Compared with other classification methods, such as single classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method in processing different types of cancer data. This method may therefore help guide cancer diagnosis and research.

摘要

集成学习将多个学习者结合起来进行组合学习,具有良好的灵活性和更高的泛化性能的优点。为了实现更高质量的癌症分类,在本研究中,使用快速基于相关的特征选择(FCBF)方法对数据进行预处理,以消除不相关和冗余的特征。然后,在堆叠集成学习者中进行分类。支持向量机(LIBSVM)、-最近邻(KNN)、决策树 C4.5(C4.5)和随机森林(RF)等库被用作堆叠集成的基本学习者。考虑到癌症基因表达数据的不平衡特点,使用嵌入式成本敏感朴素贝叶斯作为堆叠集成的元学习者,称为 CSNB 堆叠。将所提出的 CSNB 堆叠方法应用于九个癌症数据集,以进一步验证模型的分类性能。与其他分类方法,如单分类器算法和集成算法相比,实验结果表明,该方法在处理不同类型的癌症数据方面具有有效性和鲁棒性。因此,该方法可能有助于指导癌症诊断和研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/b4376f267f76/CMMM2021-5556992.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/14744a2ca6bd/CMMM2021-5556992.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/1570f1c429fa/CMMM2021-5556992.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/ea94e3724f55/CMMM2021-5556992.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/b4376f267f76/CMMM2021-5556992.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/14744a2ca6bd/CMMM2021-5556992.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/1570f1c429fa/CMMM2021-5556992.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/ea94e3724f55/CMMM2021-5556992.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed4/8093037/b4376f267f76/CMMM2021-5556992.007.jpg

相似文献

1
Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble.基于代价敏感朴素贝叶斯堆叠集成的癌症分类。
Comput Math Methods Med. 2021 Apr 24;2021:5556992. doi: 10.1155/2021/5556992. eCollection 2021.
2
A Hybrid Ensemble Algorithm Combining AdaBoost and Genetic Algorithm for Cancer Classification with Gene Expression Data.一种结合AdaBoost和遗传算法的混合集成算法用于基于基因表达数据的癌症分类
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):863-870. doi: 10.1109/TCBB.2019.2952102. Epub 2021 Jun 3.
3
A discrete wavelet based feature extraction and hybrid classification technique for microarray data analysis.一种基于离散小波的微阵列数据分析特征提取与混合分类技术。
ScientificWorldJournal. 2014;2014:195470. doi: 10.1155/2014/195470. Epub 2014 Aug 6.
4
Classification of imbalanced data using machine learning algorithms to predict the risk of renal graft failures in Ethiopia.使用机器学习算法对不平衡数据进行分类,以预测埃塞俄比亚肾移植失败的风险。
BMC Med Inform Decis Mak. 2023 May 22;23(1):98. doi: 10.1186/s12911-023-02185-5.
5
A stacking ensemble deep learning approach to cancer type classification based on TCGA data.基于 TCGA 数据的癌症类型分类的堆叠集成深度学习方法。
Sci Rep. 2021 Aug 2;11(1):15626. doi: 10.1038/s41598-021-95128-x.
6
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
7
Can-Evo-Ens: Classifier stacking based evolutionary ensemble system for prediction of human breast cancer using amino acid sequences.Can-Evo-Ens:基于分类器堆叠的进化集成系统,用于利用氨基酸序列预测人类乳腺癌。
J Biomed Inform. 2015 Apr;54:256-69. doi: 10.1016/j.jbi.2015.01.004. Epub 2015 Jan 21.
8
A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods.基于 GA 堆叠的智能家居能耗预测集成方法研究:集成方法比较
J Environ Manage. 2024 Jul;364:121264. doi: 10.1016/j.jenvman.2024.121264. Epub 2024 Jun 12.
9
R-HEFS: Rough set based heterogeneous ensemble feature selection method for medical data classification.基于粗糙集的异质集成特征选择方法在医学数据分类中的应用。
Artif Intell Med. 2021 Apr;114:102049. doi: 10.1016/j.artmed.2021.102049. Epub 2021 Mar 6.
10
Feature weight estimation for gene selection: a local hyperlinear learning approach.特征权重估计在基因选择中的应用:一种局部超线性学习方法。
BMC Bioinformatics. 2014 Mar 14;15:70. doi: 10.1186/1471-2105-15-70.

引用本文的文献

1
Unlocking artificial intelligence, machine learning and deep learning to combat therapeutic resistance in metastatic castration-resistant prostate cancer: a comprehensive review.解锁人工智能、机器学习和深度学习以对抗转移性去势抵抗性前列腺癌中的治疗抵抗:一项综述
Ecancermedicalscience. 2025 Jul 29;19:1953. doi: 10.3332/ecancer.2025.1953. eCollection 2025.
2
Development and validation of an explainable machine learning model for predicting postoperative pulmonary complications after lung cancer surgery: a machine learning study.用于预测肺癌手术后肺部并发症的可解释机器学习模型的开发与验证:一项机器学习研究
EClinicalMedicine. 2025 Aug 1;86:103386. doi: 10.1016/j.eclinm.2025.103386. eCollection 2025 Aug.
3

本文引用的文献

1
Performance-weighted-voting model: An ensemble machine learning method for cancer type classification using whole-exome sequencing mutation.性能加权投票模型:一种使用全外显子组测序突变进行癌症类型分类的集成机器学习方法。
Quant Biol. 2020 Dec 24;8(4):347-358. doi: 10.1007/s40484-020-0226-1. Epub 2020 Dec 7.
2
Application of decision tree-based ensemble learning in the classification of breast cancer.基于决策树的集成学习在乳腺癌分类中的应用。
Comput Biol Med. 2021 Jan;128:104089. doi: 10.1016/j.compbiomed.2020.104089. Epub 2020 Oct 31.
3
Correntropy induced loss based sparse robust graph regularized extreme learning machine for cancer classification.
Enhanced prediction of spine surgery outcomes using advanced machine learning techniques and oversampling methods.
使用先进的机器学习技术和过采样方法加强脊柱手术结果的预测。
Health Inf Sci Syst. 2025 Mar 4;13(1):24. doi: 10.1007/s13755-025-00343-9. eCollection 2025 Dec.
4
Ensemble machine learning framework for predicting maternal health risk during pregnancy.用于预测孕期产妇健康风险的集成机器学习框架。
Sci Rep. 2024 Sep 14;14(1):21483. doi: 10.1038/s41598-024-71934-x.
5
The Application of Artificial Intelligence to Cancer Research: A Comprehensive Guide.人工智能在癌症研究中的应用:全面指南。
Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241250324. doi: 10.1177/15330338241250324.
6
SVM-DO: identification of tumor-discriminating mRNA signatures via support vector machines supported by Disease Ontology.SVM-DO:通过由疾病本体论支持的支持向量机识别肿瘤鉴别mRNA特征
Turk J Biol. 2023 Dec 14;47(6):349-365. doi: 10.55730/1300-0152.2670. eCollection 2023.
7
Machine learning (ML) techniques to predict breast cancer in imbalanced datasets: a systematic review.用于预测不平衡数据集中乳腺癌的机器学习(ML)技术:一项系统综述。
J Cancer Surviv. 2025 Feb;19(1):270-294. doi: 10.1007/s11764-023-01465-3. Epub 2023 Sep 26.
8
A Bayesian optimization tunning integrated multi-stacking classifier framework for the prediction of radiodermatitis from 4D-CT of patients underwent breast cancer radiotherapy.一种用于从接受乳腺癌放疗患者的4D-CT预测放射性皮炎的贝叶斯优化调谐集成多堆叠分类器框架。
Front Oncol. 2023 Jun 13;13:1152020. doi: 10.3389/fonc.2023.1152020. eCollection 2023.
9
Brain Tumor Classification based on Improved Stacked Ensemble Deep Learning Methods.基于改进堆叠集成深度学习方法的脑肿瘤分类。
Asian Pac J Cancer Prev. 2023 Jun 1;24(6):2141-2148. doi: 10.31557/APJCP.2023.24.6.2141.
10
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
基于相关熵损失的稀疏鲁棒图正则化极限学习机用于癌症分类。
BMC Bioinformatics. 2020 Oct 7;21(1):445. doi: 10.1186/s12859-020-03790-1.
4
Automated classification of protein subcellular localization in immunohistochemistry images to reveal biomarkers in colon cancer.利用免疫组织化学图像中的蛋白质亚细胞定位的自动分类来揭示结肠癌中的生物标志物。
BMC Bioinformatics. 2020 Sep 9;21(1):398. doi: 10.1186/s12859-020-03731-y.
5
Bayesian Hyper-LASSO Classification for Feature Selection with Application to Endometrial Cancer RNA-seq Data.贝叶斯超 LASSO 分类用于特征选择及其在子宫内膜癌 RNA-seq 数据中的应用。
Sci Rep. 2020 Jun 16;10(1):9747. doi: 10.1038/s41598-020-66466-z.
6
Stacking Ensemble Technique for Classifying Breast Cancer.用于乳腺癌分类的堆叠集成技术
Healthc Inform Res. 2019 Oct;25(4):283-288. doi: 10.4258/hir.2019.25.4.283. Epub 2019 Oct 31.
7
Unsupervised feature selection algorithm for multiclass cancer classification of gene expression RNA-Seq data.无监督特征选择算法在基因表达 RNA-Seq 数据的多类癌症分类中的应用。
Genomics. 2020 Mar;112(2):1916-1925. doi: 10.1016/j.ygeno.2019.11.004. Epub 2019 Nov 20.
8
CPEM: Accurate cancer type classification based on somatic alterations using an ensemble of a random forest and a deep neural network.CPEM:基于随机森林和深度神经网络集成的体细胞改变的准确癌症类型分类。
Sci Rep. 2019 Nov 15;9(1):16927. doi: 10.1038/s41598-019-53034-3.
9
Cancer classification of single-cell gene expression data by neural network.基于神经网络的单细胞基因表达数据的癌症分类。
Bioinformatics. 2020 Mar 1;36(5):1360-1366. doi: 10.1093/bioinformatics/btz772.
10
Clustering-based undersampling with random over sampling examples and support vector machine for imbalanced classification of breast cancer diagnosis.基于聚类的欠采样与随机过采样示例和支持向量机在乳腺癌诊断中的不平衡分类。
Comput Assist Surg (Abingdon). 2019 Oct;24(sup2):62-72. doi: 10.1080/24699322.2019.1649074. Epub 2019 Aug 12.