• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Can-Evo-Ens:基于分类器堆叠的进化集成系统,用于利用氨基酸序列预测人类乳腺癌。

Can-Evo-Ens: Classifier stacking based evolutionary ensemble system for prediction of human breast cancer using amino acid sequences.

作者信息

Ali Safdar, Majid Abdul

机构信息

Department of Computer & Information Sciences, Pakistan Institute of Engineering & Applied Sciences, Nilore, 45650 Islamabad, Pakistan.

出版信息

J Biomed Inform. 2015 Apr;54:256-69. doi: 10.1016/j.jbi.2015.01.004. Epub 2015 Jan 21.

DOI:10.1016/j.jbi.2015.01.004
PMID:25617669
Abstract

The diagnostic of human breast cancer is an intricate process and specific indicators may produce negative results. In order to avoid misleading results, accurate and reliable diagnostic system for breast cancer is indispensable. Recently, several interesting machine-learning (ML) approaches are proposed for prediction of breast cancer. To this end, we developed a novel classifier stacking based evolutionary ensemble system "Can-Evo-Ens" for predicting amino acid sequences associated with breast cancer. In this paper, first, we selected four diverse-type of ML algorithms of Naïve Bayes, K-Nearest Neighbor, Support Vector Machines, and Random Forest as base-level classifiers. These classifiers are trained individually in different feature spaces using physicochemical properties of amino acids. In order to exploit the decision spaces, the preliminary predictions of base-level classifiers are stacked. Genetic programming (GP) is then employed to develop a meta-classifier that optimal combine the predictions of the base classifiers. The most suitable threshold value of the best-evolved predictor is computed using Particle Swarm Optimization technique. Our experiments have demonstrated the robustness of Can-Evo-Ens system for independent validation dataset. The proposed system has achieved the highest value of Area Under Curve (AUC) of ROC Curve of 99.95% for cancer prediction. The comparative results revealed that proposed approach is better than individual ML approaches and conventional ensemble approaches of AdaBoostM1, Bagging, GentleBoost, and Random Subspace. It is expected that the proposed novel system would have a major impact on the fields of Biomedical, Genomics, Proteomics, Bioinformatics, and Drug Development.

摘要

人类乳腺癌的诊断是一个复杂的过程,特定指标可能会产生阴性结果。为避免产生误导性结果,乳腺癌的准确可靠诊断系统必不可少。最近,人们提出了几种有趣的机器学习(ML)方法用于乳腺癌预测。为此,我们开发了一种基于分类器堆叠的新型进化集成系统“Can-Evo-Ens”,用于预测与乳腺癌相关的氨基酸序列。在本文中,首先,我们选择了朴素贝叶斯、K近邻、支持向量机和随机森林这四种不同类型的ML算法作为基础分类器。这些分类器利用氨基酸的物理化学性质在不同特征空间中分别进行训练。为了利用决策空间,对基础分类器的初步预测结果进行堆叠。然后采用遗传编程(GP)来开发一个元分类器,以优化组合基础分类器的预测结果。使用粒子群优化技术计算最佳进化预测器的最合适阈值。我们的实验证明了Can-Evo-Ens系统对独立验证数据集的稳健性。所提出的系统在癌症预测的ROC曲线下面积(AUC)方面达到了99.95%的最高值。比较结果表明,所提出的方法优于个体ML方法以及AdaBoostM1、Bagging、GentleBoost和随机子空间等传统集成方法。预计所提出的新系统将对生物医学、基因组学、蛋白质组学、生物信息学和药物开发等领域产生重大影响。

相似文献

1
Can-Evo-Ens: Classifier stacking based evolutionary ensemble system for prediction of human breast cancer using amino acid sequences.Can-Evo-Ens:基于分类器堆叠的进化集成系统,用于利用氨基酸序列预测人类乳腺癌。
J Biomed Inform. 2015 Apr;54:256-69. doi: 10.1016/j.jbi.2015.01.004. Epub 2015 Jan 21.
2
IDM-PhyChm-Ens: intelligent decision-making ensemble methodology for classification of human breast cancer using physicochemical properties of amino acids.IDM-PhyChm-Ens:基于氨基酸物理化学性质的人类乳腺癌分类智能决策集成方法
Amino Acids. 2014 Apr;46(4):977-93. doi: 10.1007/s00726-013-1659-x. Epub 2014 Jan 4.
3
HBC-Evo: predicting human breast cancer by exploiting amino acid sequence-based feature spaces and evolutionary ensemble system.
Amino Acids. 2015 Jan;47(1):217-21. doi: 10.1007/s00726-014-1871-3. Epub 2014 Dec 10.
4
Mito-GSAAC: mitochondria prediction using genetic ensemble classifier and split amino acid composition.Mito-GSAAC:基于遗传集成分类器和分裂氨基酸组成的线粒体预测。
Amino Acids. 2012 Apr;42(4):1443-54. doi: 10.1007/s00726-011-0888-0. Epub 2011 Mar 29.
5
Can-CSC-GBE: Developing Cost-sensitive Classifier with Gentleboost Ensemble for breast cancer classification using protein amino acids and imbalanced data.Can-CSC-GBE:使用蛋白质氨基酸和不均衡数据,通过Gentleboost集成开发用于乳腺癌分类的成本敏感分类器。
Comput Biol Med. 2016 Jun 1;73:38-46. doi: 10.1016/j.compbiomed.2016.04.002. Epub 2016 Apr 5.
6
Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines.基于最近邻算法和支持向量机的不平衡数据在人类乳腺癌和结肠癌预测中的应用。
Comput Methods Programs Biomed. 2014 Mar;113(3):792-808. doi: 10.1016/j.cmpb.2014.01.001. Epub 2014 Jan 10.
7
Reviewing ensemble classification methods in breast cancer.综述乳腺癌中的集成分类方法。
Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20.
8
GPCR-MPredictor: multi-level prediction of G protein-coupled receptors using genetic ensemble.GPCR-MPredictor:基于遗传集成的 G 蛋白偶联受体多层次预测
Amino Acids. 2012 May;42(5):1809-23. doi: 10.1007/s00726-011-0902-6. Epub 2011 Apr 20.
9
Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences.利用蛋白质序列的物理化学性质进行泛素化位点预测的计算方法。
BMC Bioinformatics. 2016 Mar 3;17:116. doi: 10.1186/s12859-016-0959-z.
10
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.

引用本文的文献

1
Differentiation of supratentorial single brain metastasis and glioblastoma by using peri-enhancing oedema region-derived radiomic features and multiple classifiers.利用瘤周水肿区衍生的放射组学特征和多分类器对幕上单发脑转移瘤和胶质母细胞瘤进行鉴别。
Eur Radiol. 2020 May;30(5):3015-3022. doi: 10.1007/s00330-019-06460-w. Epub 2020 Jan 31.