• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 NPPC 集成的基因表达数据的癌症分类。

Cancer classification from gene expression data by NPPC ensemble.

机构信息

Department of Electronics and Communication Engineering, MCKV Institute of Engineering, 243, G.T. Road (N), Liluah, Howrah.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):659-71. doi: 10.1109/TCBB.2010.36.

DOI:10.1109/TCBB.2010.36
PMID:20479504
Abstract

The most important application of microarray in gene expression analysis is to classify the unknown tissue samples according to their gene expression levels with the help of known sample expression levels. In this paper, we present a nonparallel plane proximal classifier (NPPC) ensemble that ensures high classification accuracy of test samples in a computer-aided diagnosis (CAD) framework than that of a single NPPC model. For each data set only, a few genes are selected by using a mutual information criterion. Then a genetic algorithm-based simultaneous feature and model selection scheme is used to train a number of NPPC expert models in multiple subspaces by maximizing cross-validation accuracy. The members of the ensemble are selected by the performance of the trained models on a validation set. Besides the usual majority voting method, we have introduced minimum average proximity-based decision combiner for NPPC ensemble. The effectiveness of the NPPC ensemble and the proposed new approach of combining decisions for cancer diagnosis are studied and compared with support vector machine (SVM) classifier in a similar framework. Experimental results on cancer data sets show that the NPPC ensemble offers comparable testing accuracy to that of SVM ensemble with reduced training time on average.

摘要

微阵列在基因表达分析中的最重要应用是根据已知样本的表达水平,通过帮助未知组织样本的基因表达水平进行分类。在本文中,我们提出了一种非平行平面近端分类器(NPPC)集成,该集成在计算机辅助诊断(CAD)框架中比单个 NPPC 模型能够确保对测试样本的高分类准确性。对于每个数据集,仅使用互信息标准选择少数几个基因。然后,使用基于遗传算法的同时特征和模型选择方案,通过最大化交叉验证准确性,在多个子空间中训练多个 NPPC 专家模型。通过在验证集上对训练模型的性能进行选择,来选择集成成员。除了常用的多数投票方法外,我们还为 NPPC 集成引入了基于最小平均接近度的决策组合器。在类似的框架中,研究并比较了 NPPC 集成和提出的用于癌症诊断的新决策组合方法的有效性,与支持向量机(SVM)分类器进行比较。在癌症数据集上的实验结果表明,NPPC 集成在平均训练时间减少的情况下,可提供与 SVM 集成相当的测试准确性。

相似文献

1
Cancer classification from gene expression data by NPPC ensemble.基于 NPPC 集成的基因表达数据的癌症分类。
IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):659-71. doi: 10.1109/TCBB.2010.36.
2
Tumor classification based on non-negative matrix factorization using gene expression data.基于基因表达数据的非负矩阵分解的肿瘤分类。
IEEE Trans Nanobioscience. 2011 Jun;10(2):86-93. doi: 10.1109/TNB.2011.2144998. Epub 2011 Jul 7.
3
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
4
Ensemble gene selection by grouping for microarray data classification.基于分组的微阵列数据分类的集成基因选择。
J Biomed Inform. 2010 Feb;43(1):81-7. doi: 10.1016/j.jbi.2009.08.010. Epub 2009 Aug 20.
5
Stable gene selection from microarray data via sample weighting.基于样本加权的基因芯片数据中稳定基因的选择。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):262-72. doi: 10.1109/TCBB.2011.47. Epub 2011 Mar 3.
6
SVM-RFE with MRMR filter for gene selection.基于 MRMR 滤波器的 SVM-RFE 基因选择方法。
IEEE Trans Nanobioscience. 2010 Mar;9(1):31-7. doi: 10.1109/TNB.2009.2035284. Epub 2009 Oct 30.
7
A granular computing approach to gene selection.一种用于基因选择的粒度计算方法。
Biomed Mater Eng. 2014;24(1):1307-14. doi: 10.3233/BME-130933.
8
A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.一种用于从癌组织基因表达数据中进行特征选择和规则提取的多核支持向量机方案。
Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.
9
Genetic programming based ensemble system for microarray data classification.基于遗传编程的微阵列数据分类集成系统。
Comput Math Methods Med. 2015;2015:193406. doi: 10.1155/2015/193406. Epub 2015 Feb 25.
10
Decision forest for classification of gene expression data.决策森林用于基因表达数据分类。
Comput Biol Med. 2010 Aug;40(8):698-704. doi: 10.1016/j.compbiomed.2010.06.004. Epub 2010 Jun 29.

引用本文的文献

1
An ensemble machine learning model based on multiple filtering and supervised attribute clustering algorithm for classifying cancer samples.一种基于多重过滤和监督属性聚类算法的集成机器学习模型,用于对癌症样本进行分类。
PeerJ Comput Sci. 2021 Sep 16;7:e671. doi: 10.7717/peerj-cs.671. eCollection 2021.
2
Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles.基于基因表达谱的癌症预测随机子空间聚合
Biomed Res Int. 2016;2016:4596326. doi: 10.1155/2016/4596326. Epub 2016 Nov 24.
3
Recognition of multiple imbalanced cancer types based on DNA microarray data using ensemble classifiers.
基于 DNA 微阵列数据的多种不平衡癌症类型的识别,使用集成分类器。
Biomed Res Int. 2013;2013:239628. doi: 10.1155/2013/239628. Epub 2013 Aug 26.