• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

与微阵列癌症分类相关且重要的有监督基因簇。

Relevant and significant supervised gene clusters for microarray cancer classification.

机构信息

Machine Intelligence Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata, 700 108, India.

出版信息

IEEE Trans Nanobioscience. 2012 Jun;11(2):161-8. doi: 10.1109/TNB.2012.2193590. Epub 2012 Apr 27.

DOI:10.1109/TNB.2012.2193590
PMID:22552589
Abstract

An important application of microarray data in functional genomics is to classify samples according to their gene expression profiles such as to classify cancer versus normal samples or to classify different types or subtypes of cancer. One of the major tasks with gene expression data is to find co-regulated gene groups whose collective expression is strongly associated with sample categories. In this regard, a gene clustering algorithm is proposed to group genes from microarray data. It directly incorporates the information of sample categories in the grouping process for finding groups of co-regulated genes with strong association to the sample categories, yielding a supervised gene clustering algorithm. The average expression of the genes from each cluster acts as its representative. Some significant representatives are taken to form the reduced feature set to build the classifiers for cancer classification. The mutual information is used to compute both gene-gene redundancy and gene-class relevance. The performance of the proposed method, along with a comparison with existing methods, is studied on six cancer microarray data sets using the predictive accuracy of naive Bayes classifier, K-nearest neighbor rule, and support vector machine. An important finding is that the proposed algorithm is shown to be effective for identifying biologically significant gene clusters with excellent predictive capability.

摘要

微阵列数据在功能基因组学中的一个重要应用是根据其基因表达谱对样本进行分类,例如将癌症与正常样本分类,或将不同类型或亚型的癌症分类。基因表达数据的主要任务之一是找到共同调节的基因群,其集体表达与样本类别强烈相关。在这方面,提出了一种基因聚类算法来对微阵列数据中的基因进行分组。它在分组过程中直接纳入样本类别的信息,以找到与样本类别强烈相关的共同调节基因组,从而产生有监督的基因聚类算法。每个聚类的基因的平均表达作为其代表。选择一些重要的代表来形成减少的特征集,以构建用于癌症分类的分类器。互信息用于计算基因-基因冗余和基因-类别相关性。使用朴素贝叶斯分类器、K-最近邻规则和支持向量机在六个癌症微阵列数据集上研究了所提出方法的性能,并与现有方法进行了比较。一个重要的发现是,所提出的算法被证明能够有效地识别具有出色预测能力的生物学上有意义的基因簇。

相似文献

1
Relevant and significant supervised gene clusters for microarray cancer classification.与微阵列癌症分类相关且重要的有监督基因簇。
IEEE Trans Nanobioscience. 2012 Jun;11(2):161-8. doi: 10.1109/TNB.2012.2193590. Epub 2012 Apr 27.
2
Fuzzy-rough supervised attribute clustering algorithm and classification of microarray data.模糊粗糙监督属性聚类算法与微阵列数据分类
IEEE Trans Syst Man Cybern B Cybern. 2011 Feb;41(1):222-33. doi: 10.1109/TSMCB.2010.2050684. Epub 2010 Jun 10.
3
f-Information measures for efficient selection of discriminative genes from microarray data.用于从微阵列数据中有效选择鉴别基因的f-信息度量。
IEEE Trans Biomed Eng. 2009 Apr;56(4):1063-9. doi: 10.1109/TBME.2008.2004502. Epub 2008 Sep 16.
4
An efficient statistical feature selection approach for classification of gene expression data.一种用于基因表达数据分类的高效统计特征选择方法。
J Biomed Inform. 2011 Aug;44(4):529-35. doi: 10.1016/j.jbi.2011.01.001. Epub 2011 Jan 15.
5
Multiclass cancer classification by support vector machines with class-wise optimized genes and probability estimates.基于类别优化基因和概率估计的支持向量机进行多类别癌症分类
J Theor Biol. 2009 Aug 7;259(3):533-40. doi: 10.1016/j.jtbi.2009.04.013. Epub 2009 May 3.
6
Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
7
Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.基于图形聚类方法的多亚类癌症DNA微阵列分析
J Biosci Bioeng. 2008 Nov;106(5):442-8. doi: 10.1263/jbb.106.442.
8
Many accurate small-discriminatory feature subsets exist in microarray transcript data: biomarker discovery.微阵列转录数据中存在许多准确的小判别特征子集:生物标志物发现。
BMC Bioinformatics. 2005 Apr 13;6:97. doi: 10.1186/1471-2105-6-97.
9
Gene selection for classification of cancers using probabilistic model building genetic algorithm.使用概率模型构建遗传算法进行癌症分类的基因选择
Biosystems. 2005 Dec;82(3):208-25. doi: 10.1016/j.biosystems.2005.07.003. Epub 2005 Aug 22.
10
A gene selection method for classifying cancer samples using 1D discrete wavelet transform.一种使用一维离散小波变换对癌症样本进行分类的基因选择方法。
Int J Comput Biol Drug Des. 2009;2(4):398-411. doi: 10.1504/IJCBDD.2009.030769. Epub 2009 Jan 4.

引用本文的文献

1
An ensemble machine learning model based on multiple filtering and supervised attribute clustering algorithm for classifying cancer samples.一种基于多重过滤和监督属性聚类算法的集成机器学习模型,用于对癌症样本进行分类。
PeerJ Comput Sci. 2021 Sep 16;7:e671. doi: 10.7717/peerj-cs.671. eCollection 2021.
2
Sulfatase 2 Is Associated with Steroid Resistance in Childhood Nephrotic Syndrome.硫酸酯酶2与儿童肾病综合征的类固醇抵抗有关。
J Clin Med. 2021 Feb 2;10(3):523. doi: 10.3390/jcm10030523.
3
Rough sets for in silico identification of differentially expressed miRNAs.
基于粗糙集理论的差异表达 miRNA 的计算机识别。
Int J Nanomedicine. 2013;8 Suppl 1(Suppl 1):63-74. doi: 10.2147/IJN.S40739. Epub 2013 Sep 16.