• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用非重复变量的多个分类器的一致性分析:在微阵列基因表达数据中的诊断应用

Consensus analysis of multiple classifiers using non-repetitive variables: diagnostic application to microarray gene expression data.

作者信息

Su Zhenqiang, Hong Huixiao, Perkins Roger, Shao Xueguang, Cai Wensheng, Tong Weida

机构信息

Department of Chemistry, University of Science and Technology of China, Hefei, Anhui 230026, China.

出版信息

Comput Biol Chem. 2007 Feb;31(1):48-56. doi: 10.1016/j.compbiolchem.2007.01.001. Epub 2007 Jan 4.

DOI:10.1016/j.compbiolchem.2007.01.001
PMID:17303535
Abstract

Class prediction based on DNA microarray data has been emerged as one of the most important application of bioinformatics for diagnostics/prognostics. Robust classifiers are needed that use most biologically relevant genes embedded in the data. A consensus approach that combines multiple classifiers has attributes that mitigate this difficulty compared to a single classifier. A new classification method named as consensus analysis of multiple classifiers using non-repetitive variables (CAMCUN) was proposed for the analysis of hyper-dimensional gene expression data. The CAMCUN method combined multiple classifiers, each of which was built from distinct, non-repeated genes that were selected for effectiveness in class differentiation. Thus, the CAMCUN utilized most biologically relevant genes in the final classifier. The CAMCUN algorithm was demonstrated to give consistently more accurate predictions for two well-known datasets for prostate cancer and leukemia. Importantly, the CAMCUN algorithm employed an integrated 10-fold cross-validation and randomization test to assess the degree of confidence of the predictions for unknown samples.

摘要

基于DNA微阵列数据的类别预测已成为生物信息学在诊断/预后方面最重要的应用之一。需要强大的分类器来使用数据中嵌入的最具生物学相关性的基因。与单一分类器相比,结合多个分类器的共识方法具有缓解这一困难的属性。提出了一种名为使用非重复变量的多个分类器的共识分析(CAMCUN)的新分类方法,用于分析高维基因表达数据。CAMCUN方法结合了多个分类器,每个分类器都由为类别区分有效性而选择的不同、不重复的基因构建而成。因此,CAMCUN在最终分类器中利用了最具生物学相关性的基因。对于前列腺癌和白血病的两个著名数据集,CAMCUN算法被证明能持续给出更准确的预测。重要的是,CAMCUN算法采用了集成的10倍交叉验证和随机化测试来评估对未知样本预测的置信度。

相似文献

1
Consensus analysis of multiple classifiers using non-repetitive variables: diagnostic application to microarray gene expression data.使用非重复变量的多个分类器的一致性分析:在微阵列基因表达数据中的诊断应用
Comput Biol Chem. 2007 Feb;31(1):48-56. doi: 10.1016/j.compbiolchem.2007.01.001. Epub 2007 Jan 4.
2
Multiclass cancer classification by support vector machines with class-wise optimized genes and probability estimates.基于类别优化基因和概率估计的支持向量机进行多类别癌症分类
J Theor Biol. 2009 Aug 7;259(3):533-40. doi: 10.1016/j.jtbi.2009.04.013. Epub 2009 May 3.
3
Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
4
Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers.从微阵列数据中选择最少数量的相关基因以设计精确的组织分类器。
Biosystems. 2007 Jul-Aug;90(1):78-86. doi: 10.1016/j.biosystems.2006.07.002. Epub 2006 Jul 10.
5
A new classification model with simple decision rule for discovering optimal feature gene pairs.一种具有简单决策规则的新分类模型,用于发现最优特征基因对。
Comput Biol Med. 2007 Nov;37(11):1637-46. doi: 10.1016/j.compbiomed.2007.03.004. Epub 2007 May 7.
6
Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data.通过整合研究间的微阵列数据直接鉴定出稳健的前列腺癌标志物基因。
Bioinformatics. 2005 Oct 15;21(20):3905-11. doi: 10.1093/bioinformatics/bti647. Epub 2005 Aug 30.
7
Biomarker discovery across annotated and unannotated microarray datasets using semi-supervised learning.使用半监督学习在有注释和无注释的微阵列数据集中发现生物标志物。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2164-9-S2-S7.
8
Induction of comprehensible models for gene expression datasets by subgroup discovery methodology.通过子群发现方法为基因表达数据集诱导可理解模型。
J Biomed Inform. 2004 Aug;37(4):269-84. doi: 10.1016/j.jbi.2004.07.007.
9
PACK: Profile Analysis using Clustering and Kurtosis to find molecular classifiers in cancer.PACK:使用聚类和峰度进行轮廓分析以寻找癌症中的分子分类器。
Bioinformatics. 2006 Sep 15;22(18):2269-75. doi: 10.1093/bioinformatics/btl174. Epub 2006 May 8.
10
Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis.用于微阵列数据分析的具有准确且紧凑模糊规则库的可解释基因表达分类器。
Biosystems. 2006 Sep;85(3):165-76. doi: 10.1016/j.biosystems.2006.01.002. Epub 2006 Feb 21.

引用本文的文献

1
A spatially localized DNA linear classifier for cancer diagnosis.用于癌症诊断的空间局部化 DNA 线性分类器。
Nat Commun. 2024 May 29;15(1):4583. doi: 10.1038/s41467-024-48869-y.
2
sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides.sNebula,一种基于网络的算法,用于预测人类白细胞抗原与肽之间的结合。
Sci Rep. 2016 Aug 25;6:32115. doi: 10.1038/srep32115.
3
An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era.对源自传统微阵列数据的生物标志物在RNA测序时代的效用进行的一项调查。
Genome Biol. 2014 Dec 3;15(12):523. doi: 10.1186/s13059-014-0523-y.
4
Selecting a single model or combining multiple models for microarray-based classifier development?--a comparative analysis based on large and diverse datasets generated from the MAQC-II project.基于 MAQC-II 项目生成的大型且多样化数据集的比较分析:选择单个模型还是组合多个模型用于基于微阵列的分类器开发?
BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S3. doi: 10.1186/1471-2105-12-S10-S3.
5
geneCBR: a translational tool for multiple-microarray analysis and integrative information retrieval for aiding diagnosis in cancer research.基因CBR:一种用于多微阵列分析和综合信息检索的转化工具,以辅助癌症研究中的诊断。
BMC Bioinformatics. 2009 Jun 18;10:187. doi: 10.1186/1471-2105-10-187.
6
An integrated method for cancer classification and rule extraction from microarray data.一种从微阵列数据中进行癌症分类和规则提取的综合方法。
J Biomed Sci. 2009 Feb 24;16(1):25. doi: 10.1186/1423-0127-16-25.
7
Very Important Pool (VIP) genes--an application for microarray-based molecular signatures.非常重要的基因池(VIP)基因——基于微阵列的分子特征的一种应用。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S9. doi: 10.1186/1471-2105-9-S9-S9.