• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于癌症和正常基因表达数据分类与预测的集成依赖模型。

Ensemble dependence model for classification and prediction of cancer and normal gene expression data.

作者信息

Qiu Peng, Wang Z Jane, Liu K J Ray

机构信息

Department of Electrical and Computer Engineering, University of Maryland College Park, MD 20742, USA.

出版信息

Bioinformatics. 2005 Jul 15;21(14):3114-21. doi: 10.1093/bioinformatics/bti483. Epub 2005 May 6.

DOI:10.1093/bioinformatics/bti483
PMID:15879455
Abstract

MOTIVATION

DNA microarray technologies make it possible to simultaneously monitor thousands of genes' expression levels. A topic of great interest is to study the different expression profiles between microarray samples from cancer patients and normal subjects, by classifying them at gene expression levels. Currently, various clustering methods have been proposed in the literature to classify cancer and normal samples based on microarray data, and they are predominantly data-driven approaches. In this paper, we propose an alternative approach, a model-driven approach, which can reveal the relationship between the global gene expression profile and the subject's health status, and thus is promising in predicting the early development of cancer.

RESULTS

In this work, we propose an ensemble dependence model, aimed at exploring the group dependence relationship of gene clusters. Under the framework of hypothesis-testing, we employ genes' dependence relationship as a feature to model and classify cancer and normal samples. The proposed classification scheme is applied to several real cancer datasets, including cDNA, Affymetrix microarray and proteomic data. It is noted that the proposed method yields very promising performance. We further investigate the eigenvalue pattern of the proposed method, and we discover different patterns between cancer and normal samples. Moreover, the transition between cancer and normal patterns suggests that the eigenvalue pattern of the proposed models may have potential to predict the early stage of cancer development. In addition, we examine the effects of possible model mismatch on the proposed scheme.

摘要

动机

DNA微阵列技术使同时监测数千个基因的表达水平成为可能。一个备受关注的课题是通过在基因表达水平上对癌症患者和正常受试者的微阵列样本进行分类,来研究它们不同的表达谱。目前,文献中已经提出了各种聚类方法,用于基于微阵列数据对癌症样本和正常样本进行分类,并且它们主要是数据驱动的方法。在本文中,我们提出了一种替代方法,即模型驱动方法,该方法可以揭示全局基因表达谱与受试者健康状况之间的关系,因此在预测癌症的早期发展方面很有前景。

结果

在这项工作中,我们提出了一种集成依赖模型,旨在探索基因簇的组间依赖关系。在假设检验的框架下,我们将基因的依赖关系作为一个特征来对癌症样本和正常样本进行建模和分类。所提出的分类方案应用于几个真实的癌症数据集,包括cDNA、Affymetrix微阵列和蛋白质组学数据。值得注意的是,所提出的方法产生了非常有前景的性能。我们进一步研究了所提出方法的特征值模式,并且我们发现了癌症样本和正常样本之间的不同模式。此外,癌症模式和正常模式之间的转变表明,所提出模型的特征值模式可能具有预测癌症发展早期阶段的潜力。此外,我们研究了可能的模型不匹配对所提出方案的影响。

相似文献

1
Ensemble dependence model for classification and prediction of cancer and normal gene expression data.用于癌症和正常基因表达数据分类与预测的集成依赖模型。
Bioinformatics. 2005 Jul 15;21(14):3114-21. doi: 10.1093/bioinformatics/bti483. Epub 2005 May 6.
2
Independent component analysis-based penalized discriminant method for tumor classification using gene expression data.基于独立成分分析的惩罚判别方法用于利用基因表达数据进行肿瘤分类
Bioinformatics. 2006 Aug 1;22(15):1855-62. doi: 10.1093/bioinformatics/btl190. Epub 2006 May 18.
3
Structured polychotomous machine diagnosis of multiple cancer types using gene expression.使用基因表达对多种癌症类型进行结构化多分类机器诊断。
Bioinformatics. 2006 Apr 15;22(8):950-8. doi: 10.1093/bioinformatics/btl029. Epub 2006 Feb 1.
4
Dependence network modeling for biomarker identification.用于生物标志物识别的依赖网络建模
Bioinformatics. 2007 Jan 15;23(2):198-206. doi: 10.1093/bioinformatics/btl553. Epub 2006 Oct 31.
5
Multiclass cancer classification and biomarker discovery using GA-based algorithms.使用基于遗传算法的算法进行多类别癌症分类和生物标志物发现。
Bioinformatics. 2005 Jun 1;21(11):2691-7. doi: 10.1093/bioinformatics/bti419. Epub 2005 Apr 6.
6
A combination of rough-based feature selection and RBF neural network for classification using gene expression data.一种基于粗糙集的特征选择与径向基函数神经网络相结合的方法,用于利用基因表达数据进行分类。
IEEE Trans Nanobioscience. 2008 Mar;7(1):91-9. doi: 10.1109/TNB.2008.2000142.
7
Cancer classification and prediction using logistic regression with Bayesian gene selection.使用贝叶斯基因选择的逻辑回归进行癌症分类和预测。
J Biomed Inform. 2004 Aug;37(4):249-59. doi: 10.1016/j.jbi.2004.07.009.
8
A method for predicting disease subtypes in presence of misclassification among training samples using gene expression: application to human breast cancer.一种在训练样本存在错误分类的情况下利用基因表达预测疾病亚型的方法:应用于人类乳腺癌。
Bioinformatics. 2006 Feb 1;22(3):317-25. doi: 10.1093/bioinformatics/bti738. Epub 2005 Nov 2.
9
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.基于最大间隔准则的递归基因选择:与支持向量机递归特征消除法的比较
BMC Bioinformatics. 2006 Dec 25;7:543. doi: 10.1186/1471-2105-7-543.
10
Effects of replacing the unreliable cDNA microarray measurements on the disease classification based on gene expression profiles and functional modules.基于基因表达谱和功能模块,替换不可靠的cDNA微阵列测量值对疾病分类的影响。
Bioinformatics. 2006 Dec 1;22(23):2883-9. doi: 10.1093/bioinformatics/btl339. Epub 2006 Jun 29.

引用本文的文献

1
A framework model using multifilter feature selection to enhance colon cancer classification.基于多滤波器特征选择的结肠癌分类增强框架模型。
PLoS One. 2021 Apr 16;16(4):e0249094. doi: 10.1371/journal.pone.0249094. eCollection 2021.
2
Research resource: EPSLiM: ensemble predictor for short linear motifs in nuclear hormone receptors.研究资源:EPSLiM:核激素受体中短线性基序的集成预测器。
Mol Endocrinol. 2014 May;28(5):768-77. doi: 10.1210/me.2014-1006. Epub 2014 Mar 28.
3
Inferring phenotypic properties from single-cell characteristics.
从单细胞特征推断表型特征。
PLoS One. 2012;7(5):e37038. doi: 10.1371/journal.pone.0037038. Epub 2012 May 25.
4
A mixture model with a reference-based automatic selection of components for disease classification from protein and/or gene expression levels.一种基于参考的混合物模型,用于根据蛋白质和/或基因表达水平自动选择疾病分类的成分。
BMC Bioinformatics. 2011 Dec 30;12:496. doi: 10.1186/1471-2105-12-496.
5
Discovering biological progression underlying microarray samples.发现微阵列样本中的生物学进展。
PLoS Comput Biol. 2011 Apr;7(4):e1001123. doi: 10.1371/journal.pcbi.1001123. Epub 2011 Apr 14.
6
Exploring the within- and between-class correlation distributions for tumor classification.探讨肿瘤分类的类内和类间相关系数分布。
Proc Natl Acad Sci U S A. 2010 Apr 13;107(15):6737-42. doi: 10.1073/pnas.0910140107. Epub 2010 Mar 25.
7
Method of regulatory network that can explore protein regulations for disease classification.用于疾病分类的调控网络方法,可以探索蛋白质调控。
Artif Intell Med. 2010 Feb-Mar;48(2-3):119-27. doi: 10.1016/j.artmed.2009.07.011. Epub 2009 Dec 3.
8
MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering.MULTI-K:使用集成 k-均值聚类进行微阵列亚型的准确分类。
BMC Bioinformatics. 2009 Aug 22;10:260. doi: 10.1186/1471-2105-10-260.
9
Differential dependency network analysis to identify condition-specific topological changes in biological networks.差异依赖网络分析以识别生物网络中特定条件下的拓扑变化。
Bioinformatics. 2009 Feb 15;25(4):526-32. doi: 10.1093/bioinformatics/btn660. Epub 2008 Dec 26.
10
SDED: a novel filter method for cancer-related gene selection.SDED:一种用于癌症相关基因选择的新型过滤方法。
Bioinformation. 2008 Apr 11;2(7):301-3. doi: 10.6026/97320630002301.