A multivariate approach for integrating genome-wide expression data and biological knowledge.

Author information

Kong Sek Won, Pu William T, Park Peter J

Affiliation

Department of Cardiology, 300 Longwood Avenue, Boston, MA 02115, USA.

Publication information

Bioinformatics. 2006 Oct 1;22(19):2373-80. doi: 10.1093/bioinformatics/btl401. Epub 2006 Jul 28.

Abstract

MOTIVATION

Several statistical methods that combine analysis of differential gene expression with biological knowledge databases have been proposed for a more rapid interpretation of expression data. However, most such methods are based on a series of univariate statistical tests and do not properly account for the complex structure of gene interactions.

RESULTS

We present a simple yet effective multivariate statistical procedure for assessing the correlation between a subspace defined by a group of genes and a binary phenotype. A subspace is deemed significant if the samples corresponding to different phenotypes are well separated in that subspace. The separation is measured using Hotelling's T² statistic, which captures the covariance structure of the subspace. When the dimension of the subspace is larger than that of the sample space, we project the original data to a smaller orthonormal subspace. We use this method to search through functional pathway subspaces defined by Reactome, KEGG, BioCarta and Gene Ontology. To demonstrate its performance, we apply this method to the data from two published studies, and visualize the results in the principal component space.
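
The procedure described under RESULTS can be illustrated with a short sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: the function names (`hotelling_t2`, `project_to_subspace`, `gene_set_pvalue`) and the `max_dim` parameter are hypothetical, the orthonormal subspace is taken to be the leading principal directions obtained by SVD, and significance is assessed by permuting phenotype labels, which the abstract does not specify. The statistic itself is the standard two-sample Hotelling's T², that is, (n1·n2/(n1+n2)) · d' S⁻¹ d, where d is the difference of group means and S is the pooled within-group covariance.

```python
import numpy as np


def hotelling_t2(X, y):
    """Two-sample Hotelling's T^2 for samples (rows of X) split by binary labels y."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y).astype(bool)
    A, B = X[y], X[~y]
    n1, n2 = len(A), len(B)
    diff = A.mean(axis=0) - B.mean(axis=0)
    # Pooled within-group covariance matrix
    Ac = A - A.mean(axis=0)
    Bc = B - B.mean(axis=0)
    S = (Ac.T @ Ac + Bc.T @ Bc) / (n1 + n2 - 2)
    return (n1 * n2) / (n1 + n2) * (diff @ np.linalg.solve(S, diff))


def project_to_subspace(X, max_dim):
    """Project onto the leading principal directions if there are more genes than max_dim.

    Assumption: principal directions are used as the orthonormal basis; the
    projection ignores the phenotype labels, so label permutation stays valid.
    """
    X = np.asarray(X, dtype=float)
    if X.shape[1] <= max_dim:
        return X
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:max_dim].T


def gene_set_pvalue(X, y, max_dim, n_perm=1000, seed=0):
    """Permutation p-value for the separation of two phenotypes in a gene-set subspace.

    X: samples x genes matrix restricted to one gene set (e.g. one pathway).
    max_dim: hypothetical cap on the subspace dimension; it should be well
    below (number of samples - 2) so the pooled covariance is invertible.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y).astype(bool)
    Z = project_to_subspace(X, max_dim)
    observed = hotelling_t2(Z, y)
    # Null distribution: recompute T^2 after shuffling the phenotype labels
    perms = [hotelling_t2(Z, rng.permutation(y)) for _ in range(n_perm)]
    p = (1 + sum(t >= observed for t in perms)) / (1 + n_perm)
    return observed, p
```

In use, one would call `gene_set_pvalue` once per pathway, passing the expression columns for that pathway's genes, and then correct the resulting p-values for the number of pathways tested. The choice of `max_dim` trades off how much of the gene set's variation is retained against how stably the covariance can be estimated from the available samples.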
