Singular value decomposition for genome-wide expression data processing and modeling.

Authors

Alter O, Brown P O, Botstein D

Affiliations

Departments of Genetics and Biochemistry, Stanford University, Stanford, CA 94305, USA.

Publication information

Proc Natl Acad Sci U S A. 2000 Aug 29;97(18):10101-6. doi: 10.1073/pnas.97.18.10101.

DOI:10.1073/pnas.97.18.10101

PMID:10963673

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC27718/

Abstract

We describe the use of singular value decomposition in transforming genome-wide expression data from genes x arrays space to reduced diagonalized "eigengenes" x "eigenarrays" space, where the eigengenes (or eigenarrays) are unique orthonormal superpositions of the genes (or arrays). Normalizing the data by filtering out the eigengenes (and eigenarrays) that are inferred to represent noise or experimental artifacts enables meaningful comparison of the expression of different genes across different arrays in different experiments. Sorting the data according to the eigengenes and eigenarrays gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype, respectively. After normalization and sorting, the significant eigengenes and eigenarrays can be associated with observed genome-wide effects of regulators, or with measured samples, in which these regulators are overactive or underactive, respectively.
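
The steps named in the abstract (decomposition, filtering of eigengenes, and sorting) can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration and is not taken from the paper's own code: the function names, the synthetic data set (a constant offset standing in for an artifact, plus a periodic signal), the choice to drop the first eigengene, and the reading of "sorting" as ordering genes by their angle in the plane of two eigengenes. The fraction of eigenexpression is computed as each squared singular value divided by the sum of all squared singular values.

```python
import numpy as np


def svd_expression(expression):
    """Factor a genes x arrays matrix as U @ diag(s) @ Vt (thin SVD).

    Columns of U play the role of eigenarrays, rows of Vt the role of eigengenes.
    """
    u, s, vt = np.linalg.svd(expression, full_matrices=False)
    return u, s, vt


def eigenexpression_fractions(s):
    """Share of the overall expression captured by each eigengene."""
    return s**2 / np.sum(s**2)


def filter_eigengenes(expression, drop):
    """Rebuild the matrix with the singular values listed in `drop` set to zero."""
    u, s, vt = svd_expression(expression)
    s_kept = s.copy()
    s_kept[list(drop)] = 0.0
    return (u * s_kept) @ vt  # scales each column of U by its singular value


def sort_genes(expression, first=0, second=1):
    """Order genes by the angle of their projection onto two eigengenes."""
    _, _, vt = svd_expression(expression)
    proj = expression @ vt[[first, second]].T  # shape: genes x 2
    angles = np.arctan2(proj[:, 1], proj[:, 0])
    return np.argsort(angles)


# Synthetic data (hypothetical): 200 genes measured on 12 arrays.
rng = np.random.default_rng(0)
n_genes, n_arrays = 200, 12
t = np.linspace(0.0, 2.0 * np.pi, n_arrays, endpoint=False)
phases = rng.uniform(0.0, 2.0 * np.pi, n_genes)
offset = 5.0  # constant additive component, standing in for an artifact
data = (
    offset
    + np.cos(t[None, :] - phases[:, None])
    + 0.05 * rng.standard_normal((n_genes, n_arrays))
)

u, s, vt = svd_expression(data)
fractions = eigenexpression_fractions(s)

# Treat the dominant, nearly constant eigengene as the artifact and remove it.
filtered = filter_eigengenes(data, drop=[0])

# Sort genes in the plane of the two leading eigengenes of the filtered data.
order = sort_genes(filtered)
sorted_data = filtered[order]
```

In this sketch the constant offset dominates the first eigengene, so zeroing its singular value leaves mostly the periodic signal, and the angular ordering then arranges genes by the phase of that signal. Which eigengenes count as noise or artifact in real data is a judgment the analyst has to make; the sketch does not automate it.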

Similar articles

A tensor higher-order singular value decomposition for integrative analysis of DNA microarray data from different studies.

Proc Natl Acad Sci U S A. 2007 Nov 20;104(47):18371-6. doi: 10.1073/pnas.0709146104. Epub 2007 Nov 14.

Singular value decomposition of genome-scale mRNA lengths distribution reveals asymmetry in RNA gel electrophoresis band broadening.

Proc Natl Acad Sci U S A. 2006 Aug 8;103(32):11828-33. doi: 10.1073/pnas.0604756103. Epub 2006 Jul 28.

Constraint structure analysis of gene expression.

Funct Integr Genomics. 2000 Nov;1(3):174-85. doi: 10.1007/s101420000018.

Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms.

Proc Natl Acad Sci U S A. 2003 Mar 18;100(6):3351-6. doi: 10.1073/pnas.0530258100. Epub 2003 Mar 11.

Genomic signal processing: from matrix algebra to genetic networks.

Methods Mol Biol. 2007;377:17-60. doi: 10.1007/978-1-59745-390-5_2.

Genome wide oscillations in expression. Wavelet analysis of time series data from yeast expression arrays uncovers the dynamic architecture of phenotype.

Mol Biol Rep. 2001;28(2):73-82. doi: 10.1023/a:1017909012215.

Whole genome genetic-typing in yeast using high-density oligonucleotide arrays.

Parasitology. 1999;118 Suppl:S73-80. doi: 10.1017/s0031182099004047.

Genomic dissection of the cell-type-specification circuit in Saccharomyces cerevisiae.

Proc Natl Acad Sci U S A. 2004 Dec 28;101(52):18069-74. doi: 10.1073/pnas.0407611102. Epub 2004 Dec 16.

Integrative analysis of genome-scale data by using pseudoinverse projection predicts novel correlation between DNA replication and RNA transcription.

Proc Natl Acad Sci U S A. 2004 Nov 23;101(47):16577-82. doi: 10.1073/pnas.0406767101. Epub 2004 Nov 15.

Cited by

Multimodal integration strategies for clinical application in oncology.

Front Pharmacol. 2025 Aug 20;16:1609079. doi: 10.3389/fphar.2025.1609079. eCollection 2025.

Accelerated midlife endocrine and bioenergetic brain aging in APOE4 females.

Front Aging Neurosci. 2025 Aug 18;17:1632877. doi: 10.3389/fnagi.2025.1632877. eCollection 2025.

Peripheral blood DNA methylation predicts the early onset of primary tumor in TP53 mutation carriers.

Nat Commun. 2025 Aug 26;16(1):7976. doi: 10.1038/s41467-025-62894-5.

Fluctuation structure predicts genome-wide perturbation outcomes.

Res Sq. 2025 Aug 12:rs.3.rs-7304871. doi: 10.21203/rs.3.rs-7304871/v1.

Motion-invariant variational autoencoding of brain structural connectomes.

Imaging Neurosci (Camb). 2024 Oct 7;2. doi: 10.1162/imag_a_00303. eCollection 2024.

Generative prediction of causal gene sets responsible for complex traits.

Proc Natl Acad Sci U S A. 2025 Jun 17;122(24):e2415071122. doi: 10.1073/pnas.2415071122. Epub 2025 Jun 12.

Optimizing imputation strategies for mass spectrometry-based proteomics considering intensity and missing value rates.

Comput Struct Biotechnol J. 2025 May 3;27:1818-1826. doi: 10.1016/j.csbj.2025.04.041. eCollection 2025.

Quality Control Standards for Batch Effect Evaluation and Correction in Mass Spectrometry Imaging.

Anal Chem. 2025 May 27;97(20):10919-10928. doi: 10.1021/acs.analchem.5c02020. Epub 2025 May 12.

Phenotype and psychometric characterization of Phelan-McDermid syndrome patients: pioneering towards personalized medicine.

Front Psychiatry. 2025 Mar 4;16:1511962. doi: 10.3389/fpsyt.2025.1511962. eCollection 2025.

Simplicity within biological complexity.

Bioinform Adv. 2025 Feb 6;5(1):vbae164. doi: 10.1093/bioadv/vbae164. eCollection 2025.
