Robust singular value decomposition analysis of microarray data.

Authors

Liu Li, Hawkins Douglas M, Ghosh Sujoy, Young S Stanley

Affiliation

National Institute of Statistical Sciences, P.O. Box 14006, Research Triangle Park, NC 27709-4006, USA.

Publication information

Proc Natl Acad Sci U S A. 2003 Nov 11;100(23):13167-72. doi: 10.1073/pnas.1733249100. Epub 2003 Oct 27.

DOI: 10.1073/pnas.1733249100

PMID: 14581611

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC263735/

Abstract

In microarray data there are a number of biological samples, each assessed for the level of gene expression for a typically large number of genes. There is a need to examine these data with statistical techniques to help discern possible patterns in the data. Our technique applies a combination of mathematical and statistical methods to progressively take the data set apart so that different aspects can be examined for both general patterns and very specific effects. Unfortunately, these data tables are often corrupted with extreme values (outliers), missing values, and non-normal distributions that preclude standard analysis. We develop a robust analysis method to address these problems. The benefits of this robust analysis will be both the understanding of large-scale shifts in gene effects and the isolation of particular sample-by-gene effects that might be either unusual interactions or the result of experimental flaws. Our method requires a single pass and does not resort to complex "cleaning" or imputation of the data table before analysis. We illustrate the method with a commercial data set.
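The abstract does not spell out the algorithm, so the following is only a minimal sketch of one way a robust, imputation-free decomposition could work, not the authors' implementation. It assumes a rank-one fit x_ij ≈ a_i · b_j obtained by alternating least-absolute-deviation (L1) regressions, each solved as a weighted median, with missing cells simply skipped. The function names (`weighted_median`, `l1_scores`, `robust_rank_one`), the all-ones starting vector, and the stopping rule are all illustrative assumptions.

```python
import math


def weighted_median(values, weights):
    """Return a value m that minimises sum(w * |v - m|) (a weighted median)."""
    pairs = sorted(zip(values, weights))
    half = 0.5 * sum(w for _, w in pairs)
    running = 0.0
    for value, weight in pairs:
        running += weight
        if running >= half:
            return value
    return pairs[-1][0]


def l1_scores(rows, loadings):
    """For each row, fit x_ij ~ a_i * loadings[j] by least absolute deviations.

    Missing cells (NaN) and zero loadings are skipped, so nothing is imputed.
    """
    scores = []
    for row in rows:
        ratios, weights = [], []
        for x, b in zip(row, loadings):
            if not math.isnan(x) and b != 0.0:
                ratios.append(x / b)
                weights.append(abs(b))
        scores.append(weighted_median(ratios, weights) if ratios else 0.0)
    return scores


def robust_rank_one(X, n_iter=50, tol=1e-10):
    """Alternate L1 fits over rows and columns; return (u, s, v) with X ~ s*u*v^T."""
    cols = [list(c) for c in zip(*X)]
    n_cols = len(cols)
    # Assumed starting point: equal loadings with unit Euclidean norm.
    b = [1.0 / math.sqrt(n_cols)] * n_cols
    for _ in range(n_iter):
        a = l1_scores(X, b)          # update row scores with loadings fixed
        b_new = l1_scores(cols, a)   # update loadings with row scores fixed
        norm = math.sqrt(sum(v * v for v in b_new))
        if norm == 0.0:
            break
        b_new = [v / norm for v in b_new]
        shift = max(abs(p - q) for p, q in zip(b_new, b))
        b = b_new
        if shift < tol:
            break
    a = l1_scores(X, b)
    s = math.sqrt(sum(v * v for v in a))
    u = [v / s for v in a] if s > 0.0 else a
    return u, s, b
```

Under these assumptions, cells with large residuals `X[i][j] - s * u[i] * v[j]` are candidates for the sample-by-gene effects or experimental flaws the abstract mentions, and further components could be extracted by repeating the fit on the residual table.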


Similar articles

Robust singular value decomposition analysis of microarray data.

Proc Natl Acad Sci U S A. 2003 Nov 11;100(23):13167-72. doi: 10.1073/pnas.1733249100. Epub 2003 Oct 27.

Robust imputation method for missing values in microarray data.

BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-8-S2-S6.

DNA microarray data imputation and significance analysis of differential expression.

Bioinformatics. 2005 Nov 15;21(22):4155-61. doi: 10.1093/bioinformatics/bti638. Epub 2005 Aug 23.

Including probe-level measurement error in robust mixture clustering of replicated microarray gene expression.

Stat Appl Genet Mol Biol. 2010;9:Article42. doi: 10.2202/1544-6115.1600. Epub 2010 Dec 9.

The influence of missing value imputation on detection of differentially expressed genes from microarray data.

Bioinformatics. 2005 Dec 1;21(23):4272-9. doi: 10.1093/bioinformatics/bti708. Epub 2005 Oct 10.

Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.

Bioinformatics. 2005 May 15;21(10):2417-23. doi: 10.1093/bioinformatics/bti345. Epub 2005 Feb 24.

Ameliorative missing value imputation for robust biological knowledge inference.

J Biomed Inform. 2008 Aug;41(4):499-514. doi: 10.1016/j.jbi.2007.10.005. Epub 2007 Dec 31.

Sequential local least squares imputation estimating missing value of microarray data.

Comput Biol Med. 2008 Oct;38(10):1112-20. doi: 10.1016/j.compbiomed.2008.08.006. Epub 2008 Sep 30.

Approximate sample size calculations with microarray data: an illustration.

Stat Appl Genet Mol Biol. 2006;5:Article25. doi: 10.2202/1544-6115.1227. Epub 2006 Oct 9.

Integration of diverse microarray data types.

Methods Mol Biol. 2009;556:205-16. doi: 10.1007/978-1-60327-192-9_15.

Cited by

White matter brain structure predicts language performance and learning success.

Hum Brain Mapp. 2023 Mar;44(4):1445-1455. doi: 10.1002/hbm.26132. Epub 2022 Nov 18.

An introduction to new robust linear and monotonic correlation coefficients.

BMC Bioinformatics. 2021 Mar 31;22(1):170. doi: 10.1186/s12859-021-04098-4.

The Decomposition and Forecasting of Mutual Investment Funds Using Singular Spectrum Analysis.

Entropy (Basel). 2020 Jan 9;22(1):83. doi: 10.3390/e22010083.

FarmTest: Factor-adjusted robust multiple testing with approximate false discovery control.

J Am Stat Assoc. 2019;114(528):1880-1893. doi: 10.1080/01621459.2018.1527700. Epub 2019 Mar 20.

LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS.

Ann Stat. 2018 Aug;46(4):1383-1414. doi: 10.1214/17-AOS1588. Epub 2018 Jun 27.

Cancer Subtype Discovery Using Prognosis-Enhanced Neural Network Classifier in Multigenomic Data.

Technol Cancer Res Treat. 2018 Jan 1;17:1533033818790509. doi: 10.1177/1533033818790509.

Normalization and Technical Variation in Gene Expression Measurements.

J Res Natl Inst Stand Technol. 2006 Oct 1;111(5):361-72. doi: 10.6028/jres.111.026. Print 2006 Sep-Oct.

Applications of a Novel Clustering Approach Using Non-Negative Matrix Factorization to Environmental Research in Public Health.

Int J Environ Res Public Health. 2016 May 18;13(5):509. doi: 10.3390/ijerph13050509.

Principal component analysis for designed experiments.

BMC Bioinformatics. 2015;16 Suppl 18(Suppl 18):S7. doi: 10.1186/1471-2105-16-S18-S7. Epub 2015 Dec 9.

Identification of bicluster regions in a binary matrix and its applications.

PLoS One. 2013 Aug 5;8(8):e71680. doi: 10.1371/journal.pone.0071680. Print 2013.

References

Immunosuppressive effect of polycyclic aromatic hydrocarbons by induction of apoptosis of pre-B lymphocytes of bone marrow.

Acta Medica (Hradec Kralove). 2002;45(4):123-8.

The role of neutrophil apoptosis in influencing tissue repair.

J Wound Care. 2003 Jan;12(1):13-6. doi: 10.12968/jowc.2003.12.1.26458.

Treating Crohn's disease by inducing T lymphocyte apoptosis.

Ann N Y Acad Sci. 2002 Nov;973:166-80. doi: 10.1111/j.1749-6632.2002.tb04628.x.

Redox events in HTLV-1 Tax-induced apoptotic T-cell death.

Antioxid Redox Signal. 2002 Jun;4(3):471-7. doi: 10.1089/15230860260196263.

Potential methods to circumvent blocks in apoptosis in lymphomas.

Curr Opin Oncol. 2002 Sep;14(5):490-503. doi: 10.1097/00001622-200209000-00004.

Glucose-6-phosphate dehydrogenase deficiency, the UDP-glucuronosyl transferase 1A1 gene, and neonatal hyperbilirubinemia.

Gastroenterology. 2002 Jul;123(1):127-33. doi: 10.1053/gast.2002.34173.

Large-scale analysis of the human and mouse transcriptomes.

Proc Natl Acad Sci U S A. 2002 Apr 2;99(7):4465-70. doi: 10.1073/pnas.012025199. Epub 2002 Mar 19.

Influence of bilirubin uridine diphosphate-glucuronosyltransferase 1A promoter polymorphisms on serum bilirubin levels and cholelithiasis in children with sickle cell anemia.

J Pediatr Hematol Oncol. 2001 Oct;23(7):448-51. doi: 10.1097/00043426-200110000-00011.

A novel intronic mutation results in the use of a cryptic splice acceptor site within the coding region of UGT1A1, causing Crigler-Najjar syndrome type 1.

Mol Genet Metab. 2002 Feb;75(2):134-42. doi: 10.1006/mgme.2001.3284.

Mechanisms of immune evasion by renal cell carcinoma: tumor-induced T-lymphocyte apoptosis and NFkappaB suppression.

Urology. 2002 Jan;59(1):9-14. doi: 10.1016/s0090-4295(01)01503-5.
