Suppr超能文献

用于分析来自多个生物样本的相关质谱数据的多变量两部分统计方法。

Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

作者信息

Taylor Sandra L, Ruhaak L Renee, Weiss Robert H, Kelly Karen, Kim Kyoungmi

机构信息

Division of Biostatistics, Department of Public Health Sciences, University of California Davis, CA, 95616, USA.

Department of Clinical Chemistry and Laboratory Medicine, Leiden University Medical Center, Leiden, The Netherlands.

出版信息

Bioinformatics. 2017 Jan 1;33(1):17-25. doi: 10.1093/bioinformatics/btw578. Epub 2016 Sep 4.

Abstract

MOTIVATION

High through-put mass spectrometry (MS) is now being used to profile small molecular compounds across multiple biological sample types from the same subjects with the goal of leveraging information across biospecimens. Multivariate statistical methods that combine information from all biospecimens could be more powerful than the usual univariate analyses. However, missing values are common in MS data and imputation can impact between-biospecimen correlation and multivariate analysis results.

RESULTS

We propose two multivariate two-part statistics that accommodate missing values and combine data from all biospecimens to identify differentially regulated compounds. Statistical significance is determined using a multivariate permutation null distribution. Relative to univariate tests, the multivariate procedures detected more significant compounds in three biological datasets. In a simulation study, we showed that multi-biospecimen testing procedures were more powerful than single-biospecimen methods when compounds are differentially regulated in multiple biospecimens but univariate methods can be more powerful if compounds are differentially regulated in only one biospecimen.

AVAILABILITY AND IMPLEMENTATION

We provide R functions to implement and illustrate our method as supplementary information CONTACT: sltaylor@ucdavis.eduSupplementary information: Supplementary data are available at Bioinformatics online.

摘要

动机

高通量质谱(MS)现正用于分析来自同一受试者的多种生物样本类型中的小分子化合物,目的是利用跨生物样本的信息。结合所有生物样本信息的多变量统计方法可能比通常的单变量分析更强大。然而,缺失值在质谱数据中很常见,插补会影响生物样本间的相关性和多变量分析结果。

结果

我们提出了两种多变量两部分统计方法,它们能够处理缺失值,并结合所有生物样本的数据来识别差异调节的化合物。使用多变量置换零分布来确定统计显著性。相对于单变量检验,多变量方法在三个生物数据集中检测到了更多显著的化合物。在一项模拟研究中,我们表明当化合物在多个生物样本中差异调节时,多生物样本测试方法比单生物样本方法更强大,但如果化合物仅在一个生物样本中差异调节,单变量方法可能更强大。

可用性和实现方式

我们提供了R函数来实现和说明我们的方法,作为补充信息。联系方式:sltaylor@ucdavis.edu。补充信息:补充数据可在《生物信息学》在线获取。

相似文献

1
Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.
Bioinformatics. 2017 Jan 1;33(1):17-25. doi: 10.1093/bioinformatics/btw578. Epub 2016 Sep 4.
4
Missing value imputation for LC-MS metabolomics data by incorporating metabolic network and adduct ion relations.
Bioinformatics. 2018 May 1;34(9):1555-1561. doi: 10.1093/bioinformatics/btx816.
5
Mass spectrometry-based technologies for high-throughput metabolomics.
Bioanalysis. 2009 Dec;1(9):1665-84. doi: 10.4155/bio.09.158.
6
Statistical and Multivariate Analysis of MS-Based Plant Metabolomics Data.
Methods Mol Biol. 2018;1778:285-296. doi: 10.1007/978-1-4939-7819-9_20.
7
Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data.
Sci Rep. 2018 Jan 12;8(1):663. doi: 10.1038/s41598-017-19120-0.
8
MetTailor: dynamic block summary and intensity normalization for robust analysis of mass spectrometry data in metabolomics.
Bioinformatics. 2015 Nov 15;31(22):3645-52. doi: 10.1093/bioinformatics/btv434. Epub 2015 Jul 27.
10
metaX: a flexible and comprehensive software for processing metabolomics data.
BMC Bioinformatics. 2017 Mar 21;18(1):183. doi: 10.1186/s12859-017-1579-y.

引用本文的文献

本文引用的文献

1
Two-sample tests and one-way MANOVA for multivariate biomarker data with nondetects.
Stat Med. 2016 Sep 10;35(20):3623-44. doi: 10.1002/sim.6945. Epub 2016 Mar 21.
6
Influence of missing values substitutes on multivariate analysis of metabolomics data.
Metabolites. 2014 Jun 16;4(2):433-52. doi: 10.3390/metabo4020433.
7
Metabolomic biomarkers in serum and urine in women with preeclampsia.
PLoS One. 2014 Mar 17;9(3):e91923. doi: 10.1371/journal.pone.0091923. eCollection 2014.
8
Evaluation of glycomic profiling as a diagnostic biomarker for epithelial ovarian cancer.
Cancer Epidemiol Biomarkers Prev. 2014 Apr;23(4):611-21. doi: 10.1158/1055-9965.EPI-13-1073. Epub 2014 Feb 20.
9
Accounting for undetected compounds in statistical analyses of mass spectrometry 'omic studies.
Stat Appl Genet Mol Biol. 2013 Dec;12(6):703-22. doi: 10.1515/sagmb-2013-0021.
10
Serum and tissue metabolomics of head and neck cancer.
Cancer Genomics Proteomics. 2013 Sep-Oct;10(5):233-8.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验