用于分析时间进程“组学”数据的线性混合模型样条框架

A Linear Mixed Model Spline Framework for Analysing Time Course 'Omics' Data.

作者信息

Straube Jasmin, Gorse Alain-Dominique, Huang Bevan Emma, Lê Cao Kim-Anh

机构信息

QFAB Bioinformatics, Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia; The University of Queensland Diamantina Institute, Translational Research Institute, Brisbane, QLD, Australia.

QFAB Bioinformatics, Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia.

出版信息

PLoS One. 2015 Aug 27;10(8):e0134540. doi: 10.1371/journal.pone.0134540. eCollection 2015.

DOI:10.1371/journal.pone.0134540

PMID:26313144

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4551847/

Abstract

Time course 'omics' experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. 'Omics' technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course 'omics' experiment molecules are measured for multiple subjects over multiple time points. This results in a large, high-dimensional dataset, which requires computationally efficient approaches for statistical analysis. Moreover, methods need to be able to handle missing values and various levels of noise. We present a novel, robust and powerful framework to analyze time course 'omics' data that consists of three stages: quality assessment and filtering, profile modelling, and analysis. The first step consists of removing molecules for which expression or abundance is highly variable over time. The second step models each molecular expression profile in a linear mixed model framework which takes into account subject-specific variability. The best model is selected through a serial model selection approach and results in dimension reduction of the time course data. The final step includes two types of analysis of the modelled trajectories, namely, clustering analysis to identify groups of correlated profiles over time, and differential expression analysis to identify profiles which differ over time and/or between treatment groups. Through simulation studies we demonstrate the high sensitivity and specificity of our approach for differential expression analysis. We then illustrate how our framework can bring novel insights on two time course 'omics' studies in breast cancer and kidney rejection. The methods are publicly available, implemented in the R CRAN package lmms.

摘要

时间进程“组学”实验对于研究全系统动态调控变得越来越重要。尽管它们具有很高的信息含量，但分析仍然具有挑战性。“组学”技术可对数以万计的分子进行定量测量。因此，在时间进程“组学”实验中，会在多个时间点对多个受试者的分子进行测量。这会产生一个大型的高维数据集，需要计算效率高的方法进行统计分析。此外，方法还需要能够处理缺失值和各种噪声水平。我们提出了一个新颖、稳健且强大的框架来分析时间进程“组学”数据，该框架包括三个阶段：质量评估与过滤、轮廓建模和分析。第一步包括去除那些表达或丰度随时间变化很大的分子。第二步在一个线性混合模型框架中对每个分子表达轮廓进行建模，该框架考虑了受试者特异性变异性。通过串行模型选择方法选择最佳模型，从而实现时间进程数据的降维。最后一步包括对建模轨迹的两种类型的分析，即聚类分析以识别随时间相关轮廓的组，以及差异表达分析以识别随时间和/或治疗组之间不同的轮廓。通过模拟研究，我们证明了我们的方法在差异表达分析中的高灵敏度和特异性。然后，我们说明了我们的框架如何能够在两项乳腺癌和肾移植排斥反应的时间进程“组学”研究中带来新的见解。这些方法是公开可用的，在R CRAN包lmms中实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cf7/4551847/883725eb3c58/pone.0134540.g001.jpg

相似文献

A Linear Mixed Model Spline Framework for Analysing Time Course 'Omics' Data.

PLoS One. 2015 Aug 27;10(8):e0134540. doi: 10.1371/journal.pone.0134540. eCollection 2015.

Integrating multiple 'omics' analysis for microbial biology: application and methodologies.

Microbiology (Reading). 2010 Feb;156(Pt 2):287-301. doi: 10.1099/mic.0.034793-0. Epub 2009 Nov 12.

Multi-omics facilitated variable selection in Cox-regression model for cancer prognosis prediction.

Methods. 2017 Jul 15;124:100-107. doi: 10.1016/j.ymeth.2017.06.010. Epub 2017 Jun 13.

Novel multivariate methods for integration of genomics and proteomics data: applications in a kidney transplant rejection study.

OMICS. 2014 Nov;18(11):682-95. doi: 10.1089/omi.2014.0062.

What did we learn from 'omics' studies in osteoarthritis.

Curr Opin Rheumatol. 2018 Jan;30(1):114-120. doi: 10.1097/BOR.0000000000000460.

Guidelines for the design, analysis and interpretation of 'omics' data: focus on human endometrium.

Hum Reprod Update. 2014 Jan-Feb;20(1):12-28. doi: 10.1093/humupd/dmt048. Epub 2013 Sep 29.

Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework.

BMC Bioinformatics. 2016 Oct 3;17(1):402. doi: 10.1186/s12859-016-1273-5.

Global proteomics profiling improves drug sensitivity prediction: results from a multi-omics, pan-cancer modeling approach.

Bioinformatics. 2018 Apr 15;34(8):1353-1362. doi: 10.1093/bioinformatics/btx766.

A general framework for integrative analysis of incomplete multiomics data.

Genet Epidemiol. 2020 Oct;44(7):646-664. doi: 10.1002/gepi.22328. Epub 2020 Jul 21.

Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.

J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.

引用本文的文献

Site- and cell-type-specific miRNA and mRNA genes and networks across the cortex, striatum, and hypothalamus.

Commun Biol. 2025 Jul 1;8(1):969. doi: 10.1038/s42003-025-08371-7.

Predicting Bacterial Vaginosis Development using Artificial Neural Networks.

medRxiv. 2025 May 5:2025.05.02.25326872. doi: 10.1101/2025.05.02.25326872.

Integrating -Omic Technologies across Modality, Space, and Time to Decipher Remodeling in Cardiac Disease.

Curr Cardiol Rep. 2025 Mar 21;27(1):74. doi: 10.1007/s11886-025-02226-7.

Mathematical Modeling and Inference of Epidermal Growth Factor-Induced Mitogen-Activated Protein Kinase Cell Signaling Pathways.

Int J Mol Sci. 2024 Sep 23;25(18):10204. doi: 10.3390/ijms251810204.

Non-invasive VOCs detection to monitor the gut microbiota metabolism in-vitro.

Sci Rep. 2024 Jul 9;14(1):15842. doi: 10.1038/s41598-024-66303-7.

A population-based urinary and plasma metabolomics study of environmental exposure to cadmium.

Environ Health Prev Med. 2024;29:22. doi: 10.1265/ehpm.23-00218.

Serial Sampling of the Small Airway Epithelium to Identify Persistent Smoking-dysregulated Genes.

Am J Respir Crit Care Med. 2023 Oct 1;208(7):780-790. doi: 10.1164/rccm.202204-0786OC.

Statistical Detection of Differentially Abundant Proteins in Experiments with Repeated Measures Designs and Isobaric Labeling.

J Proteome Res. 2023 Aug 4;22(8):2641-2659. doi: 10.1021/acs.jproteome.3c00155. Epub 2023 Jul 19.

Phosphoproteomics data-driven signalling network inference: Does it work?

Comput Struct Biotechnol J. 2022 Dec 15;21:432-443. doi: 10.1016/j.csbj.2022.12.010. eCollection 2023.

Benchmarking tools for detecting longitudinal differential expression in proteomics data allows establishing a robust reproducibility optimization regression approach.

Nat Commun. 2022 Dec 22;13(1):7877. doi: 10.1038/s41467-022-35564-z.

本文引用的文献

Senior-Loken syndrome secondary to NPHP5/IQCB1 mutation in an Iranian family.

NDT Plus. 2011 Dec;4(6):421-3. doi: 10.1093/ndtplus/sfr096. Epub 2011 Aug 18.

Antitumor effects and molecular mechanisms of figitumumab, a humanized monoclonal antibody to IGF-1 receptor, in esophageal carcinoma.

Sci Rep. 2014 Oct 31;4:6855. doi: 10.1038/srep06855.

A recursively partitioned mixture model for clustering time-course gene expression data.

Transl Cancer Res. 2014;3(3):217-232. doi: 10.3978/j.issn.2218-676X.2014.06.04.

Data-based filtering for replicated high-throughput transcriptome sequencing experiments.

Bioinformatics. 2013 Sep 1;29(17):2146-52. doi: 10.1093/bioinformatics/btt350. Epub 2013 Jul 2.

Evolutionary principles of modular gene regulation in yeasts.

Elife. 2013 Jun 18;2:e00603. doi: 10.7554/eLife.00603.

Links between metabolism and cancer.

Genes Dev. 2012 May 1;26(9):877-90. doi: 10.1101/gad.189365.112.

A statistical framework for biomarker discovery in metabolomic time course data.

Bioinformatics. 2011 Jul 15;27(14):1979-85. doi: 10.1093/bioinformatics/btr289.

Insulin-like growth factor-dependent proliferation and survival of triple-negative breast cancer cells: implications for therapy.

Neoplasia. 2011 Jun;13(6):504-15. doi: 10.1593/neo.101590.

Unraveling cancer chemoimmunotherapy mechanisms by gene and protein expression profiling of responses to cyclophosphamide.

Cancer Res. 2011 May 15;71(10):3528-39. doi: 10.1158/0008-5472.CAN-10-4523. Epub 2011 Mar 28.

Impulse control: temporal dynamics in gene transcription.

Cell. 2011 Mar 18;144(6):886-96. doi: 10.1016/j.cell.2011.02.015.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于分析时间进程“组学”数据的线性混合模型样条框架

A Linear Mixed Model Spline Framework for Analysing Time Course 'Omics' Data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献