用于转化表型遗传分析的扭曲线性混合模型。

Warped linear mixed models for the genetic analysis of transformed phenotypes.

作者信息

Fusi Nicolo, Lippert Christoph, Lawrence Neil D, Stegle Oliver

机构信息

eScience Group, Microsoft Research, Los Angeles, California 90024, USA.

Department of Computer Science, University of Sheffield, Sheffield S10 2HQ, UK.

出版信息

Nat Commun. 2014 Sep 19;5:4890. doi: 10.1038/ncomms5890.

DOI:10.1038/ncomms5890

PMID:25234577

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4199105/

Abstract

Linear mixed models (LMMs) are a powerful and established tool for studying genotype-phenotype relationships. A limitation of the LMM is that the model assumes Gaussian distributed residuals, a requirement that rarely holds in practice. Violations of this assumption can lead to false conclusions and loss in power. To mitigate this problem, it is common practice to pre-process the phenotypic values to make them as Gaussian as possible, for instance by applying logarithmic or other nonlinear transformations. Unfortunately, different phenotypes require different transformations, and choosing an appropriate transformation is challenging and subjective. Here we present an extension of the LMM that estimates an optimal transformation from the observed data. In simulations and applications to real data from human, mouse and yeast, we show that using transformations inferred by our model increases power in genome-wide association studies and increases the accuracy of heritability estimation and phenotype prediction.

摘要

线性混合模型（LMMs）是研究基因型与表型关系的一种强大且成熟的工具。LMM的一个局限性在于该模型假设残差呈高斯分布，而这一要求在实际中很少成立。违背这一假设可能会导致错误的结论和功效损失。为了缓解这个问题，常见的做法是对表型值进行预处理，使其尽可能呈高斯分布，例如通过应用对数或其他非线性变换。不幸的是，不同的表型需要不同的变换，而选择合适的变换具有挑战性且主观。在此，我们提出了LMM的一种扩展，它能从观测数据中估计出最优变换。在对人类、小鼠和酵母的真实数据进行的模拟和应用中，我们表明使用我们的模型推断出的变换可提高全基因组关联研究的功效，并提高遗传力估计和表型预测的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a3/4199105/07fbb889ae90/ncomms5890-f1.jpg

相似文献

Warped linear mixed models for the genetic analysis of transformed phenotypes.

Nat Commun. 2014 Sep 19;5:4890. doi: 10.1038/ncomms5890.

Transformation of Summary Statistics from Linear Mixed Model Association on All-or-None Traits to Odds Ratio.

Genetics. 2018 Apr;208(4):1397-1408. doi: 10.1534/genetics.117.300360. Epub 2018 Feb 2.

Efficient estimation of SNP heritability using Gaussian predictive process in large scale cohort studies.

PLoS Genet. 2022 Apr 20;18(4):e1010151. doi: 10.1371/journal.pgen.1010151. eCollection 2022 Apr.

Principal component regression and linear mixed model in association analysis of structured samples: competitors or complements?

Genet Epidemiol. 2015 Mar;39(3):149-55. doi: 10.1002/gepi.21879. Epub 2014 Dec 23.

A robust DF-REML framework for variance components estimation in genetic studies.

Bioinformatics. 2017 Nov 15;33(22):3584-3594. doi: 10.1093/bioinformatics/btx457.

Further improvements to linear mixed models for genome-wide association studies.

Sci Rep. 2014 Nov 12;4:6874. doi: 10.1038/srep06874.

Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions.

Bioinformatics. 2022 Sep 16;38(Suppl_2):ii5-ii12. doi: 10.1093/bioinformatics/btac455.

Genome-wide barebones regression scan for mixed-model association analysis.

Theor Appl Genet. 2020 Jan;133(1):51-58. doi: 10.1007/s00122-019-03439-5. Epub 2019 Sep 24.

The benefits of permutation-based genome-wide association studies.

J Exp Bot. 2024 Sep 11;75(17):5377-5389. doi: 10.1093/jxb/erae280.

Fast heritability estimation based on MINQUE and batch training.

Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac115.

引用本文的文献

Estimating the total variance explained by whole-brain imaging for zero-inflated outcomes.

Commun Biol. 2024 Jul 9;7(1):836. doi: 10.1038/s42003-024-06504-y.

FLOURY ENDOSPERM19 encoding a class I glutamine amidotransferase affects grain quality in rice.

Mol Breed. 2021 May 27;41(5):36. doi: 10.1007/s11032-021-01226-z. eCollection 2021 May.

Unraveling the genetics underlying micronutrient signatures of diversity panel present in brown rice through genome-ionome linkages.

Plant J. 2023 Feb;113(4):749-771. doi: 10.1111/tpj.16080. Epub 2023 Jan 18.

Dissecting Complex Traits Using Omics Data: A Review on the Linear Mixed Models and Their Application in GWAS.

Plants (Basel). 2022 Nov 28;11(23):3277. doi: 10.3390/plants11233277.

Genome-wide association, prediction and heritability in bacteria with application to .

NAR Genom Bioinform. 2022 Feb 22;4(1):lqac011. doi: 10.1093/nargab/lqac011. eCollection 2022 Mar.

DeepNull models non-linear covariate effects to improve phenotypic prediction and association power.

Nat Commun. 2022 Jan 11;13(1):241. doi: 10.1038/s41467-021-27930-0.

Novel QTLs for salinity tolerance revealed by genome-wide association studies of biomass, chlorophyll and tissue ion content in 176 rice landraces from Bangladesh.

PLoS One. 2021 Nov 5;16(11):e0259456. doi: 10.1371/journal.pone.0259456. eCollection 2021.

Genome-Wide Association Mapping Identifies Novel Panicle Morphology Loci and Candidate Genes in Sorghum.

Front Plant Sci. 2021 Oct 5;12:743838. doi: 10.3389/fpls.2021.743838. eCollection 2021.

OsTPR boosts the superior grains through increase in upper secondary rachis branches without incurring a grain quality penalty.

Plant Biotechnol J. 2021 Jul;19(7):1396-1411. doi: 10.1111/pbi.13560. Epub 2021 Feb 20.

Genome-wide association coupled gene to gene interaction studies unveil novel epistatic targets among major effect loci impacting rice grain chalkiness.

Plant Biotechnol J. 2021 May;19(5):910-925. doi: 10.1111/pbi.13516. Epub 2020 Dec 9.

本文引用的文献

Efficient multivariate linear mixed model algorithms for genome-wide association studies.

Nat Methods. 2014 Apr;11(4):407-9. doi: 10.1038/nmeth.2848. Epub 2014 Feb 16.

Underestimation of heritability using a mixed model with a polygenic covariance structure in a genome-wide association study for complex traits.

Eur J Hum Genet. 2014 Jun;22(6):851-4. doi: 10.1038/ejhg.2013.236. Epub 2013 Oct 23.

A unified framework for association analysis with multiple related phenotypes.

PLoS One. 2013 Jul 5;8(7):e65245. doi: 10.1371/journal.pone.0065245. Print 2013.

Detecting regulatory gene-environment interactions with unmeasured environmental factors.

Bioinformatics. 2013 Jun 1;29(11):1382-9. doi: 10.1093/bioinformatics/btt148. Epub 2013 Apr 4.

Polygenic modeling with bayesian sparse linear mixed models.

PLoS Genet. 2013;9(2):e1003264. doi: 10.1371/journal.pgen.1003264. Epub 2013 Feb 7.

Finding the sources of missing heritability in a yeast cross.

Nature. 2013 Feb 14;494(7436):234-7. doi: 10.1038/nature11867. Epub 2013 Feb 3.

A Lasso multi-marker mixed model for association mapping with population structure correction.

Bioinformatics. 2013 Jan 15;29(2):206-14. doi: 10.1093/bioinformatics/bts669. Epub 2012 Nov 22.

A mixed-model approach for genome-wide association studies of correlated traits in structured populations.

Nat Genet. 2012 Sep;44(9):1066-71. doi: 10.1038/ng.2376. Epub 2012 Aug 19.

Heritability in the genome-wide association era.

Hum Genet. 2012 Oct;131(10):1655-64. doi: 10.1007/s00439-012-1199-6. Epub 2012 Jul 21.

An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.

Nat Genet. 2012 Jun 17;44(7):825-30. doi: 10.1038/ng.2314.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于转化表型遗传分析的扭曲线性混合模型。

Warped linear mixed models for the genetic analysis of transformed phenotypes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献