单倍型对系统发育影响的层次建模

Hierarchical Modelling of Haplotype Effects on a Phylogeny.

作者信息

Selle Maria Lie, Steinsland Ingelin, Lindgren Finn, Brajkovic Vladimir, Cubric-Curik Vlatka, Gorjanc Gregor

机构信息

Department of Mathematical Sciences, Norwegian University of Science and Technology (NTNU), Trondheim, Norway.

School of Mathematics, University of Edinburgh, Edinburgh, United Kingdom.

出版信息

Front Genet. 2021 Jan 15;11:531218. doi: 10.3389/fgene.2020.531218. eCollection 2020.

DOI:10.3389/fgene.2020.531218

PMID:33519886

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7844322/

Abstract

We introduce a hierarchical model to estimate haplotype effects based on phylogenetic relationships between haplotypes and their association with observed phenotypes. In a population there are many, but not all possible, distinct haplotypes and few observations per haplotype. Further, haplotype frequencies tend to vary substantially. Such data structure challenge estimation of haplotype effects. However, haplotypes often differ only due to few mutations, and leveraging similarities can improve the estimation of effects. We build on extensive literature and develop an autoregressive model of order one that models haplotype effects by leveraging phylogenetic relationships described with a directed acyclic graph. The phylogenetic relationships can be either in a form of a tree or a network, and we refer to the model as the haplotype network model. The model can be included as a component in a phenotype model to estimate associations between haplotypes and phenotypes. Our key contribution is that we obtain a sparse model, and by using hierarchical autoregression, the flow of information between similar haplotypes is estimated from the data. A simulation study shows that the hierarchical model can improve estimates of haplotype effects compared to an independent haplotype model, especially with few observations for a specific haplotype. We also compared it to a mutation model and observed comparable performance, though the haplotype model has the potential to capture background specific effects. We demonstrate the model with a study of mitochondrial haplotype effects on milk yield in cattle. We provide R code to fit the model with the INLA package.

摘要

我们引入一种分层模型，以基于单倍型之间的系统发育关系及其与观察到的表型的关联来估计单倍型效应。在一个种群中，存在许多但并非所有可能的不同单倍型，并且每个单倍型的观察值很少。此外，单倍型频率往往差异很大。这种数据结构对单倍型效应的估计提出了挑战。然而，单倍型通常仅因少数突变而不同，利用相似性可以改进效应估计。我们以大量文献为基础，开发了一个一阶自回归模型，该模型通过利用用有向无环图描述的系统发育关系来对单倍型效应进行建模。系统发育关系可以是树状或网络状形式，我们将该模型称为单倍型网络模型。该模型可以作为一个组件包含在表型模型中，以估计单倍型与表型之间的关联。我们的关键贡献在于我们获得了一个稀疏模型，并且通过使用分层自回归，从数据中估计相似单倍型之间的信息流。一项模拟研究表明，与独立单倍型模型相比，分层模型可以改进单倍型效应的估计，特别是对于特定单倍型观察值较少的情况。我们还将其与突变模型进行了比较，观察到了可比的性能，尽管单倍型模型有潜力捕捉背景特定效应。我们通过一项关于线粒体单倍型对奶牛产奶量影响的研究来展示该模型。我们提供了使用INLA软件包拟合该模型的R代码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7318/7844322/e767270e1534/fgene-11-531218-g0001.jpg

相似文献

Hierarchical Modelling of Haplotype Effects on a Phylogeny.

Front Genet. 2021 Jan 15;11:531218. doi: 10.3389/fgene.2020.531218. eCollection 2020.

A Bayesian hierarchical model for detecting haplotype-haplotype and haplotype-environment interactions in genetic association studies.

Hum Hered. 2011;71(3):148-60. doi: 10.1159/000324841. Epub 2011 Jul 20.

Phenotypic and genetic effects of recessive haplotypes on yield, longevity, and fertility.

J Dairy Sci. 2016 Sep;99(9):7274-7288. doi: 10.3168/jds.2015-10777. Epub 2016 Jul 7.

Estimating Haplotype Structure and Frequencies: A Bayesian Approach to Unknown Design in Pooled Genomic Data.

J Comput Biol. 2024 Aug;31(8):708-726. doi: 10.1089/cmb.2023.0211. Epub 2024 Jul 3.

Genomic Prediction Accuracy Using Haplotypes Defined by Size and Hierarchical Clustering Based on Linkage Disequilibrium.

Front Genet. 2020 Mar 6;11:134. doi: 10.3389/fgene.2020.00134. eCollection 2020.

Association test algorithm between a qualitative phenotype and a haplotype or haplotype set using simultaneous estimation of haplotype frequencies, diplotype configurations and diplotype-based penetrances.

Genetics. 2004 Dec;168(4):2339-48. doi: 10.1534/genetics.103.024653.

Geographical origin of Plasmodium vivax in the Hainan Island, China: insights from mitochondrial genome.

Malar J. 2023 Mar 8;22(1):84. doi: 10.1186/s12936-023-04520-7.

Comparative chloroplast-specific SNP and nSCoT markers analysis and population structure study in kiwifruit plants.

Hereditas. 2024 May 17;161(1):18. doi: 10.1186/s41065-024-00321-3.

A coalescence-guided hierarchical Bayesian method for haplotype inference.

Am J Hum Genet. 2006 Aug;79(2):313-22. doi: 10.1086/506276. Epub 2006 Jun 28.

Exact coalescent simulation of new haplotype data from existing reference haplotypes.

Bioinformatics. 2012 Mar 15;28(6):838-44. doi: 10.1093/bioinformatics/bts033. Epub 2012 Jan 17.

引用本文的文献

Tree-based QTL mapping with expected local genetic relatedness matrices.

Am J Hum Genet. 2023 Dec 7;110(12):2077-2091. doi: 10.1016/j.ajhg.2023.10.017.

Development of a Predictive Statistical Pharmacological Model for Local Anesthetic Agent Effects with Bayesian Hierarchical Model Parameter Estimation.

Medicines (Basel). 2023 Nov 15;10(11):61. doi: 10.3390/medicines10110061.

Tree-based QTL mapping with expected local genetic relatedness matrices.

bioRxiv. 2023 Apr 8:2023.04.07.536093. doi: 10.1101/2023.04.07.536093.

The Consequences of Mitochondrial T10432C Mutation in Cika Cattle: A "Potential" Model for Leber's Hereditary Optic Neuropathy.

Int J Mol Sci. 2022 Jun 6;23(11):6335. doi: 10.3390/ijms23116335.

A genealogical estimate of genetic relationships.

Am J Hum Genet. 2022 May 5;109(5):812-824. doi: 10.1016/j.ajhg.2022.03.016. Epub 2022 Apr 12.

Spatial modelling improves genetic evaluation in smallholder breeding programs.

Genet Sel Evol. 2020 Nov 16;52(1):69. doi: 10.1186/s12711-020-00588-w.

Inferring the Allelic Series at QTL in Multiparental Populations.

Genetics. 2020 Dec;216(4):957-983. doi: 10.1534/genetics.120.303393. Epub 2020 Oct 20.

本文引用的文献

Spatial disease mapping using directed acyclic graph auto-regressive (DAGAR) models.

Bayesian Anal. 2019 Dec;14(4):1221-1244. doi: 10.1214/19-ba1177. Epub 2019 Oct 3.

Phylogenetic Tree Inference: A Top-Down Approach to Track Tumor Evolution.

Front Genet. 2020 Feb 7;10:1371. doi: 10.3389/fgene.2019.01371. eCollection 2019.

Beyond Brownian Motion and the Ornstein-Uhlenbeck Process: Stochastic Diffusion Models for the Evolution of Quantitative Characters.

Am Nat. 2020 Feb;195(2):145-165. doi: 10.1086/706339. Epub 2019 Dec 17.

Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.

G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.

Genomic predictions in purebreds with a multibreed genomic relationship matrix1.

J Anim Sci. 2019 Nov 4;97(11):4418-4427. doi: 10.1093/jas/skz296.

Inferring whole-genome histories in large population datasets.

Nat Genet. 2019 Sep;51(9):1330-1338. doi: 10.1038/s41588-019-0483-y. Epub 2019 Sep 2.

Genetic analyses of diverse populations improves discovery for complex traits.

Nature. 2019 Jun;570(7762):514-518. doi: 10.1038/s41586-019-1310-4. Epub 2019 Jun 19.

Evolutionary perspectives on polygenic selection, missing heritability, and GWAS.

Hum Genet. 2020 Jan;139(1):5-21. doi: 10.1007/s00439-019-02040-6. Epub 2019 Jun 14.

Revisiting a Key Innovation in Evolutionary Biology: Felsenstein's "Phylogenies and the Comparative Method".

Am Nat. 2019 Jun;193(6):755-772. doi: 10.1086/703055. Epub 2019 Apr 23.

A decade of Genome Medicine: toward precision medicine.

Genome Med. 2019 Feb 28;11(1):13. doi: 10.1186/s13073-019-0624-z.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

单倍型对系统发育影响的层次建模

Hierarchical Modelling of Haplotype Effects on a Phylogeny.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献