• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于亲缘关系对单核苷酸多态性关联分析的影响

On the impact of relatedness on SNP association analysis.

作者信息

Gross Arnd, Tönjes Anke, Scholz Markus

机构信息

Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Haertelstrasse 16-18, Leipzig, 04107, Germany.

LIFE - Leipzig Research Center for Civilization Diseases, University of Leipzig, Philipp-Rosenthal-Strasse 27, Leipzig, 04103, Germany.

出版信息

BMC Genet. 2017 Dec 6;18(1):104. doi: 10.1186/s12863-017-0571-x.

DOI:10.1186/s12863-017-0571-x
PMID:29212447
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5719591/
Abstract

BACKGROUND

When testing for SNP (single nucleotide polymorphism) associations in related individuals, observations are not independent. Simple linear regression assuming independent normally distributed residuals results in an increased type I error and the power of the test is also affected in a more complicate manner. Inflation of type I error is often successfully corrected by genomic control. However, this reduces the power of the test when relatedness is of concern. In the present paper, we derive explicit formulae to investigate how heritability and strength of relatedness contribute to variance inflation of the effect estimate of the linear model. Further, we study the consequences of variance inflation on hypothesis testing and compare the results with those of genomic control correction. We apply the developed theory to the publicly available HapMap trio data (N=129), the Sorbs (a self-contained population with N=977 characterised by a cryptic relatedness structure) and synthetic family studies with different sample sizes (ranging from N=129 to N=999) and different degrees of relatedness.

RESULTS

We derive explicit and easily to apply approximation formulae to estimate the impact of relatedness on the variance of the effect estimate of the linear regression model. Variance inflation increases with increasing heritability. Relatedness structure also impacts the degree of variance inflation as shown for example family structures. Variance inflation is smallest for HapMap trios, followed by a synthetic family study corresponding to the trio data but with larger sample size than HapMap. Next strongest inflation is observed for the Sorbs, and finally, for a synthetic family study with a more extreme relatedness structure but with similar sample size as the Sorbs. Type I error increases rapidly with increasing inflation. However, for smaller significance levels, power increases with increasing inflation while the opposite holds for larger significance levels. When genomic control is applied, type I error is preserved while power decreases rapidly with increasing variance inflation.

CONCLUSIONS

Stronger relatedness as well as higher heritability result in increased variance of the effect estimate of simple linear regression analysis. While type I error rates are generally inflated, the behaviour of power is more complex since power can be increased or reduced in dependence on relatedness and the heritability of the phenotype. Genomic control cannot be recommended to deal with inflation due to relatedness. Although it preserves type I error, the loss in power can be considerable. We provide a simple formula for estimating variance inflation given the relatedness structure and the heritability of a trait of interest. As a rule of thumb, variance inflation below 1.05 does not require correction and simple linear regression analysis is still appropriate.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/362776825454/12863_2017_571_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/4f5df61ab565/12863_2017_571_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/538b5b3d3353/12863_2017_571_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/362776825454/12863_2017_571_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/4f5df61ab565/12863_2017_571_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/538b5b3d3353/12863_2017_571_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0df/5719591/362776825454/12863_2017_571_Fig3_HTML.jpg
摘要

背景

在对相关个体进行单核苷酸多态性(SNP)关联测试时,观测值并非相互独立。假设残差独立且呈正态分布的简单线性回归会导致I型错误增加,并且检验效能也会以更复杂的方式受到影响。I型错误的膨胀通常可通过基因组控制成功校正。然而,当考虑亲缘关系时,这会降低检验效能。在本文中,我们推导出显式公式,以研究遗传力和亲缘关系强度如何导致线性模型效应估计值的方差膨胀。此外,我们研究方差膨胀对假设检验的影响,并将结果与基因组控制校正的结果进行比较。我们将所发展的理论应用于公开可用的HapMap三人组数据(N = 129)、索布人(一个具有神秘亲缘关系结构的独立群体,N = 977)以及不同样本量(从N = 129到N = 999)和不同亲缘关系程度的合成家系研究。

结果

我们推导出了显式且易于应用的近似公式,以估计亲缘关系对线性回归模型效应估计值方差的影响。方差膨胀随遗传力的增加而增加。亲缘关系结构也会影响方差膨胀程度,例如家族结构所示。HapMap三人组的方差膨胀最小,其次是与三人组数据对应的合成家系研究,但样本量比HapMap大。索布人的方差膨胀次之,最后是具有更极端亲缘关系结构但样本量与索布人相似的合成家系研究。I型错误随方差膨胀的增加而迅速增加。然而,对于较小的显著性水平,效能随方差膨胀的增加而增加,而对于较大的显著性水平则相反。应用基因组控制时,I型错误得以保留,但效能会随着方差膨胀的增加而迅速降低。

结论

更强的亲缘关系以及更高的遗传力会导致简单线性回归分析效应估计值的方差增加。虽然I型错误率通常会膨胀,但效能的行为更为复杂,因为效能可能会根据亲缘关系和表型的遗传力而增加或降低。不建议使用基因组控制来处理因亲缘关系导致的膨胀。尽管它保留了I型错误,但效能损失可能相当大。我们提供了一个简单的公式,用于在给定亲缘关系结构和感兴趣性状的遗传力的情况下估计方差膨胀。经验法则是,方差膨胀低于1.05时无需校正,简单线性回归分析仍然适用。

相似文献

1
On the impact of relatedness on SNP association analysis.关于亲缘关系对单核苷酸多态性关联分析的影响
BMC Genet. 2017 Dec 6;18(1):104. doi: 10.1186/s12863-017-0571-x.
2
Statistical distributions of test statistics used for quantitative trait association mapping in structured populations.结构群体中用于数量性状关联作图的检验统计量的统计分布。
Genet Sel Evol. 2012 Nov 12;44(1):32. doi: 10.1186/1297-9686-44-32.
3
Population-genetic comparison of the Sorbian isolate population in Germany with the German KORA population using genome-wide SNP arrays.利用全基因组 SNP 芯片对德国索布族隔离人群与德国 KORA 人群进行群体遗传学比较。
BMC Genet. 2011 Jul 28;12:67. doi: 10.1186/1471-2156-12-67.
4
Accounting for relatedness in family based genetic association studies.在基于家系的基因关联研究中考虑亲缘关系。
Hum Hered. 2007;64(4):234-42. doi: 10.1159/000103861. Epub 2007 Jun 14.
5
Partitioning Phenotypic Variance Due to Parent-of-Origin Effects Using Genomic Relatedness Matrices.使用基因组亲缘关系矩阵划分源于亲本效应的表型变异
Behav Genet. 2018 Jan;48(1):67-79. doi: 10.1007/s10519-017-9880-0. Epub 2017 Nov 2.
6
A family-based joint test for mean and variance heterogeneity for quantitative traits.一种基于家系的数量性状均值和方差异质性联合检验。
Ann Hum Genet. 2015 Jan;79(1):46-56. doi: 10.1111/ahg.12089. Epub 2014 Nov 13.
7
Leveraging LD eigenvalue regression to improve the estimation of SNP heritability and confounding inflation.利用 LD 特征值回归提高 SNP 遗传力和混杂膨胀的估计。
Am J Hum Genet. 2022 May 5;109(5):802-811. doi: 10.1016/j.ajhg.2022.03.013. Epub 2022 Apr 13.
8
Beyond SNP heritability: Polygenicity and discoverability of phenotypes estimated with a univariate Gaussian mixture model.超越 SNP 遗传力:使用单变量高斯混合模型估计表型的多基因性和可发现性。
PLoS Genet. 2020 May 19;16(5):e1008612. doi: 10.1371/journal.pgen.1008612. eCollection 2020 May.
9
Comparison of variance components, ANOVA and regression of offspring on midparent (ROMP) methods for SNP markers.单核苷酸多态性(SNP)标记的方差成分比较、方差分析以及子代对中亲值回归(ROMP)方法
Genet Epidemiol. 2001;21 Suppl 1:S794-9. doi: 10.1002/gepi.2001.21.s1.s794.
10
Insights into the genetic architecture of morphological traits in two passerine bird species.对两种雀形目鸟类形态特征遗传结构的见解。
Heredity (Edinb). 2017 Sep;119(3):197-205. doi: 10.1038/hdy.2017.29. Epub 2017 Jun 14.

引用本文的文献

1
MethylGenotyper: Accurate Estimation of SNP Genotypes and Genetic Relatedness from DNA Methylation Data.MethylGenotyper:从 DNA 甲基化数据中准确估计 SNP 基因型和遗传关系。
Genomics Proteomics Bioinformatics. 2024 Sep 13;22(3). doi: 10.1093/gpbjnl/qzae044.
2
Genome-wide meta-analysis of phytosterols reveals five novel loci and a detrimental effect on coronary atherosclerosis.全基因组荟萃分析揭示了植物甾醇的五个新位点,并对冠状动脉粥样硬化有不良影响。
Nat Commun. 2022 Jan 10;13(1):143. doi: 10.1038/s41467-021-27706-6.
3
Genome wide association study of body weight and feed efficiency traits in a commercial broiler chicken population, a re-visitation.

本文引用的文献

1
Fast Genome-Wide QTL Association Mapping on Pedigree and Population Data.基于系谱和群体数据的快速全基因组QTL关联图谱分析
Genet Epidemiol. 2017 Apr;41(3):174-186. doi: 10.1002/gepi.21988. Epub 2016 Dec 12.
2
Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology.通过多基因座混合线性模型方法提高全基因组关联研究的效能和准确性。
Sci Rep. 2016 Jan 20;6:19444. doi: 10.1038/srep19444.
3
Genome wide meta-analysis highlights the role of genetic variation in RARRES2 in the regulation of circulating serum chemerin.
对商业肉鸡群体的体重和饲料效率性状进行全基因组关联研究,再访。
Sci Rep. 2019 Jan 29;9(1):922. doi: 10.1038/s41598-018-37216-z.
全基因组荟萃分析突出了RARRES2基因变异在循环血清chemerin调节中的作用。
PLoS Genet. 2014 Dec 18;10(12):e1004854. doi: 10.1371/journal.pgen.1004854. eCollection 2014 Dec.
4
Comparison of methods to account for relatedness in genome-wide association studies with family-based data.在基于家系数据的全基因组关联研究中考虑亲缘关系的方法比较。
PLoS Genet. 2014 Jul 17;10(7):e1004445. doi: 10.1371/journal.pgen.1004445. eCollection 2014 Jul.
5
Advantages and pitfalls in the application of mixed-model association methods.混合模型关联方法应用的优缺点。
Nat Genet. 2014 Feb;46(2):100-6. doi: 10.1038/ng.2876.
6
Region-based association analysis of human quantitative traits in related individuals.基于区域的相关个体人类数量性状的关联分析。
PLoS One. 2013 Jun 17;8(6):e65395. doi: 10.1371/journal.pone.0065395. Print 2013.
7
Statistical distributions of test statistics used for quantitative trait association mapping in structured populations.结构群体中用于数量性状关联作图的检验统计量的统计分布。
Genet Sel Evol. 2012 Nov 12;44(1):32. doi: 10.1186/1297-9686-44-32.
8
Genome-wide efficient mixed-model analysis for association studies.全基因组高效混合模型关联分析。
Nat Genet. 2012 Jun 17;44(7):821-4. doi: 10.1038/ng.2310.
9
Population-genetic comparison of the Sorbian isolate population in Germany with the German KORA population using genome-wide SNP arrays.利用全基因组 SNP 芯片对德国索布族隔离人群与德国 KORA 人群进行群体遗传学比较。
BMC Genet. 2011 Jul 28;12:67. doi: 10.1186/1471-2156-12-67.
10
Genetic variation in the Sorbs of eastern Germany in the context of broader European genetic diversity.在更广泛的欧洲遗传多样性背景下,德国东部索布人的遗传变异。
Eur J Hum Genet. 2011 Sep;19(9):995-1001. doi: 10.1038/ejhg.2011.65. Epub 2011 May 11.