Enrichment of statistical power for genome-wide association studies.

Authors

Li Meng, Liu Xiaolei, Bradbury Peter, Yu Jianming, Zhang Yuan-Ming, Todhunter Rory J, Buckler Edward S, Zhang Zhiwu

Affiliation

Institute for Genomic Diversity, Cornell University, Ithaca 14853, New York, USA.

Publication information

BMC Biol. 2014 Oct 17;12:73. doi: 10.1186/s12915-014-0073-5.

DOI:10.1186/s12915-014-0073-5

PMID:25322753

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4210555/

Abstract

BACKGROUND

The inheritance of most human diseases and agriculturally important traits is controlled by many genes with small effects. Identifying these genes, while simultaneously controlling false positives, is challenging. Among available statistical methods, the mixed linear model (MLM) has been the most flexible and powerful for controlling population structure and individual unequal relatedness (kinship), the two common causes of spurious associations. The introduction of the compressed MLM (CMLM) method provided additional opportunities for optimization by adding two new model parameters: grouping algorithms and number of groups.
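
For orientation, the mixed linear model referred to here is usually written in the following general form. The notation is a common textbook convention rather than something stated in this abstract, so the symbols should be read as assumptions: $y$ is the vector of phenotypes, $X\beta$ collects fixed effects (typically population structure covariates and the marker being tested), $u$ is a vector of random genetic effects whose covariance is proportional to the kinship matrix $K$, and $e$ is the residual.

```latex
\begin{aligned}
y &= X\beta + Zu + e, \\
u &\sim N\!\left(0,\ \sigma_g^{2} K\right), \qquad
e \sim N\!\left(0,\ \sigma_e^{2} I\right).
\end{aligned}
```

Under this reading, the MLM treats $u$ as effects of individuals, while the compressed MLM treats $u$ as effects of groups of individuals, so that $K$ becomes a kinship matrix among groups and $Z$ maps each individual to its group.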

RESULTS

This study introduces another model parameter to develop an enriched CMLM (ECMLM). The parameter involves algorithms to define kinship between groups (that is, kinship algorithms). The ECMLM calculates kinship using several different algorithms and then chooses the best combination between kinship algorithms and grouping algorithms.
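
The idea of a "kinship algorithm" can be illustrated with a minimal sketch: given a kinship matrix among individuals and an assignment of individuals to groups, each candidate algorithm summarizes the block of pairwise kinships between two groups into a single number. The function name `group_kinship`, the toy data, and the four summary statistics below are illustrative assumptions, not the authors' implementation; the abstract does not list which algorithms are used or how the best combination is chosen, so the selection step is left out.

```python
import numpy as np

def group_kinship(K, groups, summary=np.mean):
    """Compress an individual-level kinship matrix into a group-level one.

    K       : (n, n) symmetric kinship matrix among individuals
    groups  : length-n array of group labels, one per individual
    summary : function that reduces a block of pairwise kinships to one
              number (hypothetical stand-in for a "kinship algorithm")
    """
    K = np.asarray(K, dtype=float)
    groups = np.asarray(groups)
    labels = np.unique(groups)
    G = np.empty((labels.size, labels.size))
    for a, la in enumerate(labels):
        for b, lb in enumerate(labels):
            # All pairwise kinships between members of group la and group lb.
            # For a == b this block includes self-kinship on the diagonal,
            # which is a simplification made for this sketch.
            block = K[np.ix_(groups == la, groups == lb)]
            G[a, b] = summary(block)
    return labels, G

# Toy individual-level kinship matrix (made-up values) and a grouping
# of four individuals into two groups.
K = np.array([
    [1.0, 0.8, 0.1, 0.2],
    [0.8, 1.0, 0.3, 0.0],
    [0.1, 0.3, 1.0, 0.6],
    [0.2, 0.0, 0.6, 1.0],
])
groups = np.array([0, 0, 1, 1])

# Candidate summaries; each yields a different group kinship matrix.
candidates = {"mean": np.mean, "median": np.median, "max": np.max, "min": np.min}
group_K = {name: group_kinship(K, groups, fn)[1] for name, fn in candidates.items()}
```

In a full analysis, each group kinship matrix produced this way would be paired with each grouping (clustering) result and the resulting models compared by some fit criterion; that comparison is not shown because the abstract does not describe it.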

CONCLUSION

Simulations show that the ECMLM increases statistical power. In some cases, the magnitude of power gained by using ECMLM instead of CMLM is larger than the improvement found by using CMLM instead of MLM.
