全基因组互作分析的改进统计学方法。

Improved statistics for genome-wide interaction analysis.

机构信息

Faculty of Medicine, Yamagata University, Yamagata, Japan.

出版信息

PLoS Genet. 2012;8(4):e1002625. doi: 10.1371/journal.pgen.1002625. Epub 2012 Apr 5.

DOI:10.1371/journal.pgen.1002625

PMID:22496670

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3320596/

Abstract

Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new "joint effects" statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al.'s originally-proposed statistics, on account of the inflated error rate that can result.

摘要

最近，Wu 及其同事 [1] 提出了两种用于使用病例/对照或仅病例数据进行全基因组相互作用分析的新统计方法。在计算机模拟中，他们提出的病例/对照统计量优于竞争方法，包括 PLINK 中的快速上位效应选项和正确模型下的逻辑回归分析；然而，其优越性能的原因并未得到充分探讨。在这里，我们研究了 Wu 等人提出的统计方法的理论性质和性能，并解释了为什么在某些情况下它们优于竞争方法。不幸的是，我们发现他们的统计公式存在小错误，导致检验的Ⅰ型错误率高于名义值。我们还发现了 PLINK 的快速上位效应和仅病例统计中的小错误，尽管理论和模拟表明这些错误对Ⅰ型错误率只有微不足道的影响。我们提出了这四种统计方法的调整版本，这些调整版本在零假设下理论上和计算机模拟中都保持正确的Ⅰ型错误率。我们还研究了基于相关系数的统计方法，这些方法在保持Ⅰ型错误率控制相似的情况下也能保持类似的效果。虽然这些统计方法是专门为检验相互作用而设计的，但我们发现其中一些先前提出的统计方法实际上可能对一个或两个位点的主效应敏感，特别是在存在连锁不平衡的情况下。我们提出了两种新的“联合效应”统计方法，在疾病罕见的情况下，这些方法只对真正的相互作用效应敏感。在计算机模拟中，我们发现，在大多数考虑的情况下，在正确的遗传模型下进行分析可以获得最高的功效。然而，由于我们不知道这个模型，实际上这种分析是不可能的。然而，在广泛的场景下，我们的联合效应和调整后的 Wu 统计方法通常表现出较高的功效。我们建议使用这些替代或调整后的统计方法，并鉴于可能导致的误差率升高，谨慎使用 Wu 等人最初提出的统计方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29cd/3320596/898ce1370b5b/pgen.1002625.g001.jpg

相似文献

Improved statistics for genome-wide interaction analysis.全基因组互作分析的改进统计学方法。

PLoS Genet. 2012;8(4):e1002625. doi: 10.1371/journal.pgen.1002625. Epub 2012 Apr 5.

A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.一种基于基因组信息含量的新型统计方法，用于针对下一代测序数据的全基因组关联分析。

J Comput Biol. 2012 Jun;19(6):731-44. doi: 10.1089/cmb.2012.0035. Epub 2012 May 31.

From interaction to co-association --a Fisher r-to-z transformation-based simple statistic for real world genome-wide association study.从相互作用到共同关联--基于 Fisher r-to-z 变换的真实世界全基因组关联研究的简单统计量。

PLoS One. 2013 Jul 29;8(7):e70774. doi: 10.1371/journal.pone.0070774. Print 2013.

To control false positives in gene-gene interaction analysis: two novel conditional entropy-based approaches.在基因-基因相互作用分析中控制假阳性：两种基于条件熵的新方法。

PLoS One. 2013 Dec 10;8(12):e81984. doi: 10.1371/journal.pone.0081984. eCollection 2013.

Further investigations of the W-test for pairwise epistasis testing.用于成对上位性检验的W检验的进一步研究。

Wellcome Open Res. 2017 Jul 21;2:54. doi: 10.12688/wellcomeopenres.11926.1. eCollection 2017.

A cost-effective statistical method to correct for differential genotype misclassification when performing case-control genetic association.一种在进行病例对照基因关联研究时校正差异基因型错误分类的经济有效的统计方法。

Hum Hered. 2010;70(2):102-8. doi: 10.1159/000314470. Epub 2010 Jul 3.

METAINTER: meta-analysis of multiple regression models in genome-wide association studies.METAINTER：全基因组关联研究中多元回归模型的荟萃分析

Bioinformatics. 2015 Jan 15;31(2):151-7. doi: 10.1093/bioinformatics/btu629. Epub 2014 Sep 23.

Rapid and accurate multiple testing correction and power estimation for millions of correlated markers.针对数百万个相关标记物进行快速准确的多重检验校正和效能估计。

PLoS Genet. 2009 Apr;5(4):e1000456. doi: 10.1371/journal.pgen.1000456. Epub 2009 Apr 17.

IndOR: a new statistical procedure to test for SNP-SNP epistasis in genome-wide association studies.IndOR：一种用于全基因组关联研究中 SNP-SNP 互作检验的新统计方法。

Stat Med. 2012 Sep 20;31(21):2359-73. doi: 10.1002/sim.5364. Epub 2012 Jun 18.

Allowing for population stratification in case-only studies of gene-environment interaction, using genomic control.在仅针对病例的基因-环境相互作用研究中，采用基因组对照来考虑人群分层。

Hum Genet. 2015 Oct;134(10):1117-25. doi: 10.1007/s00439-015-1593-y. Epub 2015 Aug 22.

引用本文的文献

Epistasis regulates genetic control of cardiac hypertrophy.上位性调控心脏肥大的遗传控制。

Nat Cardiovasc Res. 2025 Jun;4(6):740-760. doi: 10.1038/s44161-025-00656-8. Epub 2025 Jun 5.

Genome-wide epistasis analysis reveals significant epistatic signals associated with Parkinson's disease risk.全基因组上位性分析揭示了与帕金森病风险相关的显著上位性信号。

Brain. 2025 Jun 3;148(6):2060-2074. doi: 10.1093/brain/awae398.

Identifying X-chromosome variants associated with age-related macular degeneration.识别与年龄相关性黄斑变性相关的X染色体变异。

Hum Mol Genet. 2024 Dec 6;33(24):2085-2093. doi: 10.1093/hmg/ddae141.

Poor statistical power in population-based association study of gene interaction.基于人群的基因相互作用关联研究中统计效能较差。

BMC Med Genomics. 2024 Apr 27;17(1):111. doi: 10.1186/s12920-024-01884-w.

Learning epistatic polygenic phenotypes with Boolean interactions.学习具有布尔交互作用的上位多基因表型。

PLoS One. 2024 Apr 16;19(4):e0298906. doi: 10.1371/journal.pone.0298906. eCollection 2024.

BridGE: a pathway-based analysis tool for detecting genetic interactions from GWAS.BridGE：一种基于通路的分析工具，用于从 GWAS 中检测遗传相互作用。

Nat Protoc. 2024 May;19(5):1400-1435. doi: 10.1038/s41596-024-00954-8. Epub 2024 Mar 21.

Epistasis regulates genetic control of cardiac hypertrophy.上位性调控心脏肥大的遗传控制。

medRxiv. 2024 May 4:2023.11.06.23297858. doi: 10.1101/2023.11.06.23297858.

A multi-threaded approach to genotype pattern mining for detecting digenic disease genes.一种用于检测双基因疾病基因的多线程基因型模式挖掘方法。

Front Genet. 2023 Aug 24;14:1222517. doi: 10.3389/fgene.2023.1222517. eCollection 2023.

Genetic Dissection of Epistatic Interactions Contributing Yield-Related Agronomic Traits in Rice Using the Compressed Mixed Model.利用压缩混合模型对水稻产量相关农艺性状上位性互作进行遗传剖析

Plants (Basel). 2022 Sep 26;11(19):2504. doi: 10.3390/plants11192504.

Detecting genetic epistasis by differential departure from independence.通过差异独立性偏离检测遗传上位性。

Mol Genet Genomics. 2022 Jul;297(4):911-924. doi: 10.1007/s00438-022-01893-3. Epub 2022 May 23.

本文引用的文献

A linear complexity phasing method for thousands of genomes.一种用于数千个基因组的线性复杂度相位分析方法。

Nat Methods. 2011 Dec 4;9(2):179-81. doi: 10.1038/nmeth.1785.

Genome partitioning of genetic variation for complex traits using common SNPs.利用常见 SNP 对复杂性状的遗传变异进行基因组分区。

Nat Genet. 2011 Jun;43(6):519-25. doi: 10.1038/ng.823. Epub 2011 May 8.

Genome-wide association study identifies 12 new susceptibility loci for primary biliary cirrhosis.全基因组关联研究鉴定出原发性胆汁性胆管炎的 12 个新易感性位点。

Nat Genet. 2011 Mar 13;43(4):329-32. doi: 10.1038/ng.789.

Large-scale exploration of gene-gene interactions in prostate cancer using a multistage genome-wide association study.利用多阶段全基因组关联研究大规模探索前列腺癌中的基因-基因相互作用。

Cancer Res. 2011 May 1;71(9):3287-95. doi: 10.1158/0008-5472.CAN-10-2646. Epub 2011 Mar 3.

EPIBLASTER-fast exhaustive two-locus epistasis detection strategy using graphical processing units.利用图形处理单元的 EPIBLASTER——快速穷尽双基因座上位性检测策略。

Eur J Hum Genet. 2011 Apr;19(4):465-71. doi: 10.1038/ejhg.2010.196. Epub 2010 Dec 8.

The meaning of interaction.相互作用的含义。

Hum Hered. 2010;70(4):269-77. doi: 10.1159/000321967. Epub 2010 Dec 8.

A novel statistic for genome-wide interaction analysis.一种用于全基因组互作分析的新统计量。

PLoS Genet. 2010 Sep 23;6(9):e1001131. doi: 10.1371/journal.pgen.1001131.

Common SNPs explain a large proportion of the heritability for human height.常见的单核苷酸多态性解释了人类身高遗传的很大一部分。

Nat Genet. 2010 Jul;42(7):565-9. doi: 10.1038/ng.608. Epub 2010 Jun 20.

Using principal components of genetic variation for robust and powerful detection of gene-gene interactions in case-control and case-only studies.利用遗传变异的主成分在病例对照和仅病例研究中稳健且有效地检测基因-基因交互作用。

Am J Hum Genet. 2010 Mar 12;86(3):331-42. doi: 10.1016/j.ajhg.2010.01.026. Epub 2010 Mar 4.

Finding the missing heritability of complex diseases.寻找复杂疾病中缺失的遗传力。

Nature. 2009 Oct 8;461(7265):747-53. doi: 10.1038/nature08494.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

全基因组互作分析的改进统计学方法。

Improved statistics for genome-wide interaction analysis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献