Lin Dan-Yu
Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina.
Genet Epidemiol. 2019 Jun;43(4):365-372. doi: 10.1002/gepi.22183. Epub 2019 Jan 8.
Whole-exome sequencing (WES) and whole-genome sequencing (WGS) studies are underway to investigate the impact of genetic variants on complex diseases and traits. It is customary to perform single-variant association tests for common variants and region-based association tests for rare variants. The latter may target variants with similar or opposite effects, interrogate variants with different frequencies or different functional annotations, and examine a variety of regions. The large number of tests that are performed necessitates adjustment for multiple testing. The conventional Bonferroni correction is overly conservative as the test statistics are correlated. To address this challenge, we propose a simple and accurate method based on parametric bootstrap to assess genomewide significance. We show that the correlations of the test statistics are determined primarily by the genotypes, such that the same significance threshold can be used in different studies that share a common sequencing platform. We demonstrate the usefulness of the proposed method with WES data from the National Heart, Lung, and Blood Institute Exome Sequencing Project and WGS data from the 1000 Genomes Project. We recommend the p value of as the genomewide significance threshold for testing all common and low-frequency variants (MAFs 0.1%) in the human genome.
全外显子组测序(WES)和全基因组测序(WGS)研究正在进行中,以调查基因变异对复杂疾病和性状的影响。对于常见变异进行单变异关联测试,对于罕见变异进行基于区域的关联测试,这是惯例。后者可能针对具有相似或相反效应的变异,询问具有不同频率或不同功能注释的变异,并检查各种区域。所进行的大量测试需要对多重检验进行校正。由于检验统计量是相关的,传统的Bonferroni校正过于保守。为应对这一挑战,我们提出一种基于参数自助法的简单而准确的方法来评估全基因组显著性。我们表明,检验统计量的相关性主要由基因型决定,因此在共享共同测序平台的不同研究中可以使用相同的显著性阈值。我们用来自美国国立心肺血液研究所外显子组测序项目的WES数据和来自千人基因组计划的WGS数据证明了所提出方法的实用性。我们建议将 的p值作为检测人类基因组中所有常见和低频变异(MAFs 0.1%)的全基因组显著性阈值。