Kuchenbaecker Karoline, Appel Emil Vincent Rosenbaum
Wellcome Trust Sanger Institute, Cambridge, UK.
University College London, London, UK.
Methods Mol Biol. 2018;1793:51-71. doi: 10.1007/978-1-4939-7868-7_5.
While genome-wide association studies have been very successful in identifying associations of common genetic variants with many different traits, the rarer frequency spectrum of the genome has not yet been comprehensively explored. Technological developments increasingly lift restrictions to access rare genetic variation. Dense reference panels enable improved genotype imputation for rarer variants in studies using DNA microarrays. Moreover, the decreasing cost of next generation sequencing makes whole exome and genome sequencing increasingly affordable for large samples. Large-scale efforts based on sequencing, such as ExAC, 100,000 Genomes, and TopMed, are likely to significantly advance this field.The main challenge in evaluating complex trait associations of rare variants is statistical power. The choice of population should be considered carefully because allele frequencies and linkage disequilibrium structure differ between populations. Genetically isolated populations can have favorable genomic characteristics for the study of rare variants.One strategy to increase power is to assess the combined effect of multiple rare variants within a region, known as aggregate testing. A range of methods have been developed for this. Model performance depends on the genetic architecture of the region of interest.
虽然全基因组关联研究在识别常见基因变异与许多不同性状的关联方面非常成功,但基因组中较罕见的频率谱尚未得到全面探索。技术发展越来越多地解除了对获取罕见基因变异的限制。密集的参考面板能够在使用DNA微阵列的研究中改善对较罕见变异的基因型推测。此外,下一代测序成本的降低使得对大样本进行全外显子组和基因组测序越来越经济实惠。基于测序的大规模项目,如ExAC、10万基因组计划和TopMed,可能会显著推动该领域的发展。评估罕见变异的复杂性状关联的主要挑战在于统计效力。选择人群时应谨慎考虑,因为不同人群的等位基因频率和连锁不平衡结构存在差异。遗传隔离人群对于罕见变异的研究可能具有有利的基因组特征。增加效力的一种策略是评估一个区域内多个罕见变异的联合效应,即所谓的聚合检验。为此已经开发了一系列方法。模型性能取决于感兴趣区域的遗传结构。