Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.
Nat Genet. 2012 May 29;44(6):623-30. doi: 10.1038/ng.2303.
Exome sequencing is emerging as a popular approach to study the effect of rare coding variants on complex phenotypes. The promise of exome sequencing is grounded in theoretical population genetics and in empirical successes of candidate gene sequencing studies. Many projects aimed at common diseases are underway, and their results are eagerly anticipated. In this Perspective, using exome sequencing data from 438 individuals, we discuss several aspects of exome sequencing studies that we view as particularly important. We review processing and quality control of raw sequence data, evaluate the statistical properties of exome sequencing studies, discuss rare variant burden tests to detect association to phenotypes, and demonstrate the importance of accounting for population stratification in the analysis of rare variants. We conclude that enthusiasm for exome sequencing studies of complex traits should be combined with the caution that thousands of samples may be required to reach sufficient statistical power.
外显子组测序作为一种研究罕见编码变异对复杂表型影响的方法正在兴起。外显子组测序的前景基于理论群体遗传学和候选基因测序研究的经验成功。许多针对常见疾病的项目正在进行中,人们热切期待着这些项目的结果。在这篇观点文章中,我们使用了 438 个人的外显子组测序数据,讨论了我们认为特别重要的外显子组测序研究的几个方面。我们回顾了原始序列数据的处理和质量控制,评估了外显子组测序研究的统计特性,讨论了用于检测与表型关联的罕见变异负担测试,并演示了在分析罕见变异时考虑群体分层的重要性。我们的结论是,对外显子组测序研究复杂性状的热情应该与以下观点结合起来,即可能需要数千个样本才能达到足够的统计效力。