Mensah-Ablorh Akweley, Lindstrom Sara, Haiman Christopher A, Henderson Brian E, Marchand Loic Le, Lee Seunngeun, Stram Daniel O, Eliassen A Heather, Price Alkes, Kraft Peter
Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, United States of America.
Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, Massachusetts, United States of America.
Genet Epidemiol. 2016 Jan;40(1):57-65. doi: 10.1002/gepi.21939. Epub 2015 Dec 7.
Several methods have been proposed to increase power in rare variant association testing by aggregating information from individual rare variants (MAF < 0.005). However, how to best combine rare variants across multiple ethnicities and the relative performance of designs using different ethnic sampling fractions remains unknown. In this study, we compare the performance of several statistical approaches for assessing rare variant associations across multiple ethnicities. We also explore how different ethnic sampling fractions perform, including single-ethnicity studies and studies that sample up to four ethnicities. We conducted simulations based on targeted sequencing data from 4,611 women in four ethnicities (African, European, Japanese American, and Latina). As with single-ethnicity studies, burden tests had greater power when all causal rare variants were deleterious, and variance component-based tests had greater power when some causal rare variants were deleterious and some were protective. Multiethnic studies had greater power than single-ethnicity studies at many loci, with inclusion of African Americans providing the largest impact. On average, studies including African Americans had as much as 20% greater power than equivalently sized studies without African Americans. This suggests that association studies between rare variants and complex disease should consider including subjects from multiple ethnicities, with preference given to genetically diverse groups.
已经提出了几种方法,通过汇总来自单个罕见变异(小等位基因频率<0.005)的信息来提高罕见变异关联测试的效能。然而,如何在多个种族中最佳地组合罕见变异以及使用不同种族抽样比例的设计的相对性能仍然未知。在本研究中,我们比较了几种评估多个种族中罕见变异关联的统计方法的性能。我们还探讨了不同种族抽样比例的表现,包括单一种族研究和抽样多达四个种族的研究。我们基于对四个种族(非洲裔、欧洲裔、日裔美国人和拉丁裔)的4611名女性的靶向测序数据进行了模拟。与单一种族研究一样,当所有因果罕见变异都是有害的时候,负担检验具有更高的效能;当一些因果罕见变异是有害的而一些是保护性的时候,基于方差成分的检验具有更高的效能。在许多位点上,多民族研究比单一种族研究具有更高的效能,纳入非裔美国人产生的影响最大。平均而言,纳入非裔美国人的研究比规模相当但未纳入非裔美国人的研究的效能高出多达20%。这表明罕见变异与复杂疾病之间的关联研究应考虑纳入多个种族的受试者,优先选择基因多样化的群体。