Saad Mohamad, Wijsman Ellen M
Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, United States of America; Department of Biostatistics, University of Washington, Seattle, Washington, United States of America.
Genet Epidemiol. 2014 Nov;38(7):579-90. doi: 10.1002/gepi.21844. Epub 2014 Aug 1.
In the last two decades, complex traits have become the main focus of genetic studies. The hypothesis that both rare and common variants are associated with complex traits is increasingly being discussed. Family-based association studies using relatively large pedigrees are suitable for both rare and common variant identification. Because of the high cost of sequencing technologies, imputation methods are important for increasing the amount of information at low cost. A recent family-based imputation method, Genotype Imputation Given Inheritance (GIGI), is able to handle large pedigrees and accurately impute rare variants, but does less well for common variants where population-based methods perform better. Here, we propose a flexible approach to combine imputation data from both family- and population-based methods. We also extend the Sequence Kernel Association Test for Rare and Common variants (SKAT-RC), originally proposed for data from unrelated subjects, to family data in order to make use of such imputed data. We call this extension "famSKAT-RC." We compare the performance of famSKAT-RC and several other existing burden and kernel association tests. In simulated pedigree sequence data, our results show an increase of imputation accuracy from use of our combining approach. Also, they show an increase of power of the association tests with this approach over the use of either family- or population-based imputation methods alone, in the context of rare and common variants. Moreover, our results show better performance of famSKAT-RC compared to the other considered tests, in most scenarios investigated here.
在过去二十年中,复杂性状已成为基因研究的主要焦点。关于罕见变异和常见变异均与复杂性状相关的假说正越来越多地被讨论。使用相对较大家系的基于家系的关联研究适用于罕见变异和常见变异的识别。由于测序技术成本高昂,推断方法对于低成本增加信息量很重要。最近一种基于家系的推断方法,即遗传给定基因型推断(GIGI),能够处理大型家系并准确推断罕见变异,但对于常见变异的处理效果较差,而基于群体的方法在处理常见变异方面表现更好。在此,我们提出一种灵活的方法来结合基于家系和基于群体的方法的推断数据。我们还将最初为无关个体数据提出的罕见和常见变异序列核关联检验(SKAT-RC)扩展到家系数据,以便利用此类推断数据。我们将此扩展称为“famSKAT-RC”。我们比较了famSKAT-RC与其他几种现有的负担和核关联检验的性能。在模拟的家系序列数据中,我们的结果表明,使用我们的组合方法可提高推断准确性。此外,在罕见和常见变异的背景下,与单独使用基于家系或基于群体的推断方法相比,使用此方法可提高关联检验的效能。而且,在本文研究的大多数情况下,我们的结果表明famSKAT-RC比其他考虑的检验表现更好。