Santorico Stephanie A, Edwards Karen L
Department of Mathematical and Statistical Sciences, University of Colorado, Denver, CO, USA.
Genet Epidemiol. 2014 Sep;38 Suppl 1:S92-6. doi: 10.1002/gepi.21832.
Whole-genome sequencing (WGS) is becoming an affordable technology for the study of the genetics of complex traits. With any new technology, experimental designs and statistical methods, both old and new, must be evaluated. One design seeing a resurgence of interest is the use of families. Genetic Analysis Workshop 18 provided the opportunity to evaluate statistical methods applied to WGS data for family-based studies. We summarize the results of five contributions that used linkage in the context of WGS. The investigators took differing approaches, including assessment of false-positive rates in classic two-point linkage, the effects of heterogeneity on linkage and association tests, and the use of linkage to focus association tests. We describe the primary findings of each contribution and note challenges that are not new to those working in family designs or specific to WGS data; for example, choice of phenotype definition, covariate adjustment, and use of longitudinal data may produce different results, making comparisons challenging. We detail new issues brought about by WGS, such as the elevated genome-wide false-positive rate for classic two-point parametric linkage analysis, computational demands in multipoint calculations, and lack of clarity in how to best use linkage to focus association testing. Finally, we comment on when linkage may be helpful for WGS, highlighting where additional research is needed; for example, although linkage analysis has been successful in the study of rare variants of large effect, how to best use family information in the context of rare variants of moderate effect remains an open research question.
全基因组测序(WGS)正成为一种可负担得起的用于研究复杂性状遗传学的技术。对于任何新技术,都必须对新旧实验设计和统计方法进行评估。一种重新受到关注的设计是使用家系。遗传分析研讨会18提供了一个机会,来评估应用于基于家系研究的WGS数据的统计方法。我们总结了五项在WGS背景下使用连锁分析的研究成果。研究人员采用了不同的方法,包括评估经典两点连锁分析中的假阳性率、基因异质性对连锁和关联检验的影响,以及使用连锁分析来聚焦关联检验。我们描述了每项研究的主要发现,并指出了对于从事家系设计工作的人员来说并非新问题、也不是WGS数据所特有的挑战;例如,表型定义的选择、协变量调整以及纵向数据的使用可能会产生不同的结果,这使得比较具有挑战性。我们详细阐述了WGS带来的新问题,例如经典两点参数连锁分析中全基因组假阳性率升高、多点计算中的计算需求,以及如何最好地使用连锁分析来聚焦关联检验尚不明确。最后,我们评论了连锁分析何时可能对WGS有帮助,强调了需要进一步研究的方面;例如,尽管连锁分析在研究大效应罕见变异方面取得了成功,但在中等效应罕见变异的背景下如何最好地利用家系信息仍然是一个开放的研究问题。