Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France.
CHU Brest, Brest, France.
Genet Epidemiol. 2019 Sep;43(6):646-656. doi: 10.1002/gepi.22210. Epub 2019 May 13.
Genetic association studies have provided new insights into the genetic variability of human complex traits with a focus mainly on continuous or binary traits. Methods have been proposed to take into account disease heterogeneity between subgroups of patients when studying common variants but none was specifically designed for rare variants. Because rare variants are expected to have stronger effects and to be more heterogeneously distributed among cases than common ones, subgroup analyses might be particularly attractive in this context. To address this issue, we propose an extension of burden tests by using a multinomial regression model, which enables association tests between rare variants and multicategory phenotypes. We evaluated the type I error and the power of two burden tests, CAST and WSS, by simulating data under different scenarios. In the case of genetic heterogeneity between case subgroups, we showed an advantage of multinomial regression over logistic regression, which considers all the cases against the controls. We replicated these results on real data from Moyamoya disease where the burden tests performed better when cases were stratified according to age-of-onset. We implemented the functions for association tests in the R package "Ravages" available on Github.
遗传关联研究为人类复杂特征的遗传变异性提供了新的见解,主要侧重于连续或二元特征。已经提出了一些方法来考虑患者亚组之间的疾病异质性,以研究常见变体,但没有专门针对罕见变体设计的方法。由于罕见变体预计比常见变体具有更强的影响,并且在病例中的分布更为异质,因此在这种情况下,亚组分析可能特别有吸引力。为了解决这个问题,我们提出了一种扩展的负担测试,使用多项式回归模型,该模型可以在罕见变体和多类别表型之间进行关联测试。我们通过模拟不同情况下的数据来评估 CAST 和 WSS 这两种负担测试的Ⅰ型错误和功效。在病例亚组之间存在遗传异质性的情况下,我们表明多项式回归比考虑所有病例与对照的逻辑回归更有优势。我们在 Moyamoya 病的真实数据上复制了这些结果,其中根据发病年龄对病例进行分层时,负担测试的效果更好。我们在 Github 上的 R 包“Ravages”中实现了关联测试的功能。