Luningham Justin M, McArtor Daniel B, Hendriks Anne M, van Beijsterveldt Catharina E M, Lichtenstein Paul, Lundström Sebastian, Larsson Henrik, Bartels Meike, Boomsma Dorret I, Lubke Gitta H
Department of Psychology, University of Notre Dame, Notre Dame, IN, United States.
Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands.
Front Genet. 2019 Dec 10;10:1227. doi: 10.3389/fgene.2019.01227. eCollection 2019.
Parallel meta-analysis is a popular approach for increasing the power to detect genetic effects in genome-wide association studies across multiple cohorts. Consortia studying the genetics of behavioral phenotypes are oftentimes faced with systematic differences in phenotype measurement across cohorts, introducing heterogeneity into the meta-analysis and reducing statistical power. This study investigated integrative data analysis (IDA) as an approach for jointly modeling the phenotype across multiple datasets. We put forth a bi-factor integration model (BFIM) that provides a single common phenotype score and accounts for sources of study-specific variability in the phenotype. In order to capitalize on this modeling strategy, a phenotype reference panel was utilized as a supplemental sample with complete data on all behavioral measures. A simulation study showed that a mega-analysis of genetic variant effects in a BFIM were more powerful than meta-analysis of genetic effects on a cohort-specific sum score of items. Saving the factor scores from the BFIM and using those as the outcome in meta-analysis was also more powerful than the sum score in most simulation conditions, but a small degree of bias was introduced by this approach. The reference panel was necessary to realize these power gains. An empirical demonstration used the BFIM to harmonize aggression scores in 9-year old children across the Netherlands Twin Register and the Child and Adolescent Twin Study in Sweden, providing a template for application of the BFIM to a range of different phenotypes. A supplemental data collection in the Netherlands Twin Register served as a reference panel for phenotype modeling across both cohorts. Our results indicate that model-based harmonization for the study of complex traits is a useful step within genetic consortia.
平行荟萃分析是一种常用方法,可增强在多个队列的全基因组关联研究中检测基因效应的能力。研究行为表型遗传学的联盟常常面临不同队列间表型测量的系统差异,这会给荟萃分析引入异质性并降低统计效能。本研究调查了整合数据分析(IDA)作为一种在多个数据集中联合对表型进行建模的方法。我们提出了一种双因素整合模型(BFIM),该模型可提供单一的共同表型分数,并考虑表型中特定研究变异的来源。为了利用这种建模策略,使用了一个表型参考面板作为补充样本,该样本包含所有行为测量的完整数据。一项模拟研究表明,在BFIM中对基因变异效应进行的大型分析比在特定队列项目总和分数上对基因效应进行的荟萃分析更具效能。在大多数模拟条件下,保存BFIM中的因子分数并将其用作荟萃分析的结果也比总和分数更具效能,但这种方法会引入一定程度的偏差。参考面板对于实现这些效能提升是必要的。一项实证示范使用BFIM对荷兰双胞胎登记处和瑞典儿童与青少年双胞胎研究中9岁儿童的攻击分数进行了协调,为将BFIM应用于一系列不同表型提供了模板。荷兰双胞胎登记处的补充数据收集用作两个队列间表型建模的参考面板。我们的结果表明,基于模型的复杂性状研究协调是基因联盟中一个有用的步骤。