Chen Jiayu, Calhoun Vince D, Pearlson Godfrey D, Perrone-Bizzozero Nora, Sui Jing, Turner Jessica A, Bustillo Juan R, Ehrlich Stefan, Sponheim Scott R, Cañive José M, Ho Beng-Choon, Liu Jingyu
Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM 87131, USA; The Mind Research Network, Albuquerque, NM 87106, USA.
Neuroimage. 2013 Dec;83:384-96. doi: 10.1016/j.neuroimage.2013.05.073. Epub 2013 May 28.
One application of imaging genomics is to explore genetic variants associated with brain structure and function, presenting a new means of mapping genetic influences on mental disorders. While there is growing interest in performing genome-wide searches for determinants, it remains challenging to identify genetic factors of small effect size, especially in limited sample sizes. In an attempt to address this issue, we propose to take advantage of a priori knowledge, specifically to extend parallel independent component analysis (pICA) to incorporate a reference (pICA-R), aiming to better reveal relationships between hidden factors of a particular attribute. The new approach was first evaluated on simulated data for its performance under different configurations of effect size and dimensionality. Then pICA-R was applied to a 300-participant (140 schizophrenia (SZ) patients versus 160 healthy controls) dataset consisting of structural magnetic resonance imaging (sMRI) and single nucleotide polymorphism (SNP) data. Guided by a reference SNP set derived from ANK3, a gene implicated by the Psychiatric Genomic Consortium SZ study, pICA-R identified one pair of SNP and sMRI components with a significant loading correlation of 0.27 (p=1.64×10(-6)). The sMRI component showed a significant group difference in loading parameters between patients and controls (p=1.33×10(-15)), indicating SZ-related reduction in gray matter concentration in prefrontal and temporal regions. The linked SNP component also showed a group difference (p=0.04) and was predominantly contributed to by 1030 SNPs. The effect of these top contributing SNPs was verified using association test results of the Psychiatric Genomic Consortium SZ study, where the 1030 SNPs exhibited significant SZ enrichment compared to the whole genome. In addition, pathway analyses indicated the genetic component majorly relating to neurotransmitter and nervous system signaling pathways. Given the simulation and experiment results, pICA-R may prove a promising multivariate approach for use in imaging genomics to discover reliable genetic risk factors under a scenario of relatively high dimensionality and small effect size.
影像基因组学的一个应用是探索与脑结构和功能相关的基因变异,为绘制基因对精神障碍的影响提供了一种新方法。虽然人们对进行全基因组搜索决定因素的兴趣日益浓厚,但识别效应量小的遗传因素仍然具有挑战性,尤其是在样本量有限的情况下。为了解决这个问题,我们建议利用先验知识,具体来说是扩展并行独立成分分析(pICA)以纳入一个参考(pICA-R),旨在更好地揭示特定属性隐藏因素之间的关系。新方法首先在模拟数据上针对不同效应量和维度配置下的性能进行评估。然后将pICA-R应用于一个包含300名参与者(140名精神分裂症(SZ)患者与160名健康对照)的数据集,该数据集由结构磁共振成像(sMRI)和单核苷酸多态性(SNP)数据组成。在来自ANK3(一个由精神基因组学联盟SZ研究涉及的基因)的参考SNP集的指导下,pICA-R识别出一对SNP和sMRI成分,其显著载荷相关性为0.27(p = 1.64×10(-6))。sMRI成分在患者和对照之间的载荷参数上显示出显著的组间差异(p = 1.33×10(-15)),表明SZ患者前额叶和颞叶区域灰质浓度降低。相关的SNP成分也显示出组间差异(p = 0.04),并且主要由1030个SNP贡献。使用精神基因组学联盟SZ研究的关联测试结果验证了这些主要贡献SNP的效应,其中这1030个SNP与全基因组相比显示出显著的SZ富集。此外,通路分析表明遗传成分主要与神经递质和神经系统信号通路相关。鉴于模拟和实验结果,pICA-R可能被证明是一种有前途的多变量方法,可用于影像基因组学,以在相对高维度和小效应量的情况下发现可靠的遗传风险因素。