Du Lei, Zhang Tuo, Liu Kefei, Yan Jingwen, Yao Xiaohui, Risacher Shannon L, Saykin Andrew J, Han Junwei, Guo Lei, Shen Li
School of Automation, Northwestern Polytechnical University, Xi'an China.
Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA.
Inf Process Med Imaging. 2017 Jun;10265:543-555. doi: 10.1007/978-3-319-59050-9_43. Epub 2017 May 23.
Brain imaging genetics attracts more and more attention since it can reveal associations between genetic factors and the structures or functions of human brain. Sparse canonical correlation analysis (SCCA) is a powerful bi-multivariate association identification technique in imaging genetics. There have been many SCCA methods which could capture different types of structured imaging genetic relationships. These methods either use the group lasso to recover the group structure, or employ the graph/network guided fused lasso to find out the network structure. However, the group lasso methods have limitation in generalization because of the incomplete or unavailable prior knowledge in real world. The graph/network guided methods are sensitive to the sign of the sample correlation which may be incorrectly estimated. We introduce a new SCCA model using a novel graph guided pairwise group lasso penalty, and propose an efficient optimization algorithm. The proposed method has a strong upper bound for the grouping effect for both positively and negatively correlated variables. We show that our method performs better than or equally to two state-of-the-art SCCA methods on both synthetic and real neuroimaging genetics data. In particular, our method identifies stronger canonical correlations and captures better canonical loading profiles, showing its promise for revealing biologically meaningful imaging genetic associations.
脑成像遗传学越来越受到关注,因为它可以揭示遗传因素与人类大脑结构或功能之间的关联。稀疏典型相关分析(SCCA)是成像遗传学中一种强大的双多变量关联识别技术。已经有许多SCCA方法,它们可以捕捉不同类型的结构化成像遗传关系。这些方法要么使用组套索来恢复组结构,要么采用图/网络引导的融合套索来找出网络结构。然而,由于现实世界中先验知识的不完整或不可用,组套索方法在泛化方面存在局限性。图/网络引导方法对样本相关性的符号敏感,而样本相关性可能被错误估计。我们引入了一种使用新型图引导成对组套索惩罚的新SCCA模型,并提出了一种有效的优化算法。所提出的方法对于正相关和负相关变量的分组效果都有很强的上限。我们表明,在合成和真实神经成像遗传学数据上,我们的方法比两种最先进的SCCA方法表现更好或相当。特别是,我们的方法识别出更强的典型相关性,并捕捉到更好的典型载荷分布,显示出其在揭示具有生物学意义的成像遗传关联方面的前景。