Shen Dan, Zhu Hongtu
Inf Process Med Imaging. 2015;24:758-69. doi: 10.1007/978-3-319-19992-4_60.
We consider the problem of using high dimensional data residing on graphs to predict a low-dimensional outcome variable, such as disease status. Examples of data include time series and genetic data measured on linear graphs and imaging data measured on triangulated graphs (or lattices), among many others. Many of these data have two key features including spatial smoothness and intrinsically low dimensional structure. We propose a simple solution based on a general statistical framework, called spatially weighted principal component regression (SWPCR). In SWPCR, we introduce two sets of weights including importance score weights for the selection of individual features at each node and spatial weights for the incorporation of the neighboring pattern on the graph. We integrate the importance score weights with the spatial weights in order to recover the low dimensional structure of high dimensional data. We demonstrate the utility of our methods through extensive simulations and a real data analysis based on Alzheimer's disease neuroimaging initiative data.
我们考虑利用图上的高维数据来预测低维结果变量(如疾病状态)的问题。数据示例包括在线性图上测量的时间序列和基因数据,以及在三角剖分图(或晶格)上测量的成像数据等众多数据。这些数据中的许多都具有两个关键特征,即空间平滑性和内在的低维结构。我们基于一个称为空间加权主成分回归(SWPCR)的通用统计框架提出了一种简单的解决方案。在SWPCR中,我们引入了两组权重,包括用于在每个节点选择单个特征的重要性得分权重和用于纳入图上邻域模式的空间权重。我们将重要性得分权重与空间权重相结合,以恢复高维数据的低维结构。我们通过广泛的模拟以及基于阿尔茨海默病神经影像倡议数据的真实数据分析来证明我们方法的实用性。