Department of Mathematics, Indian Institute of Science, Bangalore, India.
Centre for Neuroscience, Indian Institute of Science, Bangalore, India.
Sci Rep. 2022 May 4;12(1):7295. doi: 10.1038/s41598-022-10459-7.
Conventional Vector Autoregressive (VAR) modelling methods applied to high dimensional neural time series data result in noisy solutions that are dense or have a large number of spurious coefficients. This reduces the speed and accuracy of auxiliary computations downstream and inflates the time required to compute functional connectivity networks by a factor that is at least inversely proportional to the true network density. As these noisy solutions have distorted coefficients, thresholding them as per some criterion, statistical or otherwise, does not alleviate the problem. Thus obtaining a sparse representation of such data is important since it provides an efficient representation of the data and facilitates its further analysis. We propose a fast Sparse Vector Autoregressive Greedy Search (SVARGS) method that works well for high dimensional data, even when the number of time points is relatively low, by incorporating only statistically significant coefficients. In numerical experiments, our methods show high accuracy in recovering the true sparse model. The relative absence of spurious coefficients permits accurate, stable and fast evaluation of derived quantities such as power spectrum, coherence and Granger causality. Consequently, sparse functional connectivity networks can be computed, in a reasonable time, from data comprising tens of thousands of channels/voxels. This enables a much higher resolution analysis of functional connectivity patterns and community structures in such large networks than is possible using existing time series methods. We apply our method to EEG data where computed network measures and community structures are used to distinguish emotional states as well as to ADHD fMRI data where it is used to distinguish children with ADHD from typically developing children.
应用于高维神经时间序列数据的传统向量自回归 (VAR) 建模方法会导致嘈杂的解决方案,这些解决方案要么密集,要么存在大量虚假系数。这会降低下游辅助计算的速度和准确性,并使计算功能连接网络所需的时间膨胀至少与真实网络密度成反比。由于这些嘈杂的解决方案具有扭曲的系数,因此根据某些标准(统计或其他标准)对其进行阈值处理并不能解决问题。因此,对这类数据进行稀疏表示非常重要,因为它提供了数据的有效表示,并促进了其进一步分析。我们提出了一种快速稀疏向量自回归贪婪搜索 (SVARGS) 方法,该方法通过仅包含统计上显著的系数,即使在时间点数量相对较低的情况下,也能很好地处理高维数据。在数值实验中,我们的方法在恢复真实稀疏模型方面表现出很高的准确性。虚假系数的相对缺乏允许对衍生量(如功率谱、相干性和格兰杰因果关系)进行准确、稳定和快速的评估。因此,可以在合理的时间内从包含数万个通道/体素的数据中计算稀疏功能连接网络。这使得能够对如此大的网络中的功能连接模式和社区结构进行比使用现有时间序列方法更高的分辨率分析。我们将我们的方法应用于 EEG 数据,其中计算的网络度量和社区结构用于区分情绪状态,以及 ADHD fMRI 数据,其中它用于区分 ADHD 儿童和典型发育儿童。