Iqbal Zafar, Rahman Md Mahfuzur, Mahmood Usman, Zia Qasim, Fu Zening, Calhoun Vince D, Plis Sergey
Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA.
Center for Translational Research in Neuroimaging and Data Science (TReNDS), Atlanta, GA 30303, USA.
Brain Sci. 2025 Jan 11;15(1):60. doi: 10.3390/brainsci15010060.
Functional magnetic resonance imaging data pose significant challenges due to their inherently noisy and complex nature, making traditional statistical models less effective in capturing predictive features. While deep learning models offer superior performance through their non-linear capabilities, they often lack transparency, reducing trust in their predictions. This study introduces the Time Reversal (TR) pretraining method to address these challenges. TR aims to learn temporal dependencies in data, leveraging large datasets for pretraining and applying this knowledge to improve schizophrenia classification on smaller datasets.
We pretrained an LSTM-based model with attention using the TR approach, focusing on learning the direction of time in fMRI data, achieving over 98 % accuracy on HCP and UK Biobank datasets. For downstream schizophrenia classification, TR-pretrained weights were transferred to models evaluated on FBIRN, COBRE, and B-SNIP datasets. Saliency maps were generated using Integrated Gradients (IG) to provide post hoc explanations for pretraining, while Earth Mover's Distance (EMD) quantified the temporal dynamics of salient features in the downstream tasks.
TR pretraining significantly improved schizophrenia classification performance across all datasets: median AUC scores increased from 0.7958 to 0.8359 (FBIRN), 0.6825 to 0.7778 (COBRE), and 0.6341 to 0.7224 (B-SNIP). The saliency maps revealed more concentrated and biologically meaningful salient features along the time axis, aligning with the episodic nature of schizophrenia. TR consistently outperformed baseline pretraining methods, including OCP and PCL, in terms of AUC, balanced accuracy, and robustness.
This study demonstrates the dual benefits of the TR method: enhanced predictive performance and improved interpretability. By aligning model predictions with meaningful temporal patterns in brain activity, TR bridges the gap between deep learning and clinical relevance. These findings emphasize the potential of explainable AI tools for aiding clinicians in diagnostics and treatment planning, especially in conditions characterized by disrupted temporal dynamics.
功能磁共振成像数据因其固有的噪声和复杂性质带来了重大挑战,使得传统统计模型在捕捉预测特征方面效果欠佳。虽然深度学习模型通过其非线性能力表现出卓越性能,但它们往往缺乏透明度,降低了对其预测结果的信任度。本研究引入时间反转(TR)预训练方法来应对这些挑战。TR旨在学习数据中的时间依赖性,利用大型数据集进行预训练,并将此知识应用于提高在较小数据集上的精神分裂症分类准确率。
我们使用TR方法对基于注意力机制的长短期记忆网络(LSTM)模型进行预训练,着重学习功能磁共振成像数据中的时间方向,在人类连接组计划(HCP)和英国生物银行数据集上实现了超过98%的准确率。对于下游的精神分裂症分类任务,将经过TR预训练的权重转移到在功能磁共振成像研究网络(FBIRN)、临床与转化科学合作中心(COBRE)和脑影像精神分裂症神经影像学计划(B-SNIP)数据集上评估的模型中。使用积分梯度(IG)生成显著性图,为预训练提供事后解释,同时使用推土机距离(EMD)量化下游任务中显著特征的时间动态变化。
TR预训练显著提高了所有数据集上的精神分裂症分类性能:中位数曲线下面积(AUC)得分从0.7958提高到0.8359(FBIRN),从0.6825提高到0.7778(COBRE),从0.6341提高到0.7224(B-SNIP)。显著性图显示,沿着时间轴有更集中且具有生物学意义的显著特征,这与精神分裂症的发作性本质相符。在AUC、平衡准确率和稳健性方面,TR始终优于包括OCP和PCL在内的基线预训练方法。
本研究证明了TR方法的双重益处:增强预测性能和提高可解释性。通过使模型预测与大脑活动中有意义的时间模式对齐,TR弥合了深度学习与临床相关性之间的差距。这些发现强调了可解释人工智能工具在辅助临床医生进行诊断和治疗规划方面的潜力,特别是在以时间动态紊乱为特征的病症中。