National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Department of Communications and Electronics, Delta Higher Institute for Engineering and Technology (DHIET), Mansoura 35516, Egypt.
National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China.
J Biomed Inform. 2021 Sep;121:103863. doi: 10.1016/j.jbi.2021.103863. Epub 2021 Jul 3.
Alzheimer's disease (AD) is a severe irreversible neurodegenerative disease that has great sufferings on patients and eventually leads to death. Early detection of AD and its prodromal stage, mild cognitive impairment (MCI) which can be either stable (sMCI) or progressive (pMCI), is highly desirable for effective treatment planning and tailoring therapy. Recent studies recommended using multimodal data fusion of genetic (single nucleotide polymorphisms, SNPs) and neuroimaging data (magnetic resonance imaging (MRI) and positron emission tomography (PET)) to discriminate AD/MCI from normal control (NC) subjects. However, missing multimodal data in the cohort under study is inevitable. In addition, data heterogeneity between phenotypes and genotypes biomarkers makes learning capability of the models more challenging. Also, the current studies mainly focus on identifying brain disease classification and ignoring the regression task. Furthermore, they utilize multistage for predicting the brain disease progression. To address these issues, we propose a novel multimodal neuroimaging and genetic data fusion for joint classification and clinical score regression tasks using the maximum number of available samples in one unified framework using convolutional neural network (CNN). Specifically, we initially perform a technique based on linear interpolation to fill the missing features for each incomplete sample. Then, we learn the neuroimaging features from MRI, PET, and SNPs using CNN to alleviate the heterogeneity among genotype and phenotype data. Meanwhile, the high learned features from each modality are combined for jointly identifying brain diseases and predicting clinical scores. To validate the performance of the proposed method, we test our method on 805 subjects from Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. Also, we verify the similarity between the synthetic and real data using statistical analysis. Moreover, the experimental results demonstrate that the proposed method can yield better performance in both classification and regression tasks. Specifically, our proposed method achieves accuracy of 98.22%, 93.11%, and 97.35% for NC vs. AD, NC vs. sMCI, and NC vs. pMCI, respectively. On the other hand, our method attains the lowest root mean square error and the highest correlation coefficient for different clinical scores regression tasks compared with the state-of-the-art methods.
阿尔茨海默病(AD)是一种严重的不可逆神经退行性疾病,给患者带来极大的痛苦,最终导致死亡。早期发现 AD 及其前驱阶段轻度认知障碍(MCI),MCI 可以是稳定的(sMCI)或进展性的(pMCI),对于有效的治疗计划和定制治疗非常重要。最近的研究建议使用遗传(单核苷酸多态性,SNP)和神经影像学数据(磁共振成像(MRI)和正电子发射断层扫描(PET))的多模态数据融合来区分 AD/MCI 与正常对照(NC)受试者。然而,在所研究的队列中不可避免地会出现多模态数据缺失的情况。此外,表型和基因型生物标志物之间的数据异质性使得模型的学习能力更具挑战性。此外,目前的研究主要集中在识别脑疾病分类上,而忽略了回归任务。此外,他们利用多阶段来预测脑疾病的进展。为了解决这些问题,我们提出了一种新的多模态神经影像学和遗传数据融合方法,用于使用一个统一的框架联合分类和临床评分回归任务,使用最大数量的可用样本,使用卷积神经网络(CNN)。具体来说,我们首先使用基于线性插值的技术来填充每个不完整样本的缺失特征。然后,我们使用 CNN 从 MRI、PET 和 SNP 中学习神经影像学特征,以减轻基因型和表型数据之间的异质性。同时,从每个模态中学习到的高特征被组合用于联合识别脑疾病和预测临床评分。为了验证所提出方法的性能,我们在阿尔茨海默病神经影像学倡议(ADNI)数据集的 805 个受试者上测试了我们的方法。此外,我们还使用统计分析来验证合成数据和真实数据之间的相似性。此外,实验结果表明,所提出的方法在分类和回归任务中都能取得更好的性能。具体来说,我们的方法在 NC 与 AD、NC 与 sMCI 和 NC 与 pMCI 之间的分类任务中分别达到了 98.22%、93.11%和 97.35%的准确率。另一方面,与最先进的方法相比,我们的方法在不同的临床评分回归任务中达到了最低的均方根误差和最高的相关系数。