Li Zilu, Li Qingyun, Li Xiaoqing, Luo Wei, Guo Haiyan, Zhao Chunyan, Yang Canzhen, Xie Anke, Hu Kai, Guo Yangfan
Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China.
Central Laboratory, Yan'an Hospital Affiliated to Kunming Medical University, Kunming, China.
PLoS One. 2025 Jun 2;20(6):e0325050. doi: 10.1371/journal.pone.0325050. eCollection 2025.
Alzheimer's disease (AD) etiology is complex, influenced by demographic risk factors such as age, sex, and educational level, alongside multi-omics factors derived from genomics, transcriptomics, and epigenomics. Advancements in multi-omics technology present both challenges and opportunities for AD diagnosis, enabling a more comprehensive understanding of the complex interactions among contributing factors, with the goal of improving diagnostic accuracy. To address this challenge, we propose a novel feature fusion approach in this study, AD-GCN, which integrates multi-omics data and their interaction networks to achieve more precise diagnosis and analysis of AD. In this study, we applied polygenic risk score and random forest algorithms for feature selection on genetic variation and methylation data. We then developed an AD-GCN for both multi-omics and single-omics classification tasks and compared its performance with that of machine learning ensemble methods. The experimental results demonstrated that multi-omics classification significantly outperformed single-omics classification, with AD-GCN surpassing the machine-learning ensembles. These findings highlight AD-GCN's strong potential to enhance AD diagnosis and improve accuracy in differentiating disease stages by integrating interactions across omics data, laying a solid foundation for the development of more precise and personalized AD diagnostic models.
阿尔茨海默病(AD)的病因复杂,受年龄、性别和教育水平等人口统计学风险因素影响,同时还受源自基因组学、转录组学和表观基因组学的多组学因素影响。多组学技术的进步给AD诊断带来了挑战和机遇,有助于更全面地理解致病因素之间的复杂相互作用,目标是提高诊断准确性。为应对这一挑战,我们在本研究中提出了一种新颖的特征融合方法AD-GCN,它整合了多组学数据及其相互作用网络,以实现对AD更精确的诊断和分析。在本研究中,我们应用多基因风险评分和随机森林算法对基因变异和甲基化数据进行特征选择。然后,我们针对多组学和单组学分类任务开发了AD-GCN,并将其性能与机器学习集成方法进行比较。实验结果表明,多组学分类显著优于单组学分类,AD-GCN超过了机器学习集成方法。这些发现凸显了AD-GCN通过整合组学数据间的相互作用来增强AD诊断和提高区分疾病阶段准确性的强大潜力,为开发更精确和个性化的AD诊断模型奠定了坚实基础。