Abir Abrar Rahman, Dip Sajib Acharjee, Zhang Liqing
Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1000, Bangladesh.
Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, United States.
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf438.
Alzheimer's Disease (AD) is a progressive neurodegenerative disorder, posing a growing public health challenge. Traditional machine learning models for AD prediction have relied on single omics data or phenotypic assessments, limiting their ability to capture the disease's molecular complexity and resulting in poor performance. Recent advances in high-throughput multi-omics have provided deeper biological insights. However, due to the scarcity of paired omics datasets, existing multi-omics AD prediction models rely on unpaired omics data, where different omics profiles are combined without being derived from the same biological sample, leading to biologically less meaningful pairings and causing less accurate predictions. To address these issues, we propose UnCOT-AD, a novel deep learning framework for Unpaired Cross-Omics Translation enabling effective multi-omics integration for AD prediction. Our method introduces the first-ever cross-omics translation model trained on unpaired omics datasets, using two coupled Variational Autoencoders and a novel cycle consistency mechanism to ensure accurate bidirectional translation between omics types. We integrate adversarial training to ensure that the generated omics profiles are biologically realistic. Moreover, we employ contrastive learning to capture the disease specific patterns in latent space to make the cross-omics translation more accurate and biologically relevant. We rigorously validate UnCOT-AD on both cross-omics translation and AD prediction tasks. Results show that UnCOT-AD empowers multi-omics based AD prediction by combining real omics profiles with corresponding omics profiles generated by our cross-omics translation module and achieves state-of-the-art performance in accuracy and robustness. Source code is available at https://github.com/abrarrahmanabir/UnCOT-AD.
阿尔茨海默病(AD)是一种进行性神经退行性疾病,对公共卫生构成了日益严峻的挑战。用于AD预测的传统机器学习模型依赖于单一组学数据或表型评估,限制了它们捕捉该疾病分子复杂性的能力,导致预测性能不佳。高通量多组学的最新进展提供了更深入的生物学见解。然而,由于配对组学数据集的稀缺,现有的多组学AD预测模型依赖于非配对组学数据,即不同的组学特征在未来自同一生物样本的情况下进行组合,导致生物学意义上的配对较少,预测准确性较低。为了解决这些问题,我们提出了UnCOT-AD,这是一种用于非配对跨组学翻译的新型深度学习框架,能够有效地整合多组学数据用于AD预测。我们的方法引入了首个在非配对组学数据集上训练的跨组学翻译模型,使用两个耦合的变分自编码器和一种新颖的循环一致性机制来确保组学类型之间的准确双向翻译。我们整合了对抗训练以确保生成的组学特征在生物学上是现实的。此外,我们采用对比学习来捕捉潜在空间中疾病特异性模式,以使跨组学翻译更加准确且与生物学相关。我们在跨组学翻译和AD预测任务上对UnCOT-AD进行了严格验证。结果表明,UnCOT-AD通过将真实组学特征与我们的跨组学翻译模块生成的相应组学特征相结合,增强了基于多组学的AD预测,并在准确性和稳健性方面达到了当前最优性能。源代码可在https://github.com/abrarrahmanabir/UnCOT-AD获取。