整合机器学习方法以剖析阿尔茨海默病中基因推断的转录组图谱

Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer's Disease.

作者信息

Maj Carlo, Azevedo Tiago, Giansanti Valentina, Borisov Oleg, Dimitri Giovanna Maria, Spasov Simeon, Lió Pietro, Merelli Ivan

机构信息

Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany.

Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom.

出版信息

Front Genet. 2019 Sep 3;10:726. doi: 10.3389/fgene.2019.00726. eCollection 2019.

DOI:10.3389/fgene.2019.00726

PMID:31552082

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6735530/

Abstract

The genetic component of many common traits is associated with the gene expression and several variants act as expression quantitative loci, regulating the gene expression in a tissue specific manner. In this work, we applied tissue-specific cis-eQTL gene expression prediction models on the genotype of 808 samples including controls, subjects with mild cognitive impairment, and patients with Alzheimer's Disease. We then dissected the imputed transcriptomic profiles by means of different unsupervised and supervised machine learning approaches to identify potential biological associations. Our analysis suggests that unsupervised and supervised methods can provide complementary information, which can be integrated for a better characterization of the underlying biological system. In particular, a variational autoencoder representation of the transcriptomic profiles, followed by a support vector machine classification, has been used for tissue-specific gene prioritizations. Interestingly, the achieved gene prioritizations can be efficiently integrated as a feature selection step for improving the accuracy of deep learning classifier networks. The identified gene-tissue information suggests a potential role for inflammatory and regulatory processes in gut-brain axis related tissues. In line with the expected low heritability that can be apportioned to eQTL variants, we were able to achieve only relatively low prediction capability with deep learning classification models. However, our analysis revealed that the classification power strongly depends on the network structure, with recurrent neural networks being the best performing network class. Interestingly, cross-tissue analysis suggests a potentially greater role of models trained in brain tissues also by considering dementia-related endophenotypes. Overall, the present analysis suggests that the combination of supervised and unsupervised machine learning techniques can be used for the evaluation of high dimensional omics data.

摘要

许多常见性状的遗传成分与基因表达相关，一些变异体作为表达数量性状位点，以组织特异性方式调节基因表达。在这项研究中，我们将组织特异性顺式表达数量性状基因座（cis-eQTL）基因表达预测模型应用于808个样本的基因型，这些样本包括对照组、轻度认知障碍受试者和阿尔茨海默病患者。然后，我们通过不同的无监督和有监督机器学习方法剖析估算的转录组图谱，以识别潜在的生物学关联。我们的分析表明，无监督和有监督方法可以提供互补信息，可将这些信息整合起来以更好地表征潜在的生物系统。特别是，转录组图谱的变分自编码器表示，随后进行支持向量机分类，已用于组织特异性基因优先级排序。有趣的是，所实现的基因优先级排序可以有效地作为特征选择步骤进行整合，以提高深度学习分类器网络的准确性。所识别的基因-组织信息表明炎症和调节过程在肠-脑轴相关组织中具有潜在作用。与可归因于表达数量性状位点变异体的预期低遗传力一致，我们使用深度学习分类模型仅实现了相对较低的预测能力。然而，我们的分析表明，分类能力很大程度上取决于网络结构，循环神经网络是表现最佳的网络类别。有趣的是，跨组织分析表明，通过考虑与痴呆相关的内表型，在脑组织中训练的模型可能也具有更大的作用。总体而言，本分析表明，有监督和无监督机器学习技术的结合可用于评估高维组学数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44ca/6735530/c0480eef254d/fgene-10-00726-g001.jpg

相似文献

Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer's Disease.整合机器学习方法以剖析阿尔茨海默病中基因推断的转录组图谱

Front Genet. 2019 Sep 3;10:726. doi: 10.3389/fgene.2019.00726. eCollection 2019.

Bayesian genome-wide TWAS with reference transcriptomic data of brain and blood tissues identified 141 risk genes for Alzheimer's disease dementia.基于大脑和血液组织参考转录组数据的贝叶斯全基因组 TWAS 鉴定出 141 个阿尔茨海默病痴呆风险基因。

Alzheimers Res Ther. 2024 Jun 1;16(1):120. doi: 10.1186/s13195-024-01488-7.

Unsupervised and supervised learning with neural network for human transcriptome analysis and cancer diagnosis.基于神经网络的无监督和监督学习在人类转录组分析和癌症诊断中的应用。

Sci Rep. 2020 Nov 5;10(1):19106. doi: 10.1038/s41598-020-75715-0.

Single-slice Alzheimer's disease classification and disease regional analysis with Supervised Switching Autoencoders.基于监督切换自动编码器的单切片阿尔茨海默病分类和疾病区域分析。

Comput Biol Med. 2020 Jan;116:103527. doi: 10.1016/j.compbiomed.2019.103527. Epub 2019 Oct 31.

Enhancing the prediction of IDC breast cancer staging from gene expression profiles using hybrid feature selection methods and deep learning architecture.使用混合特征选择方法和深度学习架构增强从基因表达谱预测浸润性导管癌乳腺癌分期的能力。

Med Biol Eng Comput. 2023 Nov;61(11):2895-2919. doi: 10.1007/s11517-023-02892-1. Epub 2023 Aug 2.

Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects.用于基于磁共振成像（MRI）早期预测轻度认知障碍（MCI）患者向阿尔茨海默病转化的机器学习框架。

Neuroimage. 2015 Jan 1;104:398-412. doi: 10.1016/j.neuroimage.2014.10.002. Epub 2014 Oct 12.

A deeply supervised adaptable neural network for diagnosis and classification of Alzheimer's severity using multitask feature extraction.一种深度监督自适应神经网络，用于使用多任务特征提取进行阿尔茨海默病严重程度的诊断和分类。

PLoS One. 2024 Mar 26;19(3):e0297996. doi: 10.1371/journal.pone.0297996. eCollection 2024.

Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合，以预测放射性肺损伤。

Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

Review of Machine Learning Techniques in Soft Tissue Biomechanics and Biomaterials.机器学习技术在软组织生物力学和生物材料中的应用综述。

Cardiovasc Eng Technol. 2024 Oct;15(5):522-549. doi: 10.1007/s13239-024-00737-y. Epub 2024 Jul 2.

DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype-phenotype prediction.DeepGAMI：基于生物学的深度辅助学习的多模态整合与插补方法，以提高基因型-表型预测。

Genome Med. 2023 Oct 31;15(1):88. doi: 10.1186/s13073-023-01248-6.

引用本文的文献

A review of AI-based radiogenomics in neurodegenerative disease.基于人工智能的神经退行性疾病放射基因组学综述

Front Big Data. 2025 Feb 20;8:1515341. doi: 10.3389/fdata.2025.1515341. eCollection 2025.

Deep learning analysis of fMRI data for predicting Alzheimer's Disease: A focus on convolutional neural networks and model interpretability.用于预测阿尔茨海默病的功能磁共振成像数据的深度学习分析：聚焦卷积神经网络和模型可解释性。

PLoS One. 2024 Dec 4;19(12):e0312848. doi: 10.1371/journal.pone.0312848. eCollection 2024.

Uncovering waterlogging-responsive genes in cucumber through machine learning and differential gene correlation analysis.通过机器学习和差异基因相关性分析揭示黄瓜中的耐涝基因

Bot Stud. 2024 Aug 14;65(1):25. doi: 10.1186/s40529-024-00433-z.

Neural Computation-Based Methods for the Early Diagnosis and Prognosis of Alzheimer's Disease Not Using Neuroimaging Biomarkers: A Systematic Review.基于神经计算的不使用神经影像学生物标志物的阿尔茨海默病早期诊断和预后方法：系统评价。

J Alzheimers Dis. 2024;98(3):793-823. doi: 10.3233/JAD-231271.

MTM: a multi-task learning framework to predict individualized tissue gene expression profiles.MTM：一种用于预测个体化组织基因表达谱的多任务学习框架。

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad363.

Wide and deep learning based approaches for classification of Alzheimer's disease using genome-wide association studies.基于广泛和深度学习的方法，利用全基因组关联研究对阿尔茨海默病进行分类。

PLoS One. 2023 May 1;18(5):e0283712. doi: 10.1371/journal.pone.0283712. eCollection 2023.

A machine learning approach to analyse and predict the electric cars scenario: The Italian case.一种分析和预测电动汽车情况的机器学习方法：意大利案例。

PLoS One. 2023 Jan 20;18(1):e0279040. doi: 10.1371/journal.pone.0279040. eCollection 2023.

Guidelines for bioinformatics of single-cell sequencing data analysis in Alzheimer's disease: review, recommendation, implementation and application.阿尔茨海默病单细胞测序数据分析的生物信息学指南：综述、建议、实施和应用。

Mol Neurodegener. 2022 Mar 2;17(1):17. doi: 10.1186/s13024-022-00517-z.

Deep Learning with Neuroimaging and Genomics in Alzheimer's Disease.深度学习在阿尔茨海默病中的神经影像学和基因组学研究

Int J Mol Sci. 2021 Jul 24;22(15):7911. doi: 10.3390/ijms22157911.

Multilayer modelling of the human transcriptome and biological mechanisms of complex diseases and traits.人类转录组的多层次建模及复杂疾病和特征的生物学机制。

NPJ Syst Biol Appl. 2021 May 27;7(1):24. doi: 10.1038/s41540-021-00186-6.

本文引用的文献

Deep learning in bioinformatics: Introduction, application, and perspective in the big data era.深度学习在生物信息学中的应用：大数据时代的介绍、应用和展望。

Methods. 2019 Aug 15;166:4-21. doi: 10.1016/j.ymeth.2019.04.008. Epub 2019 Apr 22.

Recent Advances of Deep Learning in Bioinformatics and Computational Biology.深度学习在生物信息学和计算生物学中的最新进展

Front Genet. 2019 Mar 26;10:214. doi: 10.3389/fgene.2019.00214. eCollection 2019.

Lifespan Changes of the Human Brain In Alzheimer's Disease.阿尔茨海默病患者大脑的寿命变化。

Sci Rep. 2019 Mar 8;9(1):3998. doi: 10.1038/s41598-019-39809-8.

A statistical framework for cross-tissue transcriptome-wide association analysis.跨组织转录组全基因组关联分析的统计框架。

Nat Genet. 2019 Mar;51(3):568-576. doi: 10.1038/s41588-019-0345-7. Epub 2019 Feb 25.

A simple convolutional neural network for prediction of enhancer-promoter interactions with DNA sequence data.基于 DNA 序列数据的增强子-启动子相互作用预测的简单卷积神经网络。

Bioinformatics. 2019 Sep 1;35(17):2899-2906. doi: 10.1093/bioinformatics/bty1050.

Cell-Type-Specific Afferent Innervation of the Nucleus Accumbens Core and Shell.伏隔核核心区和壳区的细胞类型特异性传入神经支配

Front Neuroanat. 2018 Oct 16;12:84. doi: 10.3389/fnana.2018.00084. eCollection 2018.

Gut Microbiota and Their Neuroinflammatory Implications in Alzheimer's Disease.肠道微生物群及其在阿尔茨海默病中的神经炎症意义。

Nutrients. 2018 Nov 14;10(11):1765. doi: 10.3390/nu10111765.

Alzheimer's Biomarkers From Multiple Modalities Selectively Discriminate Clinical Status: Relative Importance of Salivary Metabolomics Panels, Genetic, Lifestyle, Cognitive, Functional Health and Demographic Risk Markers.来自多种模式的阿尔茨海默病生物标志物可选择性区分临床状态：唾液代谢组学面板、遗传、生活方式、认知、功能健康和人口统计学风险标志物的相对重要性。

Front Aging Neurosci. 2018 Oct 2;10:296. doi: 10.3389/fnagi.2018.00296. eCollection 2018.

Recurrent Neural Network for Predicting Transcription Factor Binding Sites.用于预测转录因子结合位点的递归神经网络。

Sci Rep. 2018 Oct 15;8(1):15270. doi: 10.1038/s41598-018-33321-1.

The Gut-Brain Axis in Alzheimer's Disease and Omega-3. A Critical Overview of Clinical Trials.阿尔茨海默病与ω-3 脂肪酸的肠脑轴：临床试验的批判性综述。

Nutrients. 2018 Sep 8;10(9):1267. doi: 10.3390/nu10091267.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

整合机器学习方法以剖析阿尔茨海默病中基因推断的转录组图谱

Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer's Disease.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献