通过机器学习揭示全基因组中的阿尔茨海默病基因谱。

Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.

作者信息

Huang Xiaoyan, Liu Hankui, Li Xinming, Guan Liping, Li Jiankang, Tellier Laurent Christian Asker M, Yang Huanming, Wang Jian, Zhang Jianguo

机构信息

BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China.

BGI-Shenzhen, Shenzhen, 518083, China.

出版信息

BMC Neurol. 2018 Jan 10;18(1):5. doi: 10.1186/s12883-017-1010-3.

DOI:10.1186/s12883-017-1010-3

PMID:29320986

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5763548/

Abstract

BACKGROUND

Alzheimer's disease (AD) is an important, progressive neurodegenerative disease, with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes, and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, the prediction methods for AD related genes has been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate genes predictions.

METHODS

We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome.

RESULTS

We classified AD candidate genes with an accuracy and the area under the receiver operating characteristic (ROC) curve of 84.56% and 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes in a genome wide scale.

CONCLUSIONS

In this study, we have elucidated the whole-genome spectrum of AD, using a machine learning approach. Through this method, we expect for the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers.

摘要

背景

阿尔茨海默病（AD）是一种重要的进行性神经退行性疾病，具有复杂的遗传结构。生物医学研究的一个关键目标是寻找疾病风险基因，并阐明这些风险基因在疾病发展中的功能。为此，有必要扩大与AD相关的基因集。在过去的研究中，AD相关基因的预测方法在探索目标基因组区域方面受到限制。我们在此提出一种全基因组范围内预测AD候选基因的方法。

方法

我们提出一种机器学习方法（支持向量机），该方法基于整合基因表达数据和人脑特异性基因网络数据，以发现全基因组范围内AD基因的全貌。

结果

我们对AD候选基因进行分类的准确率和受试者工作特征（ROC）曲线下面积分别为84.56%和94%。我们的方法为从全基因组范围内20000多个基因中提取的AD相关基因谱提供了补充。

结论

在本研究中，我们使用机器学习方法阐明了AD的全基因组谱。通过这种方法，我们期望候选基因目录能为研究人员提供更全面的AD注释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06c8/5763548/ceb971bbabd7/12883_2017_1010_Fig1_HTML.jpg

相似文献

Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.

BMC Neurol. 2018 Jan 10;18(1):5. doi: 10.1186/s12883-017-1010-3.

Integrating network, sequence and functional features using machine learning approaches towards identification of novel Alzheimer genes.

BMC Genomics. 2016 Oct 18;17(1):807. doi: 10.1186/s12864-016-3108-1.

Classifying Alzheimer's disease and normal subjects using machine learning techniques and genetic-environmental features.

J Formos Med Assoc. 2024 Jun;123(6):701-709. doi: 10.1016/j.jfma.2023.10.021. Epub 2023 Dec 2.

Distinguishing early and late brain aging from the Alzheimer's disease spectrum: consistent morphological patterns across independent samples.

Neuroimage. 2017 Sep;158:282-295. doi: 10.1016/j.neuroimage.2017.06.070. Epub 2017 Jun 27.

Uncovering the Impact of Aggrephagy in the Development of Alzheimer's Disease: Insights Into Diagnostic and Therapeutic Approaches from Machine Learning Analysis.

Curr Alzheimer Res. 2023;20(9):618-635. doi: 10.2174/0115672050280894231214063023.

Identification of Blood-Based Glycolysis Gene Associated with Alzheimer's Disease by Integrated Bioinformatics Analysis.

J Alzheimers Dis. 2021;83(1):163-178. doi: 10.3233/JAD-210540.

Novel Cortical Thickness Pattern for Accurate Detection of Alzheimer's Disease.

J Alzheimers Dis. 2015;48(4):995-1008. doi: 10.3233/JAD-150311.

Comparative analysis of machine learning algorithms for Alzheimer's disease classification using EEG signals and genetic information.

Comput Biol Med. 2024 Jun;176:108621. doi: 10.1016/j.compbiomed.2024.108621. Epub 2024 May 17.

Application of advanced machine learning methods on resting-state fMRI network for identification of mild cognitive impairment and Alzheimer's disease.

Brain Imaging Behav. 2016 Sep;10(3):799-817. doi: 10.1007/s11682-015-9448-7.

Genome-wide haplotype association study identify TNFRSF1A, CASP7, LRP1B, CDH1 and TG genes associated with Alzheimer's disease in Caribbean Hispanic individuals.

Oncotarget. 2015 Dec 15;6(40):42504-14. doi: 10.18632/oncotarget.6391.

引用本文的文献

Machine Learning-Based Alzheimer's Disease Stage Diagnosis Utilizing Blood Gene Expression and Clinical Data: A Comparative Investigation.

Diagnostics (Basel). 2025 Jan 17;15(2):211. doi: 10.3390/diagnostics15020211.

Etiology of Late-Onset Alzheimer's Disease, Biomarker Efficacy, and the Role of Machine Learning in Stage Diagnosis.

Diagnostics (Basel). 2024 Nov 23;14(23):2640. doi: 10.3390/diagnostics14232640.

AlzGenPred - CatBoost-based gene classifier for predicting Alzheimer's disease using high-throughput sequencing data.

Sci Rep. 2024 Dec 5;14(1):30294. doi: 10.1038/s41598-024-82208-x.

G-Protein Signaling in Alzheimer's Disease: Spatial Expression Validation of Semi-supervised Deep Learning-Based Computational Framework.

J Neurosci. 2024 Nov 6;44(45):e0587242024. doi: 10.1523/JNEUROSCI.0587-24.2024.

The Construction of a Multidomain Risk Model of Alzheimer's Disease and Related Dementias.

J Alzheimers Dis. 2023;96(2):535-550. doi: 10.3233/JAD-221292.

Identifying Effective Feature Selection Methods for Alzheimer's Disease Biomarker Gene Detection Using Machine Learning.

Diagnostics (Basel). 2023 May 17;13(10):1771. doi: 10.3390/diagnostics13101771.

Machine learning prediction and tau-based screening identifies potential Alzheimer's disease genes relevant to immunity.

Commun Biol. 2022 Feb 11;5(1):125. doi: 10.1038/s42003-022-03068-7.

Improving the Classification of Alzheimer's Disease Using Hybrid Gene Selection Pipeline and Deep Learning.

Front Genet. 2021 Nov 12;12:784814. doi: 10.3389/fgene.2021.784814. eCollection 2021.

TissueNexus: a database of human tissue functional gene networks built with a large compendium of curated RNA-seq data.

Nucleic Acids Res. 2022 Jan 7;50(D1):D710-D718. doi: 10.1093/nar/gkab1133.

Identification of Dysregulated Genes for Late-Onset Alzheimer's Disease Using Gene Expression Data in Brain.

J Alzheimers Dis Parkinsonism. 2020;10(6). Epub 2020 Oct 23.

本文引用的文献

Expression of Alzheimer's disease risk genes in ischemic brain degeneration.

Pharmacol Rep. 2016 Dec;68(6):1345-1349. doi: 10.1016/j.pharep.2016.09.006. Epub 2016 Sep 5.

Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information.

BMC Genomics. 2016 Aug 18;17 Suppl 4(Suppl 4):433. doi: 10.1186/s12864-016-2795-y.

Genome-wide prediction and functional characterization of the genetic basis of autism spectrum disorder.

Nat Neurosci. 2016 Nov;19(11):1454-1462. doi: 10.1038/nn.4353. Epub 2016 Aug 1.

Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain.

Science. 2016 Jun 24;352(6293):1586-90. doi: 10.1126/science.aaf1204.

The Ensembl gene annotation system.

Database (Oxford). 2016 Jun 23;2016. doi: 10.1093/database/baw093. Print 2016.

Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review.

Front Physiol. 2016 Mar 8;7:75. doi: 10.3389/fphys.2016.00075. eCollection 2016.

Establishing the precise evolutionary history of a gene improves prediction of disease-causing missense mutations.

Genet Med. 2016 Oct;18(10):1029-36. doi: 10.1038/gim.2015.208. Epub 2016 Feb 18.

Blood-Borne Activity-Dependent Neuroprotective Protein (ADNP) is Correlated with Premorbid Intelligence, Clinical Stage, and Alzheimer's Disease Biomarkers.

J Alzheimers Dis. 2016;50(1):249-60. doi: 10.3233/JAD-150799.

Genome-wide haplotype association study identify TNFRSF1A, CASP7, LRP1B, CDH1 and TG genes associated with Alzheimer's disease in Caribbean Hispanic individuals.

Oncotarget. 2015 Dec 15;6(40):42504-14. doi: 10.18632/oncotarget.6391.

Risk prediction for sporadic Alzheimer's disease using genetic risk score in the Han Chinese population.

Oncotarget. 2015 Nov 10;6(35):36955-64. doi: 10.18632/oncotarget.6271.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr
超能文献

通过机器学习揭示全基因组中的阿尔茨海默病基因谱。

Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.

作者信息

机构信息