• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人工智能的阿尔茨海默病全基因组关联相关性评估及多基因风险评分预测

AI-enabled evaluation of genome-wide association relevance and polygenic risk score prediction in Alzheimer's disease.

作者信息

Platt Daniel E, Guzmán-Sáenz Aldo, Bose Aritra, Saha Subrata, Utro Filippo, Parida Laxmi

机构信息

IBM T. J. Watson Research Center, Yorktown Heights, New York, NY, USA.

Pfizer, Pearl River, New York, NY, USA.

出版信息

iScience. 2024 Feb 12;27(3):109209. doi: 10.1016/j.isci.2024.109209. eCollection 2024 Mar 15.

DOI:10.1016/j.isci.2024.109209
PMID:38439972
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10910245/
Abstract

GWAS focuses on significance loosing false positives; machine learning probes sub-significant features relying on predictivity. Yet, these are far from orthogonal. We sought to explore how these inform each other in sub-genome-wide significant situations to define relevance for predictive features. We introduce the SVM-based RubricOE that selects heavily cross-validated feature sets, and LDpred2 PRS as a strong contrast to SVM, to explore significance and predictivity. Our Alzheimer's test case notoriously lacks strong genetic signals except for few very strong phenotype-SNP associations, which suits the problem we are exploring. We found that the most significant SNPs among ML and PRS-selected SNPs captured most of the predictivity, while weaker associations tend also to contribute weakly to predictivity. SNPs with weak associations tend not to contribute to predictivity, but deletion of these features does not injure it. Significance provides a ranking that helps identify weakly predictive features.

摘要

全基因组关联研究(GWAS)侧重于减少假阳性的显著性;机器学习则依靠预测性来探究次显著性特征。然而,这两者并非完全正交。我们试图探索在亚基因组范围显著的情况下,它们如何相互补充以确定预测特征的相关性。我们引入了基于支持向量机(SVM)的RubricOE,它选择经过大量交叉验证的特征集,以及与SVM形成强烈对比的LDpred2多基因风险评分(PRS),以探究显著性和预测性。我们以阿尔茨海默病为例,除了少数非常强的表型-单核苷酸多态性(SNP)关联外,该病例明显缺乏强遗传信号,这适合我们正在探索的问题。我们发现,机器学习和PRS选择的SNP中最显著的SNP捕获了大部分预测性,而较弱的关联对预测性的贡献往往也较弱。关联较弱的SNP往往对预测性没有贡献,但删除这些特征不会损害预测性。显著性提供了一个排名,有助于识别预测性较弱的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/57fa70576284/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/77910ae7bad7/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/67460d4593fe/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/2664135e655b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/57fa70576284/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/77910ae7bad7/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/67460d4593fe/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/2664135e655b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/381a/10910245/57fa70576284/gr4.jpg

相似文献

1
AI-enabled evaluation of genome-wide association relevance and polygenic risk score prediction in Alzheimer's disease.基于人工智能的阿尔茨海默病全基因组关联相关性评估及多基因风险评分预测
iScience. 2024 Feb 12;27(3):109209. doi: 10.1016/j.isci.2024.109209. eCollection 2024 Mar 15.
2
Robust SNP-based prediction of rheumatoid arthritis through machine-learning-optimized polygenic risk score.通过机器学习优化的多基因风险评分实现类风湿关节炎的稳健 SNP 预测。
J Transl Med. 2023 Feb 7;21(1):92. doi: 10.1186/s12967-023-03939-5.
3
Improving the Utility of Polygenic Risk Scores as a Biomarker for Alzheimer's Disease.提高多基因风险评分作为阿尔茨海默病生物标志物的效用。
Cells. 2021 Jun 29;10(7):1627. doi: 10.3390/cells10071627.
4
Polygenic Risk Scores in Alzheimer's Disease: Current Applications and Future Directions.阿尔茨海默病的多基因风险评分:当前应用与未来方向
Front Digit Health. 2020 Aug 11;2:14. doi: 10.3389/fdgth.2020.00014. eCollection 2020.
5
A machine learning pipeline for quantitative phenotype prediction from genotype data.基于基因型数据的定量表型预测的机器学习管道。
BMC Bioinformatics. 2010 Oct 26;11 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2105-11-S8-S3.
6
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
7
Plasma biomarkers and genetics in the diagnosis and prediction of Alzheimer's disease.血浆生物标志物和遗传学在阿尔茨海默病的诊断和预测中的应用。
Brain. 2023 Feb 13;146(2):690-699. doi: 10.1093/brain/awac128.
8
Polygenic Risk Scores in Alzheimer's Disease Genetics: Methodology, Applications, Inclusion, and Diversity.多基因风险评分在阿尔茨海默病遗传学中的应用:方法学、应用、纳入和多样性。
J Alzheimers Dis. 2022;89(1):1-12. doi: 10.3233/JAD-220025.
9
The Relative Power of Structural Genomic Variation versus SNPs in Explaining the Quantitative Trait Growth in the Marine Teleost .结构基因组变异与单核苷酸多态性在解释海洋硬骨鱼类数量性状生长中的相对作用
Genes (Basel). 2022 Jun 23;13(7):1129. doi: 10.3390/genes13071129.
10
Cognitive Decline in Alzheimer's Disease: Limited Clinical Utility for GWAS or Polygenic Risk Scores in a Clinical Trial Setting.阿尔茨海默病认知衰退:GWAS 或多基因风险评分在临床试验环境中的临床应用有限。
Genes (Basel). 2020 May 2;11(5):501. doi: 10.3390/genes11050501.

引用本文的文献

1
A foundation model for learning genetic associations from brain imaging phenotypes.一种用于从脑成像表型中学习基因关联的基础模型。
Bioinform Adv. 2025 Aug 13;5(1):vbaf196. doi: 10.1093/bioadv/vbaf196. eCollection 2025.
2
Genome-wide association neural networks identify genes linked to family history of Alzheimer's disease.全基因组关联神经网络识别出与阿尔茨海默病家族史相关的基因。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae704.

本文引用的文献

1
PToPI: A Comprehensive Review, Analysis, and Knowledge Representation of Binary Classification Performance Measures/Metrics.PToPI:二元分类性能度量/指标的全面综述、分析与知识表示
SN Comput Sci. 2023;4(1):13. doi: 10.1007/s42979-022-01409-1. Epub 2022 Oct 16.
2
New insights into the genetic etiology of Alzheimer's disease and related dementias.阿尔茨海默病及相关痴呆症的遗传学病因新见解。
Nat Genet. 2022 Apr;54(4):412-436. doi: 10.1038/s41588-022-01024-z. Epub 2022 Apr 4.
3
Impact of Clinical and Genomic Factors on COVID-19 Disease Severity.
临床和基因组因素对 COVID-19 疾病严重程度的影响。
AMIA Annu Symp Proc. 2022 Feb 21;2021:378-387. eCollection 2021.
4
Fast two-stage phasing of large-scale sequence data.大规模序列数据的快速两阶段相位测定。
Am J Hum Genet. 2021 Oct 7;108(10):1880-1890. doi: 10.1016/j.ajhg.2021.08.005. Epub 2021 Sep 2.
5
A Machine Learning Method to Identify Genetic Variants Potentially Associated With Alzheimer's Disease.一种用于识别可能与阿尔茨海默病相关的基因变异的机器学习方法。
Front Genet. 2021 Jun 14;12:647436. doi: 10.3389/fgene.2021.647436. eCollection 2021.
6
Genome-wide epistasis analysis for Alzheimer's disease and implications for genetic risk prediction.全基因组上位性分析在阿尔茨海默病中的作用及其对遗传风险预测的影响。
Alzheimers Res Ther. 2021 Mar 4;13(1):55. doi: 10.1186/s13195-021-00794-8.
7
LDpred2: better, faster, stronger.LDpred2:更优、更快、更强。
Bioinformatics. 2021 Apr 1;36(22-23):5424-5431. doi: 10.1093/bioinformatics/btaa1029.
8
What Can Machine Learning Approaches in Genomics Tell Us about the Molecular Basis of Amyotrophic Lateral Sclerosis?基因组学中的机器学习方法能让我们了解肌萎缩侧索硬化症的分子基础吗?
J Pers Med. 2020 Nov 26;10(4):247. doi: 10.3390/jpm10040247.
9
Risk prediction of late-onset Alzheimer's disease implies an oligogenic architecture.迟发性阿尔茨海默病的风险预测提示其具有寡基因结构。
Nat Commun. 2020 Sep 23;11(1):4799. doi: 10.1038/s41467-020-18534-1.
10
Genetic meta-analysis of diagnosed Alzheimer's disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing.基于诊断的阿尔茨海默病的全基因组关联荟萃分析鉴定出新的风险位点,并提示 Aβ、tau、免疫和脂类代谢过程的作用。
Nat Genet. 2019 Mar;51(3):414-430. doi: 10.1038/s41588-019-0358-2. Epub 2019 Feb 28.