• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用源自 AlphaFold2 的特征预测错义变异的致病性。

Predicting the pathogenicity of missense variants using features derived from AlphaFold2.

机构信息

Institute of Human Genetics, Bonn School of Medicine, University Hospital of Bonn, University of Bonn, Bonn, Germany.

Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany.

出版信息

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad280.

DOI:10.1093/bioinformatics/btad280
PMID:37084271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10203375/
Abstract

MOTIVATION

Missense variants are a frequent class of variation within the coding genome, and some of them cause Mendelian diseases. Despite advances in computational prediction, classifying missense variants into pathogenic or benign remains a major challenge in the context of personalized medicine. Recently, the structure of the human proteome was derived with unprecedented accuracy using the artificial intelligence system AlphaFold2. This raises the question of whether AlphaFold2 wild-type structures can improve the accuracy of computational pathogenicity prediction for missense variants.

RESULTS

To address this, we first engineered a set of features for each amino acid from these structures. We then trained a random forest to distinguish between relatively common (proxy-benign) and singleton (proxy-pathogenic) missense variants from gnomAD v3.1. This yielded a novel AlphaFold2-based pathogenicity prediction score, termed AlphScore. Important feature classes used by AlphScore are solvent accessibility, amino acid network related features, features describing the physicochemical environment, and AlphaFold2's quality parameter (predicted local distance difference test). AlphScore alone showed lower performance than existing in silico scores used for missense prediction, such as CADD or REVEL. However, when AlphScore was added to those scores, the performance increased, as measured by the approximation of deep mutational scan data, as well as the prediction of expert-curated missense variants from the ClinVar database. Overall, our data indicate that the integration of AlphaFold2-predicted structures can improve pathogenicity prediction of missense variants.

AVAILABILITY AND IMPLEMENTATION

AlphScore, combinations of AlphScore with existing scores, as well as variants used for training and testing are publicly available.

摘要

动机

错义变异是编码基因组中常见的变异类型之一,其中一些会导致孟德尔疾病。尽管在计算预测方面取得了进展,但在个性化医疗的背景下,将错义变异分类为致病性或良性仍然是一个主要挑战。最近,使用人工智能系统 AlphaFold2 以前所未有的精度推导出了人类蛋白质组的结构。这就提出了一个问题,即 AlphaFold2 的野生型结构是否可以提高计算错义变异致病性预测的准确性。

结果

为了解决这个问题,我们首先从这些结构中为每个氨基酸设计了一组特征。然后,我们使用随机森林来区分 gnomAD v3.1 中的相对常见(代理良性)和单峰(代理致病性)错义变异。这产生了一种新的基于 AlphaFold2 的致病性预测评分,称为 AlphScore。AlphScore 使用的重要特征类别包括溶剂可及性、氨基酸网络相关特征、描述物理化学环境的特征以及 AlphaFold2 的质量参数(预测局部距离差异测试)。AlphScore 本身的性能低于用于错义预测的现有计算评分,例如 CADD 或 REVEL。然而,当 AlphScore 与这些评分结合使用时,性能会提高,这可以通过对深度突变扫描数据的逼近以及对 ClinVar 数据库中专家 curated 的错义变异的预测来衡量。总体而言,我们的数据表明,整合 AlphaFold2 预测的结构可以提高错义变异的致病性预测。

可用性和实施

AlphScore、AlphScore 与现有评分的组合以及用于训练和测试的变体均可供公开使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/da3df2bc06e1/btad280f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/21e48f62fcdb/btad280f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/41e223264ec3/btad280f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/85346d24384b/btad280f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/da3df2bc06e1/btad280f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/21e48f62fcdb/btad280f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/41e223264ec3/btad280f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/85346d24384b/btad280f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b632/10203375/da3df2bc06e1/btad280f4.jpg

相似文献

1
Predicting the pathogenicity of missense variants using features derived from AlphaFold2.利用源自 AlphaFold2 的特征预测错义变异的致病性。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad280.
2
Accuracy of a machine learning method based on structural and locational information from AlphaFold2 for predicting the pathogenicity of TARDBP and FUS gene variants in ALS.基于 AlphaFold2 的结构和位置信息的机器学习方法预测 ALS 中 TARDBP 和 FUS 基因突变致病性的准确性。
BMC Bioinformatics. 2023 May 19;24(1):206. doi: 10.1186/s12859-023-05338-5.
3
AI-derived comparative assessment of the performance of pathogenicity prediction tools on missense variants of breast cancer genes.基于人工智能的乳腺癌基因错义变异致病性预测工具性能的比较评估。
Hum Genomics. 2024 Sep 11;18(1):99. doi: 10.1186/s40246-024-00667-9.
4
Curated multiple sequence alignment for the Adenomatous Polyposis Coli (APC) gene and accuracy of in silico pathogenicity predictions.精心挑选的腺瘤性结肠息肉病基因(APC)的多重序列比对和计算机预测致病性的准确性。
PLoS One. 2020 Aug 4;15(8):e0233673. doi: 10.1371/journal.pone.0233673. eCollection 2020.
5
Artificial intelligence-based recognition for variant pathogenicity of BRCA1 using AlphaFold2-predicted structures.基于人工智能的 AlphaFold2 预测结构对 BRCA1 变异致病性的识别。
Theranostics. 2023 Jan 1;13(1):391-402. doi: 10.7150/thno.79362. eCollection 2023.
6
SIGMA leverages protein structural information to predict the pathogenicity of missense variants.SIGMA 利用蛋白质结构信息来预测错义变异的致病性。
Cell Rep Methods. 2024 Jan 22;4(1):100687. doi: 10.1016/j.crmeth.2023.100687. Epub 2024 Jan 10.
7
Structure-based pathogenicity relationship identifier for predicting effects of single missense variants and discovery of higher-order cancer susceptibility clusters of mutations.基于结构的致病性关系识别器,用于预测单错义变异的影响,并发现更高阶的癌症易感性突变簇。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad206.
8
mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants.mvPPT:一种用于错义变异的高效敏感致病性预测工具。
Genomics Proteomics Bioinformatics. 2023 Apr;21(2):414-426. doi: 10.1016/j.gpb.2022.07.005. Epub 2022 Aug 5.
9
REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.REVEL:一种预测罕见错义变异致病性的集成方法。
Am J Hum Genet. 2016 Oct 6;99(4):877-885. doi: 10.1016/j.ajhg.2016.08.016. Epub 2016 Sep 22.
10
PdmIRD: missense variants pathogenicity prediction for inherited retinal diseases in a disease-specific manner.PdmIRD:以疾病特异性方式预测遗传性视网膜疾病中的错义变异致病性。
Hum Genet. 2024 Mar;143(3):331-342. doi: 10.1007/s00439-024-02645-6. Epub 2024 Mar 13.

引用本文的文献

1
Missense variant analysis in the TRPV1 ARD reveals the unexpected functional significance of a methionine.对TRPV1锚蛋白重复结构域的错义变异分析揭示了甲硫氨酸出人意料的功能意义。
PLoS One. 2025 Sep 2;20(9):e0331224. doi: 10.1371/journal.pone.0331224. eCollection 2025.
2
Increasing pathogenic germline variant diagnosis rates in precision medicine: current best practices and future opportunities.提高精准医学中致病种系变异的诊断率:当前最佳实践与未来机遇
Hum Genomics. 2025 Aug 22;19(1):97. doi: 10.1186/s40246-025-00811-z.
3
Utilizing protein structure graph embeddings to predict the pathogenicity of missense variants.

本文引用的文献

1
A structural biology community assessment of AlphaFold2 applications.AlphaFold2 应用的结构生物学社区评估。
Nat Struct Mol Biol. 2022 Nov;29(11):1056-1067. doi: 10.1038/s41594-022-00849-w. Epub 2022 Nov 7.
2
PROST: AlphaFold2-aware Sequence-Based Predictor to Estimate Protein Stability Changes upon Missense Mutations.PROST:基于序列的 AlphaFold2 感知预测器,用于预测错义突变后蛋白质稳定性的变化。
J Chem Inf Model. 2022 Sep 12;62(17):4270-4282. doi: 10.1021/acs.jcim.2c00799. Epub 2022 Aug 16.
3
StabilitySort: assessment of protein stability changes on a genome-wide scale to prioritize potentially pathogenic genetic variation.
利用蛋白质结构图形嵌入来预测错义变体的致病性。
NAR Genom Bioinform. 2025 Jul 24;7(3):lqaf097. doi: 10.1093/nargab/lqaf097. eCollection 2025 Sep.
4
MOLGENIS VIP: an end-to-end DNA variant interpretation pipeline for research and diagnostics configurable to support rapid implementation of new methods.MOLGENIS VIP:一个用于研究和诊断的端到端DNA变异解读流程,可配置以支持新方法的快速实施。
NAR Genom Bioinform. 2025 Jun 23;7(2):lqaf087. doi: 10.1093/nargab/lqaf087. eCollection 2025 Jun.
5
PRP: pathogenic risk prediction for rare nonsynonymous single nucleotide variants.PRP:罕见非同义单核苷酸变异的致病风险预测
Hum Genet. 2025 May 29. doi: 10.1007/s00439-025-02751-z.
6
Detection of protein structural hotspots using AI distillation and explainability: application to the DAX-1 protein.使用人工智能蒸馏和可解释性检测蛋白质结构热点:应用于DAX-1蛋白
NAR Genom Bioinform. 2025 Apr 22;7(2):lqaf047. doi: 10.1093/nargab/lqaf047. eCollection 2025 Jun.
7
Variant effect predictor correlation with functional assays is reflective of clinical classification performance.变异效应预测器与功能测定的相关性反映了临床分类性能。
Genome Biol. 2025 Apr 22;26(1):104. doi: 10.1186/s13059-025-03575-w.
8
Deep learning tools predict variants in disordered regions with lower sensitivity.深度学习工具预测无序区域变异的敏感性较低。
BMC Genomics. 2025 Apr 12;26(1):367. doi: 10.1186/s12864-025-11534-9.
9
Predicting the impact of missense mutations on an unresolved protein's stability, structure, and function: A case study of Alzheimer's disease-associated TREM2 R47H variant.预测错义突变对未解析蛋白质稳定性、结构和功能的影响:以阿尔茨海默病相关的TREM2 R47H变体为例
Comput Struct Biotechnol J. 2025 Jan 31;27:564-574. doi: 10.1016/j.csbj.2025.01.024. eCollection 2025.
10
AFFIPred: AlphaFold2 structure-based Functional Impact Prediction of missense variations.AFFIPred:基于AlphaFold2结构的错义变异功能影响预测
Protein Sci. 2025 Feb;34(2):e70030. doi: 10.1002/pro.70030.
StabilitySort:在全基因组范围内评估蛋白质稳定性变化,以优先考虑潜在致病性遗传变异。
Bioinformatics. 2022 Sep 2;38(17):4220-4222. doi: 10.1093/bioinformatics/btac465.
4
The 3D mutational constraint on amino acid sites in the human proteome.人类蛋白质组中氨基酸位点的 3D 突变约束。
Nat Commun. 2022 Jun 7;13(1):3273. doi: 10.1038/s41467-022-30936-x.
5
Can AlphaFold2 predict the impact of missense mutations on structure?AlphaFold2能否预测错义突变对结构的影响?
Nat Struct Mol Biol. 2022 Jan;29(1):1-2. doi: 10.1038/s41594-021-00714-2.
6
Disease variant prediction with deep generative models of evolutionary data.利用进化数据的深度生成模型进行疾病变异预测。
Nature. 2021 Nov;599(7883):91-95. doi: 10.1038/s41586-021-04043-8. Epub 2021 Oct 27.
7
AlphaFold and Implications for Intrinsically Disordered Proteins.AlphaFold 及其对无序蛋白质的影响。
J Mol Biol. 2021 Oct 1;433(20):167208. doi: 10.1016/j.jmb.2021.167208. Epub 2021 Aug 18.
8
Next Generation Protein Structure Predictions and Genetic Variant Interpretation.下一代蛋白质结构预测与基因变异解读
J Mol Biol. 2021 Oct 1;433(20):167180. doi: 10.1016/j.jmb.2021.167180. Epub 2021 Aug 4.
9
Highly accurate protein structure prediction for the human proteome.高精准度的人类蛋白质组蛋白结构预测。
Nature. 2021 Aug;596(7873):590-596. doi: 10.1038/s41586-021-03828-1. Epub 2021 Jul 22.
10
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.