Charting γ-secretase substrates by explainable AI.

Authors

Breimann Stephan, Kamp Frits, Basset Gabriele, Abou-Ajram Claudia, Güner Gökhan, Yanagida Kanta, Okochi Masayasu, Müller Stephan A, Lichtenthaler Stefan F, Langosch Dieter, Frishman Dmitrij, Steiner Harald

Affiliations

Biomedical Center (BMC), Division of Metabolic Biochemistry, Faculty of Medicine, LMU Munich, München, Germany.

German Center for Neurodegenerative Diseases (DZNE), DZNE Munich, München, Germany.

Publication information

Nat Commun. 2025 Jul 1;16(1):5428. doi: 10.1038/s41467-025-60638-z.

DOI:10.1038/s41467-025-60638-z

PMID:40593564

Abstract

Proteases recognize substrates by decoding sequence information, an essential cellular process that remains elusive when recognition motifs are absent. Here, we unravel this problem for γ-secretase, an intramembrane-cleaving protease associated with Alzheimer's disease and cancer, by developing Comparative Physicochemical Profiling (CPP), a sequence-based algorithm for identifying interpretable physicochemical features. We show that CPP deciphers a γ-secretase substrate signature with single-residue resolution, which can explain the conformational transitions observed in substrates upon γ-secretase binding. Using machine learning, we predict the entire human γ-secretase substrate scope, revealing numerous previously unknown substrates. Our approach outperforms state-of-the-art protein language models, improving prediction accuracy from 60% to 90%, and achieves an 88% success rate in experimental validation. Building on these advancements, we identify pathways and diseases not previously linked to γ-secretase. More generally, CPP decodes physicochemical signatures, a concept that extends beyond sequence motifs. We anticipate that our approach will be broadly applicable to diverse molecular recognition processes.
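The general idea of comparing physicochemical profiles between two groups of sequences can be illustrated with a minimal sketch. This is not the authors' CPP implementation: it assumes a single amino acid scale (the Kyte-Doolittle hydropathy index), a fixed sequence segment, and a plain difference of group means as the comparison. The toy sequences and the helper names `segment_mean` and `feature_difference` are hypothetical and chosen only for illustration.

```python
from statistics import mean

# Kyte-Doolittle hydropathy index (one of many possible amino acid scales).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def segment_mean(seq, start, stop, scale=KD):
    """Average scale value over the residues seq[start:stop]."""
    return mean(scale[aa] for aa in seq[start:stop])


def feature_difference(test_seqs, ref_seqs, start, stop, scale=KD):
    """Difference of group means (test minus reference) for one
    scale/segment combination, i.e. one candidate feature."""
    test = mean(segment_mean(s, start, stop, scale) for s in test_seqs)
    ref = mean(segment_mean(s, start, stop, scale) for s in ref_seqs)
    return test - ref


# Hypothetical toy sequences, NOT real substrates or non-substrates.
test_group = ["GAIIGLMVGGVV", "LLVAGGIVLAVA"]
ref_group = ["LLLLLLLLLLLL", "IIVVLLIIVVLL"]

# Compare mean hydropathy of the first six residues between groups.
diff = feature_difference(test_group, ref_group, 0, 6)
print(f"mean hydropathy difference (test - reference): {diff:.2f}")
```

A negative difference here would indicate that the chosen segment is, on average, less hydrophobic in the test group than in the reference group. A full profiling approach would repeat such comparisons over many scales and segments and keep the most discriminative ones as interpretable features.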

