• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从有限数据预测蛋白质生物物理特性:通过NanoMelt对纳米抗体热稳定性的案例研究。

Prediction of protein biophysical traits from limited data: a case study on nanobody thermostability through NanoMelt.

作者信息

Ramon Aubin, Ni Mingyang, Predeina Olga, Gaffey Rebecca, Kunz Patrick, Onuoha Shimobi, Sormanni Pietro

机构信息

Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK.

Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany.

出版信息

MAbs. 2025 Dec;17(1):2442750. doi: 10.1080/19420862.2024.2442750. Epub 2025 Jan 8.

DOI:10.1080/19420862.2024.2442750
PMID:39772905
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11730357/
Abstract

In-silico prediction of protein biophysical traits is often hindered by the limited availability of experimental data and their heterogeneity. Training on limited data can lead to overfitting and poor generalizability to sequences distant from those in the training set. Additionally, inadequate use of scarce and disparate data can introduce biases during evaluation, leading to unreliable model performances being reported. Here, we present a comprehensive study exploring various approaches for protein fitness prediction from limited data, leveraging pre-trained embeddings, repeated stratified nested cross-validation, and ensemble learning to ensure an unbiased assessment of the performances. We applied our framework to introduce NanoMelt, a predictor of nanobody thermostability trained with a dataset of 640 measurements of apparent melting temperature, obtained by integrating data from the literature with 129 new measurements from this study. We find that an ensemble model stacking multiple regression using diverse sequence embeddings achieves state-of-the-art accuracy in predicting nanobody thermostability. We further demonstrate NanoMelt's potential to streamline nanobody development by guiding the selection of highly stable nanobodies. We make the curated dataset of nanobody thermostability freely available and NanoMelt accessible as a downloadable software and webserver.

摘要

蛋白质生物物理特性的计算机模拟预测常常受到实验数据有限及其异质性的阻碍。在有限数据上进行训练可能导致过拟合,并且对与训练集中序列差异较大的序列泛化能力较差。此外,对稀缺且分散的数据利用不足可能会在评估过程中引入偏差,导致报告的模型性能不可靠。在此,我们进行了一项全面研究,探索从有限数据预测蛋白质适应性的各种方法,利用预训练嵌入、重复分层嵌套交叉验证和集成学习来确保对性能进行无偏评估。我们应用我们的框架引入了NanoMelt,这是一种纳米抗体热稳定性预测器,它使用通过将文献数据与本研究中的129个新测量数据整合而获得的640个表观熔解温度测量数据集进行训练。我们发现,使用多种序列嵌入的多元回归集成模型在预测纳米抗体热稳定性方面达到了当前的最高准确率。我们进一步证明了NanoMelt通过指导选择高度稳定的纳米抗体来简化纳米抗体开发的潜力。我们免费提供经过整理的纳米抗体热稳定性数据集,并将NanoMelt作为可下载软件和网络服务器提供访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/4ee51a99b1a4/KMAB_A_2442750_F0006_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/d831f01a1114/KMAB_A_2442750_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/cfc7c6b2d100/KMAB_A_2442750_F0002_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/0dc417e9a65c/KMAB_A_2442750_F0003_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/2d78dab0ffa3/KMAB_A_2442750_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/a8e48bd46d24/KMAB_A_2442750_F0005_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/4ee51a99b1a4/KMAB_A_2442750_F0006_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/d831f01a1114/KMAB_A_2442750_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/cfc7c6b2d100/KMAB_A_2442750_F0002_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/0dc417e9a65c/KMAB_A_2442750_F0003_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/2d78dab0ffa3/KMAB_A_2442750_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/a8e48bd46d24/KMAB_A_2442750_F0005_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/165b/11730357/4ee51a99b1a4/KMAB_A_2442750_F0006_OC.jpg

相似文献

1
Prediction of protein biophysical traits from limited data: a case study on nanobody thermostability through NanoMelt.从有限数据预测蛋白质生物物理特性:通过NanoMelt对纳米抗体热稳定性的案例研究。
MAbs. 2025 Dec;17(1):2442750. doi: 10.1080/19420862.2024.2442750. Epub 2025 Jan 8.
2
TEMPRO: nanobody melting temperature estimation model using protein embeddings.TEMPRO:使用蛋白质嵌入的纳米体融解温度预估模型。
Sci Rep. 2024 Aug 17;14(1):19074. doi: 10.1038/s41598-024-70101-6.
3
NanoBERTa-ASP: predicting nanobody paratope based on a pretrained RoBERTa model.NanoBERTa-ASP:基于预训练的 RoBERTa 模型预测纳米抗体表位。
BMC Bioinformatics. 2024 Mar 21;25(1):122. doi: 10.1186/s12859-024-05750-5.
4
Exploiting sequence and stability information for directing nanobody stability engineering.利用序列和稳定性信息指导纳米抗体稳定性工程。
Biochim Biophys Acta Gen Subj. 2017 Sep;1861(9):2196-2205. doi: 10.1016/j.bbagen.2017.06.014. Epub 2017 Jun 20.
5
Advancements in nanobody generation: Integrating conventional, in silico, and machine learning approaches.纳米抗体生成技术的进展:传统方法、计算方法和机器学习方法的整合。
Biotechnol Bioeng. 2024 Nov;121(11):3375-3388. doi: 10.1002/bit.28816. Epub 2024 Jul 25.
6
Exploring single-domain antibody thermostability by molecular dynamics simulation.通过分子动力学模拟探索单域抗体的热稳定性。
J Biomol Struct Dyn. 2019 Sep;37(14):3686-3696. doi: 10.1080/07391102.2018.1526116. Epub 2018 Nov 18.
7
Nanobody stability engineering by employing the ΔTm shift; a comparison with apparent rate constants of heat-induced aggregation.通过 ΔTm 偏移实现纳米抗体稳定性工程;与热诱导聚集的表观速率常数的比较。
Protein Eng Des Sel. 2019 Dec 31;32(5):241-249. doi: 10.1093/protein/gzz017.
8
Structural and Computational Studies of the SARS-CoV-2 Spike Protein Binding Mechanisms with Nanobodies: From Structure and Dynamics to Avidity-Driven Nanobody Engineering.SARS-CoV-2 刺突蛋白与纳米抗体结合机制的结构和计算研究:从结构和动力学到亲和力驱动的纳米抗体工程。
Int J Mol Sci. 2022 Mar 8;23(6):2928. doi: 10.3390/ijms23062928.
9
ParaAntiProt provides paratope prediction using antibody and protein language models.ParaAntiProt 利用抗体和蛋白质语言模型提供抗体互补决定区(CDR)预测。
Sci Rep. 2024 Nov 25;14(1):29141. doi: 10.1038/s41598-024-80940-y.
10
Toward generalizable prediction of antibody thermostability using machine learning on sequence and structure features.利用序列和结构特征的机器学习方法实现抗体热稳定性的可推广预测。
MAbs. 2023 Jan-Dec;15(1):2163584. doi: 10.1080/19420862.2022.2163584.

本文引用的文献

1
The lack of trade-off between conformational stability and binding affinity in a nanobody with therapeutic potential for a misfolding disease.在一种对错误折叠疾病具有治疗潜力的纳米抗体中,构象稳定性与结合亲和力之间不存在权衡关系。
Int J Biol Macromol. 2025 Jan;284(Pt 1):138046. doi: 10.1016/j.ijbiomac.2024.138046. Epub 2024 Nov 26.
2
Thermodynamic versus kinetic basis for the high conformational stability of nanobodies for therapeutic applications.用于治疗应用的纳米抗体高构象稳定性的热力学与动力学基础。
FEBS Lett. 2025 Mar;599(5):766-776. doi: 10.1002/1873-3468.15064. Epub 2024 Nov 26.
3
TEMPRO: nanobody melting temperature estimation model using protein embeddings.
TEMPRO:使用蛋白质嵌入的纳米体融解温度预估模型。
Sci Rep. 2024 Aug 17;14(1):19074. doi: 10.1038/s41598-024-70101-6.
4
AbMelt: Learning antibody thermostability from molecular dynamics.AbMelt:从分子动力学角度学习抗体热稳定性。
Biophys J. 2024 Sep 3;123(17):2921-2933. doi: 10.1016/j.bpj.2024.06.003. Epub 2024 Jun 7.
5
nanoBERT: a deep learning model for gene agnostic navigation of the nanobody mutational space.纳米抗体BERT:一种用于纳米抗体突变空间基因无关导航的深度学习模型。
Bioinform Adv. 2024 Mar 6;4(1):vbae033. doi: 10.1093/bioadv/vbae033. eCollection 2024.
6
Glycosylation Contributes to Thermostability and Proteolytic Resistance of rFIP-nha ().糖基化有助于 rFIP-nha()的热稳定性和抗蛋白酶水解性。
Molecules. 2023 Aug 31;28(17):6386. doi: 10.3390/molecules28176386.
7
ImmuneBuilder: Deep-Learning models for predicting the structures of immune proteins.免疫构建体:用于预测免疫蛋白结构的深度学习模型。
Commun Biol. 2023 May 29;6(1):575. doi: 10.1038/s42003-023-04927-7.
8
Protein Fitness Prediction Is Impacted by the Interplay of Language Models, Ensemble Learning, and Sampling Methods.蛋白质适应性预测受到语言模型、集成学习和采样方法相互作用的影响。
Pharmaceutics. 2023 Apr 25;15(5):1337. doi: 10.3390/pharmaceutics15051337.
9
DeepSTABp: A Deep Learning Approach for the Prediction of Thermal Protein Stability.DeepSTABp:一种用于预测热蛋白稳定性的深度学习方法。
Int J Mol Sci. 2023 Apr 18;24(8):7444. doi: 10.3390/ijms24087444.
10
In vivo hypermutation and continuous evolution.体内超突变与持续进化。
Nat Rev Methods Primers. 2022;2. doi: 10.1038/s43586-022-00130-w. Epub 2022 May 19.