• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SoluProt:大肠杆菌中可溶性蛋白质表达的预测

SoluProt: prediction of soluble protein expression in Escherichia coli.

作者信息

Hon Jiri, Marusiak Martin, Martinek Tomas, Kunka Antonin, Zendulka Jaroslav, Bednar David, Damborsky Jiri

机构信息

Loschmidt Laboratories, Centre for Toxic Compounds in the Environment RECETOX and Department of Experimental Biology, Faculty of Science, Masaryk University, Brno 625 00, Czech Republic.

International Clinical Research Center, St. Anne's University Hospital Brno, Brno 656 91, Czech Republic.

出版信息

Bioinformatics. 2021 Apr 9;37(1):23-28. doi: 10.1093/bioinformatics/btaa1102.

DOI:10.1093/bioinformatics/btaa1102
PMID:33416864
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8034534/
Abstract

MOTIVATION

Poor protein solubility hinders the production of many therapeutic and industrially useful proteins. Experimental efforts to increase solubility are plagued by low success rates and often reduce biological activity. Computational prediction of protein expressibility and solubility in Escherichia coli using only sequence information could reduce the cost of experimental studies by enabling prioritization of highly soluble proteins.

RESULTS

A new tool for sequence-based prediction of soluble protein expression in E.coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.5% and AUC of 0.62 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies. SoluProt is freely available as a standalone program and a user-friendly webserver at https://loschmidt.chemi.muni.cz/soluprot/.

AVAILABILITY AND IMPLEMENTATION

https://loschmidt.chemi.muni.cz/soluprot/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

蛋白质溶解性差会阻碍许多治疗性和工业用蛋白质的生产。提高溶解性的实验努力成功率很低,而且常常会降低生物活性。仅使用序列信息对大肠杆菌中蛋白质的可表达性和溶解性进行计算预测,通过对高溶解性蛋白质进行优先排序,可以降低实验研究的成本。

结果

利用梯度提升机技术,以TargetTrack数据库作为训练集,创建了一种基于序列预测大肠杆菌中可溶性蛋白质表达的新工具SoluProt。当针对来自NESG数据库的平衡独立测试集进行评估时,SoluProt的准确率为58.5%,曲线下面积为0.62,超过了一系列其他溶解性预测工具。也有证据表明它可以显著提高实验性蛋白质研究的成功率。SoluProt可作为独立程序和用户友好的网络服务器免费获取,网址为https://loschmidt.chemi.muni.cz/soluprot/。

可用性和实现方式

https://loschmidt.chemi.muni.cz/soluprot/。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c756/8034534/31ed896325d2/btaa1102f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c756/8034534/678753891339/btaa1102f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c756/8034534/31ed896325d2/btaa1102f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c756/8034534/678753891339/btaa1102f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c756/8034534/31ed896325d2/btaa1102f2.jpg

相似文献

1
SoluProt: prediction of soluble protein expression in Escherichia coli.SoluProt:大肠杆菌中可溶性蛋白质表达的预测
Bioinformatics. 2021 Apr 9;37(1):23-28. doi: 10.1093/bioinformatics/btaa1102.
2
EnzymeMiner: automated mining of soluble enzymes with diverse structures, catalytic properties and stabilities.EnzymeMiner:具有多种结构、催化特性和稳定性的可溶性酶的自动化挖掘。
Nucleic Acids Res. 2020 Jul 2;48(W1):W104-W109. doi: 10.1093/nar/gkaa372.
3
Solubility-Weighted Index: fast and accurate prediction of protein solubility.溶解度加权指数:快速准确预测蛋白质溶解度。
Bioinformatics. 2020 Sep 15;36(18):4691-4698. doi: 10.1093/bioinformatics/btaa578.
4
CaverDock: a molecular docking-based tool to analyse ligand transport through protein tunnels and channels.CaverDock:一种基于分子对接的工具,用于分析配体通过蛋白质隧道和通道的传输。
Bioinformatics. 2019 Dec 1;35(23):4986-4993. doi: 10.1093/bioinformatics/btz386.
5
SoluProtMut: A manually curated database of protein solubility changes upon mutations.SoluProtMut:一个人工整理的关于突变后蛋白质溶解度变化的数据库。
Comput Struct Biotechnol J. 2022 Nov 9;20:6339-6347. doi: 10.1016/j.csbj.2022.11.009. eCollection 2022.
6
SOLart: a structure-based method to predict protein solubility and aggregation.SOLart:一种基于结构的预测蛋白质可溶性和聚集性的方法。
Bioinformatics. 2020 Mar 1;36(5):1445-1452. doi: 10.1093/bioinformatics/btz773.
7
PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine.PaRSnIP:基于梯度提升机的序列基蛋白质溶解性预测。
Bioinformatics. 2018 Apr 1;34(7):1092-1098. doi: 10.1093/bioinformatics/btx662.
8
AggreProt: a web server for predicting and engineering aggregation prone regions in proteins.AggreProt:一个用于预测和设计蛋白质中易于聚集区域的网络服务器。
Nucleic Acids Res. 2024 Jul 5;52(W1):W159-W169. doi: 10.1093/nar/gkae420.
9
Prediction of protein solubility based on sequence physicochemical patterns and distributed representation information with DeepSoluE.基于序列物理化学模式和分布式表示信息的 DeepSoluE 预测蛋白质溶解度。
BMC Biol. 2023 Jan 24;21(1):12. doi: 10.1186/s12915-023-01510-8.
10
BenchStab: a tool for automated querying of web-based stability predictors.BenchStab:一个用于自动化查询基于网络的稳定性预测器的工具。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae553.

引用本文的文献

1
Recovering Immunogenic N Protein from Pellets of Recombinant .从重组菌沉淀中回收具有免疫原性的N蛋白
Vaccines (Basel). 2025 Jul 10;13(7):744. doi: 10.3390/vaccines13070744.
2
Annotation of the Extracellular Enveloped Form of Monkeypox Virus for the Design, Screening, Validation, and Simulation of a Chimeric Vaccine Construct.用于嵌合疫苗构建体设计、筛选、验证和模拟的猴痘病毒细胞外包膜形式注释
Biology (Basel). 2025 Jul 8;14(7):830. doi: 10.3390/biology14070830.
3
SOuLMuSiC, a novel tool for predicting the impact of mutations on protein solubility.
SOuLMuSiC,一种预测突变对蛋白质溶解度影响的新型工具。
Sci Rep. 2025 Jul 29;15(1):27531. doi: 10.1038/s41598-025-11326-x.
4
Immunoinformatics-Based development of a Multi-Epitope vaccine candidate targeting coinfection by Klebsiella pneumoniae and Acinetobacter baumannii.基于免疫信息学开发针对肺炎克雷伯菌和鲍曼不动杆菌共感染的多表位疫苗候选物。
BMC Infect Dis. 2025 Jul 3;25(1):894. doi: 10.1186/s12879-025-11242-5.
5
Design and Validation of a Multi-Epitope mRNA Vaccine Construct Against Human Monkeypox Virus (hMPXV) by Annotating Protein of Intracellular Mature Virus (IMV) Form of hMPXV.通过注释人猴痘病毒(hMPXV)细胞内成熟病毒(IMV)形式的蛋白质设计并验证针对人猴痘病毒(hMPXV)的多表位mRNA疫苗构建体
Biomedicines. 2025 Jun 11;13(6):1439. doi: 10.3390/biomedicines13061439.
6
Reverse vaccinology for hookworms: a rational selection of vaccinable antigens against parasitic nematodes.钩虫的反向疫苗学:针对寄生线虫的可疫苗接种抗原的合理选择。
Parasite. 2025;32:32. doi: 10.1051/parasite/2025025. Epub 2025 Jun 3.
7
Comprehensive immunoinformatics and bioinformatics strategies for designing a multi-epitope based vaccine targeting structural proteins of Nipah virus.用于设计靶向尼帕病毒结构蛋白的多表位疫苗的综合免疫信息学和生物信息学策略。
Front Immunol. 2025 May 13;16:1535322. doi: 10.3389/fimmu.2025.1535322. eCollection 2025.
8
Immunoinformatic-driven design and evaluation of multi-epitope mRNA vaccine targeting HIV-1 gp120.基于免疫信息学的针对HIV-1 gp120的多表位mRNA疫苗的设计与评估
Front Immunol. 2025 May 13;16:1480025. doi: 10.3389/fimmu.2025.1480025. eCollection 2025.
9
Structural insights and predictive modelling of a novel anti-HER2 scFv and Leptulipin: a newly designed immunotoxin protein for HER2 positive cancers.一种新型抗HER2单链抗体片段和瘦素脂肽的结构见解与预测模型:一种新设计的用于HER2阳性癌症的免疫毒素蛋白。
Toxicol Res (Camb). 2025 May 24;14(3):tfaf060. doi: 10.1093/toxres/tfaf060. eCollection 2025 Jun.
10
Sequence and taxonomic feature evaluation facilitated the discovery of alcohol oxidases.序列和分类学特征评估促进了醇氧化酶的发现。
Synth Syst Biotechnol. 2025 Apr 22;10(3):907-915. doi: 10.1016/j.synbio.2025.04.014. eCollection 2025 Sep.