潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量。（你提供的原文似乎不完整，“expressed in”后面缺少具体内容）

PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in .

作者信息

Packiam Kulandai Arockia Rajesh, Ooi Chien Wei, Li Fuyi, Mei Shutao, Tey Beng Ti, Ong Huey Fang, Song Jiangning, Ramanan Ramakrishnan Nagasundara

机构信息

Chemical Engineering Discipline, School of Engineering, Monash University Malaysia, Jalan Lagoon Selatan, 47500 Bandar Sunway, Malaysia.

Advanced Engineering Platform, Monash University Malaysia, Jalan Lagoon Selatan, 47500 Bandar Sunway, Selangor, Malaysia.

出版信息

Comput Struct Biotechnol J. 2022 Jun 3;20:2909-2920. doi: 10.1016/j.csbj.2022.06.006. eCollection 2022.

DOI:10.1016/j.csbj.2022.06.006

PMID:35765650

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9201004/

Abstract

Optimization of the fermentation process for recombinant protein production (RPP) is often resource-intensive. Machine learning (ML) approaches are helpful in minimizing the experimentations and find vast applications in RPP. However, these ML-based tools primarily focus on features with respect to amino-acid-sequence, ruling out the influence of fermentation process conditions. The present study combines the features derived from fermentation process conditions with that from amino acid-sequence to construct an ML-based model that predicts the maximal protein yields and the corresponding fermentation conditions for the expression of target recombinant protein in the periplasm. Two sets of XGBoost classifiers were employed in the first stage to classify the expression levels of the target protein as high (>50 mg/L), medium (between 0.5 and 50 mg/L), or low (<0.5 mg/L). The second-stage framework consisted of three regression models involving support vector machines and random forest to predict the expression yields corresponding to each expression-level-class. Independent tests showed that the predictor achieved an overall average accuracy of 75% and a Pearson coefficient correlation of 0.91 for the correctly classified instances. Therefore, our model offers a reliable substitution of numerous trial-and-error experiments to identify the optimal fermentation conditions and yield for RPP. It is also implemented as an open-access webserver, PERISCOPE-Opt (http://periscope-opt.erc.monash.edu).

摘要

重组蛋白生产（RPP）发酵过程的优化通常资源消耗大。机器学习（ML）方法有助于减少实验次数，并在RPP中得到广泛应用。然而，这些基于ML的工具主要关注氨基酸序列相关的特征，而忽略了发酵过程条件的影响。本研究将发酵过程条件衍生的特征与氨基酸序列的特征相结合，构建了一个基于ML的模型，该模型可预测周质中目标重组蛋白表达的最大蛋白产量及相应的发酵条件。第一阶段使用两组XGBoost分类器将目标蛋白的表达水平分为高（>50 mg/L）、中（0.5至50 mg/L之间）或低（<0.5 mg/L）。第二阶段框架由三个回归模型组成，涉及支持向量机和随机森林，以预测对应于每个表达水平类别的表达产量。独立测试表明，该预测器对正确分类的实例总体平均准确率达到75%，Pearson系数相关性为0.91。因此，我们的模型为确定RPP的最佳发酵条件和产量提供了一种可靠的替代大量试错实验的方法。它还作为一个开放获取的网络服务器PERISCOPE - Opt（http://periscope-opt.erc.monash.edu）得以实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/316d/9201004/c49dde31df2e/ga1.jpg

相似文献

PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in .

Comput Struct Biotechnol J. 2022 Jun 3;20:2909-2920. doi: 10.1016/j.csbj.2022.06.006. eCollection 2022.

Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli.

Sci Rep. 2016 Mar 2;6:21844. doi: 10.1038/srep21844.

Hyperspectral Monitoring of Powdery Mildew Disease Severity in Wheat Based on Machine Learning.

Front Plant Sci. 2022 Mar 21;13:828454. doi: 10.3389/fpls.2022.828454. eCollection 2022.

Machine learning-based risk factor analysis and prevalence prediction of intestinal parasitic infections using epidemiological survey data.

PLoS Negl Trop Dis. 2022 Jun 14;16(6):e0010517. doi: 10.1371/journal.pntd.0010517. eCollection 2022 Jun.

Stepwise optimization of recombinant protein production in Escherichia coli utilizing computational and experimental approaches.

Appl Microbiol Biotechnol. 2020 Apr;104(8):3253-3266. doi: 10.1007/s00253-020-10454-w. Epub 2020 Feb 19.

Poplar's Waterlogging Resistance Modeling and Evaluating: Exploring and Perfecting the Feasibility of Machine Learning Methods in Plant Science.

Front Plant Sci. 2022 Feb 11;13:821365. doi: 10.3389/fpls.2022.821365. eCollection 2022.

Localization of Ventricular Activation Origin from the 12-Lead ECG: A Comparison of Linear Regression with Non-Linear Methods of Machine Learning.

Ann Biomed Eng. 2019 Feb;47(2):403-412. doi: 10.1007/s10439-018-02168-y. Epub 2018 Nov 21.

Machine Learning-Based Prediction of Supercapacitor Capacitance for MgCoO Electrodes.

Chemphyschem. 2024 Nov 4;25(21):e202400629. doi: 10.1002/cphc.202400629. Epub 2024 Sep 9.

Oversampling methods for machine learning model data training to improve model capabilities to predict the presence of Escherichia coli MG1655 in spinach wash water.

J Food Sci. 2024 Jan;89(1):150-173. doi: 10.1111/1750-3841.16850. Epub 2023 Dec 5.

Classification and prediction of spinal disease based on the SMOTE-RFE-XGBoost model.

PeerJ Comput Sci. 2023 Mar 10;9:e1280. doi: 10.7717/peerj-cs.1280. eCollection 2023.

引用本文的文献

Machine learning-based prediction of volatile compounds profiles in Saccharomyces cerevisiae fermentation simulating canned meat.

NPJ Sci Food. 2025 Jun 2;9(1):92. doi: 10.1038/s41538-025-00435-6.

Towards AI-designed genomes using a variational autoencoder.

Proc Biol Sci. 2024 Dec;291(2036):20241457. doi: 10.1098/rspb.2024.1457. Epub 2024 Dec 11.

Immunogenic potential and neutralizing ability of a heterologous version of the most abundant three-finger toxin from the coral snake .

J Venom Anim Toxins Incl Trop Dis. 2024 Nov 25;30:e20230074. doi: 10.1590/1678-9199-JVATITD-2023-0074. eCollection 2024.

Recent advances in culture medium design for enhanced production of monoclonal antibodies in CHO cells: A comparative study of machine learning and systems biology approaches.

Biotechnol Adv. 2025 Jan-Feb;78:108480. doi: 10.1016/j.biotechadv.2024.108480. Epub 2024 Nov 19.

Artificial intelligence-driven systems engineering for next-generation plant-derived biopharmaceuticals.

Front Plant Sci. 2023 Nov 15;14:1252166. doi: 10.3389/fpls.2023.1252166. eCollection 2023.

Machine learning-assisted medium optimization revealed the discriminated strategies for improved production of the foreign and native metabolites.

Comput Struct Biotechnol J. 2023 Apr 20;21:2654-2663. doi: 10.1016/j.csbj.2023.04.020. eCollection 2023.

Heterologous Expression and Immunogenic Potential of the Most Abundant Phospholipase A from Coral Snake to Develop Antivenoms.

Toxins (Basel). 2022 Nov 24;14(12):825. doi: 10.3390/toxins14120825.

本文引用的文献

Cloning, Expression, and Purification of the Human Synthetic Survivin Protein in Escherichia coli Using Response Surface Methodology (RSM).

Mol Biotechnol. 2023 Mar;65(3):326-336. doi: 10.1007/s12033-021-00399-4. Epub 2021 Sep 26.

as an antibody expression host for the production of diagnostic proteins: significance and expression.

Crit Rev Biotechnol. 2022 Aug;42(5):756-773. doi: 10.1080/07388551.2021.1967871. Epub 2021 Sep 1.

Optimization of expression, purification and secretion of functional recombinant human growth hormone in Escherichia coli using modified staphylococcal protein a signal peptide.

BMC Biotechnol. 2021 Aug 16;21(1):51. doi: 10.1186/s12896-021-00701-x.

SolTranNet-A Machine Learning Tool for Fast Aqueous Solubility Prediction.

J Chem Inf Model. 2021 Jun 28;61(6):2530-2536. doi: 10.1021/acs.jcim.1c00331. Epub 2021 May 26.

Evolution of Expression System in Producing Antibody Recombinant Fragments.

Int J Mol Sci. 2020 Aug 31;21(17):6324. doi: 10.3390/ijms21176324.

Solubility-Weighted Index: fast and accurate prediction of protein solubility.

Bioinformatics. 2020 Sep 15;36(18):4691-4698. doi: 10.1093/bioinformatics/btaa578.

Stepwise optimization of recombinant protein production in Escherichia coli utilizing computational and experimental approaches.

Appl Microbiol Biotechnol. 2020 Apr;104(8):3253-3266. doi: 10.1007/s00253-020-10454-w. Epub 2020 Feb 19.

Optimization of culture conditions for the expression of three different insoluble proteins in Escherichia coli.

Sci Rep. 2019 Nov 14;9(1):16850. doi: 10.1038/s41598-019-53200-7.

DeepSol: a deep learning framework for sequence-based protein solubility prediction.

Bioinformatics. 2018 Aug 1;34(15):2605-2613. doi: 10.1093/bioinformatics/bty166.

PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine.

Bioinformatics. 2018 Apr 1;34(7):1092-1098. doi: 10.1093/bioinformatics/btx662.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量。（你提供的原文似乎不完整，“expressed in”后面缺少具体内容）

PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in .

作者信息

Packiam Kulandai Arockia Rajesh, Ooi Chien Wei, Li Fuyi, Mei Shutao, Tey Beng Ti, Ong Huey Fang, Song Jiangning, Ramanan Ramakrishnan Nagasundara

机构信息

Chemical Engineering Discipline, School of Engineering, Monash University Malaysia, Jalan Lagoon Selatan, 47500 Bandar Sunway, Malaysia.

Advanced Engineering Platform, Monash University Malaysia, Jalan Lagoon Selatan, 47500 Bandar Sunway, Selangor, Malaysia.

出版信息

Comput Struct Biotechnol J. 2022 Jun 3;20:2909-2920. doi: 10.1016/j.csbj.2022.06.006. eCollection 2022.

DOI:10.1016/j.csbj.2022.06.006

PMID:35765650

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9201004/

Abstract

摘要

潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量 。 （你提供的原文似乎不完整，“expressed in”后面缺少具体内容）

PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in .

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量 。 （你提供的原文似乎不完整，“expressed in”后面缺少具体内容）

PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in .

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量。（你提供的原文似乎不完整，“expressed in”后面缺少具体内容）

潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量。（你提供的原文似乎不完整，“expressed in”后面缺少具体内容）