基于结构的精确高效计算突变，用于模拟维多利亚多管发光水母绿色荧光蛋白突变体的荧光水平。

Accurate and efficient structure-based computational mutagenesis for modeling fluorescence levels of Aequorea victoria green fluorescent protein mutants.

机构信息

Laboratory for Structural Bioinformatics, School of Systems Biology, George Mason University, 10900 University Boulevard MS 5B3, Manassas, VA 20110, USA.

出版信息

Protein Eng Des Sel. 2020 Sep 14;33. doi: 10.1093/protein/gzaa022.

DOI:10.1093/protein/gzaa022

PMID:32930801

Abstract

A computational mutagenesis technique was used to characterize the structural effects associated with over 46 000 single and multiple amino acid variants of Aequorea victoria green fluorescent protein (GFP), whose functional effects (fluorescence levels) were recently measured by experimental researchers. For each GFP mutant, the approach generated a single score reflecting the overall change in sequence-structure compatibility relative to native GFP, as well as a vector of environmental perturbation (EP) scores characterizing the impact at all GFP residue positions. A significant GFP structure-function relationship (P < 0.0001) was elucidated by comparing the sequence-structure compatibility scores with the functional data. Next, the computed vectors for GFP mutants were used to train predictive models of fluorescence by implementing random forest (RF) classification and tree regression machine learning algorithms. Classification performance reached 0.93 for sensitivity, 0.91 for precision and 0.90 for balanced accuracy, and regression models led to Pearson's correlation as high as r = 0.83 between experimental and predicted GFP mutant fluorescence. An RF model trained on a subset of over 1000 experimental single residue GFP mutants with measured fluorescence was used for predicting the 3300 remaining unstudied single residue mutants, with results complementing known GFP biochemical and biophysical properties. In addition, models trained on the subset of experimental GFP mutants harboring multiple residue replacements successfully predicted fluorescence of the single residue GFP mutants. The models developed for this study were accurate and efficient, and their predictions outperformed those of several related state-of-the-art methods.

摘要

使用计算突变技术来描述与 Aequorea victoria 绿色荧光蛋白 (GFP) 的 46000 多种单氨基酸和多氨基酸变体相关的结构效应，其功能效应 (荧光水平) 最近被实验研究人员测量。对于每个 GFP 突变体，该方法生成一个单一的分数，反映了相对于天然 GFP 的序列-结构兼容性的总体变化，以及一个描述所有 GFP 残基位置影响的环境扰动 (EP) 分数向量。通过比较序列-结构兼容性分数与功能数据，阐明了 GFP 结构-功能关系的显著相关性 (P<0.0001)。接下来，使用 GFP 突变体的计算向量通过实现随机森林 (RF) 分类和树回归机器学习算法来训练荧光预测模型。分类性能达到了 0.93 的灵敏度、0.91 的精确度和 0.90 的平衡准确性，回归模型导致实验和预测 GFP 突变体荧光之间的 Pearson 相关系数高达 r=0.83。使用经过实验测量的荧光的 1000 多个单残基 GFP 突变体子集训练的 RF 模型用于预测其余 3300 个未研究的单残基突变体，结果补充了已知的 GFP 生化和生物物理特性。此外，在含有多个残基替换的实验 GFP 突变体子集上训练的模型成功预测了单残基 GFP 突变体的荧光。为这项研究开发的模型准确且高效，其预测性能优于几种相关的最先进方法。

相似文献

Accurate and efficient structure-based computational mutagenesis for modeling fluorescence levels of Aequorea victoria green fluorescent protein mutants.基于结构的精确高效计算突变，用于模拟维多利亚多管发光水母绿色荧光蛋白突变体的荧光水平。

Protein Eng Des Sel. 2020 Sep 14;33. doi: 10.1093/protein/gzaa022.

Structural plasticity of green fluorescent protein to amino acid deletions and fluorescence rescue by folding-enhancing mutations.绿色荧光蛋白对氨基酸缺失的结构可塑性以及通过折叠增强突变实现的荧光拯救

BMC Biochem. 2015 Jul 25;16:17. doi: 10.1186/s12858-015-0046-5.

The first mutant of the Aequorea victoria green fluorescent protein that forms a red chromophore.维多利亚多管水母绿色荧光蛋白的首个形成红色发色团的突变体。

Biochemistry. 2008 Apr 22;47(16):4666-73. doi: 10.1021/bi702130s. Epub 2008 Mar 27.

A practical teaching course in directed protein evolution using the green fluorescent protein as a model.一门以绿色荧光蛋白为模型的定向蛋白质进化实践教学课程。

Biochem Mol Biol Educ. 2011 Jan-Feb;39(1):21-7. doi: 10.1002/bmb.20430.

Deletional protein engineering based on stable fold.基于稳定折叠的缺失型蛋白质工程。

PLoS One. 2012;7(12):e51510. doi: 10.1371/journal.pone.0051510. Epub 2012 Dec 11.

Modeling transcriptional activation changes to Gal4 variants via structure-based computational mutagenesis.通过基于结构的计算诱变对Gal4变体的转录激活变化进行建模。

PeerJ. 2018 May 29;6:e4844. doi: 10.7717/peerj.4844. eCollection 2018.

Illuminating the origins of spectral properties of green fluorescent proteins via proteochemometric and molecular modeling.通过蛋白质化学计量学和分子建模揭示绿色荧光蛋白光谱特性的起源

J Comput Chem. 2014 Oct 15;35(27):1951-66. doi: 10.1002/jcc.23708. Epub 2014 Aug 12.

Accurate prediction of stability changes in protein mutants by combining machine learning with structure based computational mutagenesis.通过将机器学习与基于结构的计算诱变相结合，准确预测蛋白质突变体的稳定性变化。

Bioinformatics. 2008 Sep 15;24(18):2002-9. doi: 10.1093/bioinformatics/btn353. Epub 2008 Jul 16.

From green to blue: site-directed mutagenesis of the green fluorescent protein to teach protein structure-function relationships.从绿色到蓝色：绿色荧光蛋白的定点诱变用于阐释蛋白质结构与功能的关系

Biochem Mol Biol Educ. 2011 Jul;39(4):309-15. doi: 10.1002/bmb.20467.

Modeling the functional consequences of single residue replacements in bacteriophage f1 gene V protein.模拟噬菌体 f1 基因 V 蛋白中单个残基替换的功能后果。

Protein Eng Des Sel. 2009 Nov;22(11):665-71. doi: 10.1093/protein/gzp050. Epub 2009 Aug 18.

引用本文的文献

Empirical validation of ProteinMPNN's efficiency in enhancing protein fitness.ProteinMPNN在提高蛋白质适应性方面效率的实证验证。

Front Genet. 2024 Jan 11;14:1347667. doi: 10.3389/fgene.2023.1347667. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于结构的精确高效计算突变，用于模拟维多利亚多管发光水母绿色荧光蛋白突变体的荧光水平。

Accurate and efficient structure-based computational mutagenesis for modeling fluorescence levels of Aequorea victoria green fluorescent protein mutants.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献