Suppr超能文献

通过基于结构的计算诱变对Gal4变体的转录激活变化进行建模。

Modeling transcriptional activation changes to Gal4 variants via structure-based computational mutagenesis.

作者信息

Masso Majid, Rao Nitin, Pyarasani Purnima

机构信息

Laboratory for Structural Bioinformatics, School of Systems Biology, George Mason University, Manassas, VA, United States of America.

出版信息

PeerJ. 2018 May 29;6:e4844. doi: 10.7717/peerj.4844. eCollection 2018.

Abstract

As a DNA binding transcriptional activator, Gal4 promotes the expression of genes responsible for galactose metabolism. The Gal4 protein from (baker's yeast) has become a model for studying eukaryotic transcriptional activation in general because its regulatory properties mirror those of several eukaryotic organisms, including mammals. Given the availability of a crystallographic structure for Gal4, here we implement an mutagenesis technique that makes use of a four-body knowledge-based energy function, in order to empirically quantify the structural impacts associated with single residue substitutions on the Gal4 protein. These results were used to examine the structure-function relationship in Gal4 based on a recently published experimental mutagenesis study, whereby functional changes to a uniformly distributed set of 1,068 single residue Gal4 variants were obtained by measuring their transcriptional activation levels relative to wild-type. A significant correlation was observed between computed (scalar) structural effect data and measured activity values for this collection of single residue Gal4 variants. Additionally, attribute vectors quantifying position-specific environmental impacts were generated for each of the Gal4 variants via computational mutagenesis, and we implemented supervised classification and regression statistical machine learning algorithms to train predictive models of variant Gal4 activity based on these structural changes. All models performed well under cross-validation testing, with balanced accuracy reaching 91% among the classification models, and with the actual and predicted activity values displaying a correlation as high as  = 0.80 for the regression models. Reliable predictions of transcriptional activation levels for Gal4 variants that have yet to be studied can be instantly generated by submitting their respective structure-based feature vectors to the trained models for testing. Such a computational pre-screening of Gal4 variants may potentially reduce costs associated with running large-scale mutagenesis experiments.

摘要

作为一种DNA结合转录激活因子,Gal4可促进负责半乳糖代谢的基因的表达。来自酿酒酵母的Gal4蛋白已成为研究真核生物转录激活的通用模型,因为其调控特性反映了包括哺乳动物在内的几种真核生物的特性。鉴于Gal4晶体结构的可得性,我们在此采用一种基于四体知识能量函数的诱变技术,以便通过实验量化Gal4蛋白上单个残基取代所带来的结构影响。基于最近发表的一项实验诱变研究,这些结果被用于研究Gal4的结构-功能关系,该研究通过测量1068个均匀分布的Gal4单残基变体相对于野生型的转录激活水平,获得了它们的功能变化。对于这组Gal4单残基变体,计算得到的(标量)结构效应数据与测量得到的活性值之间存在显著相关性。此外,通过计算诱变,为每个Gal4变体生成了量化位置特异性环境影响的属性向量,并且我们实施了监督分类和回归统计机器学习算法,以基于这些结构变化训练Gal4变体活性的预测模型。在交叉验证测试中,所有模型表现良好,分类模型的平衡准确率达到91%,回归模型的实际活性值与预测活性值之间的相关性高达0.80。通过将各自基于结构的特征向量提交给训练好的模型进行测试,可以立即生成对尚未研究的Gal4变体转录激活水平的可靠预测。对Gal4变体进行这样的计算预筛选可能会降低进行大规模诱变实验的相关成本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f7e/5983003/e195349ee833/peerj-06-4844-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验