The Edison Family Center for Genome Sciences and Systems Biology, Washington University in St. Louis School of Medicine, Saint Louis, MO 63110, USA; Department of Genetics, Washington University in St. Louis School of Medicine, Saint Louis, MO 63110, USA.
Department of Biomedical Engineering, Washington University in St. Louis, Saint Louis, MO 63130, USA; Center for Biological Systems Engineering, Washington University in St. Louis, Saint Louis, MO 63130, USA.
Cell Syst. 2018 Apr 25;6(4):444-455.e6. doi: 10.1016/j.cels.2018.01.015. Epub 2018 Mar 7.
Transcriptional activation domains are essential for gene regulation, but their intrinsic disorder and low primary sequence conservation have made it difficult to identify the amino acid composition features that underlie their activity. Here, we describe a rational mutagenesis scheme that deconvolves the function of four activation domain sequence features-acidity, hydrophobicity, intrinsic disorder, and short linear motifs-by quantifying the activity of thousands of variants in vivo and simulating their conformational ensembles using an all-atom Monte Carlo approach. Our results with a canonical activation domain from the Saccharomyces cerevisiae transcription factor Gcn4 reconcile existing observations into a unified model of its function: the intrinsic disorder and acidic residues keep two hydrophobic motifs from driving collapse. Instead, the most-active variants keep their aromatic residues exposed to the solvent. Our results illustrate how the function of intrinsically disordered proteins can be revealed by high-throughput rational mutagenesis.
转录激活结构域对于基因调控至关重要,但由于其固有无序性和低一级序列保守性,难以确定其活性所依赖的氨基酸组成特征。在这里,我们描述了一种合理的诱变方案,通过在体内量化数千种变体的活性并使用全原子蒙特卡罗方法模拟它们的构象集合,来分解四个激活结构域序列特征(酸度、疏水性、固有无序性和短线性基序)的功能。我们使用来自酿酒酵母转录因子 Gcn4 的典型激活结构域的结果将现有观察结果整合到其功能的统一模型中:固有无序性和酸性残基防止两个疏水性基序发生塌陷。相反,最活跃的变体使它们的芳香族残基暴露在溶剂中。我们的结果说明了如何通过高通量合理诱变来揭示固有无序蛋白的功能。