Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, Saint Louis, MO 63110, USA; Department of Genetics, Washington University School of Medicine in St. Louis, Saint Louis, MO 63110, USA; Center for Computational Biology, University of California Berkeley, Berkeley, CA 94720, USA.
Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, Saint Louis, MO 63110, USA; Department of Genetics, Washington University School of Medicine in St. Louis, Saint Louis, MO 63110, USA.
Cell Syst. 2022 Apr 20;13(4):334-345.e5. doi: 10.1016/j.cels.2022.01.002. Epub 2022 Feb 3.
Acidic activation domains are intrinsically disordered regions of the transcription factors that bind coactivators. The intrinsic disorder and low evolutionary conservation of activation domains have made it difficult to identify the sequence features that control activity. To address this problem, we designed thousands of variants in seven acidic activation domains and measured their activities with a high-throughput assay in human cell culture. We found that strong activation domain activity requires a balance between the number of acidic residues and aromatic and leucine residues. These findings motivated a predictor of acidic activation domains that scans the human proteome for clusters of aromatic and leucine residues embedded in regions of high acidity. This predictor identifies known activation domains and accurately predicts previously unidentified ones. Our results support a flexible acidic exposure model of activation domains in which the acidic residues solubilize hydrophobic motifs so that they can interact with coactivators. A record of this paper's transparent peer review process is included in the supplemental information.
酸性激活结构域是转录因子中与共激活因子结合的无规则结构区域。激活结构域的固有无序性和低进化保守性使得识别控制活性的序列特征变得困难。为了解决这个问题,我们设计了七种酸性激活结构域中的数千个变体,并用人细胞培养中的高通量测定法测量了它们的活性。我们发现,强大的激活结构域活性需要在酸性残基数和芳香族及亮氨酸残基数之间取得平衡。这些发现促使我们设计了一种酸性激活结构域预测器,可以在人类蛋白质组中扫描富含芳香族和亮氨酸残基的区域中的簇,这些簇被嵌入到高酸性区域中。该预测器可以识别已知的激活结构域,并准确预测以前未识别的激活结构域。我们的结果支持了一种灵活的酸性暴露模型,其中酸性残基使疏水性基序溶解,从而使它们能够与共激活因子相互作用。本文透明同行评审过程的记录包含在补充信息中。