Department of Chemistry, Columbia University, New York, United States.
Department of Biological Sciences, Columbia University, New York, United States.
Elife. 2023 Mar 16;12:e82345. doi: 10.7554/eLife.82345.
Tyrosine kinases and SH2 (phosphotyrosine recognition) domains have binding specificities that depend on the amino acid sequence surrounding the target (phospho)tyrosine residue. Although the preferred recognition motifs of many kinases and SH2 domains are known, we lack a quantitative description of sequence specificity that could guide predictions about signaling pathways or be used to design sequences for biomedical applications. Here, we present a platform that combines genetically encoded peptide libraries and deep sequencing to profile sequence recognition by tyrosine kinases and SH2 domains. We screened several tyrosine kinases against a million-peptide random library and used the resulting profiles to design high-activity sequences. We also screened several kinases against a library containing thousands of human proteome-derived peptides and their naturally-occurring variants. These screens recapitulated independently measured phosphorylation rates and revealed hundreds of phosphosite-proximal mutations that impact phosphosite recognition by tyrosine kinases. We extended this platform to the analysis of SH2 domains and showed that screens could predict relative binding affinities. Finally, we expanded our method to assess the impact of non-canonical and post-translationally modified amino acids on sequence recognition. This specificity profiling platform will shed new light on phosphotyrosine signaling and could readily be adapted to other protein modification/recognition domains.
酪氨酸激酶和 SH2(磷酸酪氨酸识别)结构域具有依赖于靶标(磷酸化)酪氨酸残基周围氨基酸序列的结合特异性。尽管许多激酶和 SH2 结构域的首选识别基序是已知的,但我们缺乏对序列特异性的定量描述,这种描述可以指导关于信号通路的预测,或者可用于设计用于生物医学应用的序列。在这里,我们提出了一个结合遗传编码肽文库和深度测序的平台,以分析酪氨酸激酶和 SH2 结构域的序列识别。我们针对一个包含 100 万个随机肽的文库筛选了几种酪氨酸激酶,并使用所得图谱设计了高活性序列。我们还针对包含数千个人类蛋白质组衍生肽及其天然变体的文库筛选了几种激酶。这些筛选重现了独立测量的磷酸化速率,并揭示了数百个影响酪氨酸激酶磷酸化位点识别的磷酸化位点近端突变。我们将该平台扩展到 SH2 结构域的分析中,并表明筛选可以预测相对结合亲和力。最后,我们扩展了我们的方法来评估非典型和翻译后修饰的氨基酸对序列识别的影响。这种特异性分析平台将为磷酸酪氨酸信号转导提供新的见解,并可轻松适用于其他蛋白质修饰/识别结构域。