Molecular Biophysics Unit, Indian Institute of Science, Bangalore-560012, India.
Structure. 2012 Feb 8;20(2):371-81. doi: 10.1016/j.str.2011.11.021.
A major bottleneck in protein structure prediction is the selection of correct models from a pool of decoys. Relative activities of ∼1,200 individual single-site mutants in a saturation library of the bacterial toxin CcdB were estimated by determining their relative populations using deep sequencing. This phenotypic information was used to define an empirical score for each residue (RankScore), which correlated with the residue depth, and identify active-site residues. Using these correlations, ∼98% of correct models of CcdB (RMSD ≤ 4Å) were identified from a large set of decoys. The model-discrimination methodology was further validated on eleven different monomeric proteins using simulated RankScore values. The methodology is also a rapid, accurate way to obtain relative activities of each mutant in a large pool and derive sequence-structure-function relationships without protein isolation or characterization. It can be applied to any system in which mutational effects can be monitored by a phenotypic readout.
蛋白质结构预测的一个主要瓶颈是从诱饵库中选择正确的模型。通过使用深度测序确定饱和文库中细菌毒素 CcdB 的约 1200 个单个单点突变体的相对丰度,来估计它们的相对活性。该表型信息用于为每个残基定义经验得分(RankScore),该得分与残基深度相关,并确定活性位点残基。使用这些相关性,从大量诱饵中鉴定出 CcdB 的约 98%的正确模型(RMSD≤4Å)。该模型区分方法还使用模拟 RankScore 值在十一种不同的单体蛋白上进行了验证。该方法也是一种快速、准确的方法,可在不进行蛋白质分离或表征的情况下,从大量库中获得每个突变体的相对活性,并得出序列-结构-功能关系。它可以应用于任何可以通过表型读数监测突变影响的系统。