Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892-0520, USA.
Protein Sci. 2010 Feb;19(2):349-56. doi: 10.1002/pro.303.
Proteins with high-sequence identity but very different folds present a special challenge to sequence-based protein structure prediction methods. In particular, a 56-residue three-helical bundle protein (GA(95)) and an alpha/beta-fold protein (GB(95)), which share 95% sequence identity, were targets in the CASP-8 structure prediction contest. With only 12 out of 300 submitted server-CASP8 models for GA(95) exhibiting the correct fold, this protein proved particularly challenging despite its small size. Here, we demonstrate that the information contained in NMR chemical shifts can readily be exploited by the CS-Rosetta structure prediction program and yields adequate convergence, even when input chemical shifts are limited to just amide (1)H(N) and (15)N or (1)H(N) and (1)H(alpha) values.
具有高序列同一性但折叠结构非常不同的蛋白质对基于序列的蛋白质结构预测方法提出了特殊挑战。特别是,一个 56 个残基的三螺旋束蛋白(GA(95))和一个 α/β 折叠蛋白(GB(95))是 CASP-8 结构预测竞赛的目标。在提交的 300 个服务器-CASP8 模型中,只有 12 个模型对 GA(95)的正确折叠结构进行了预测,尽管该蛋白质体积较小,但它特别具有挑战性。在这里,我们证明了 NMR 化学位移中包含的信息可以通过 CS-Rosetta 结构预测程序轻松利用,即使输入化学位移仅限于酰胺(1)H(N)和(15)N 或(1)H(N)和(1)H(alpha)值,也能得到足够的收敛。