Cohen F E, Abarbanel R M, Kuntz I D, Fletterick R J
Biochemistry. 1986 Jan 14;25(1):266-75. doi: 10.1021/bi00349a037.
We extend the use of amino acid sequence patterns [Cohen, F.E., Abarbanel, R. M., Kuntz, I. D., & Fletterick, R. J. (1983) Biochemistry 22, 4894-4904] to the identification of turns in globular proteins. The approach uses a conservative strategy, combined with a hierarchical search (strongest patterns first) and length-dependent masking, to achieve high accuracy (95%) on a test set of proteins of known structure. Applying the same procedure to homologous families gives a 90% success rate. Straightforward changes are suggested to improve the predictive power. The computer program, written in Lisp, provides a general pattern-recognition language well suited for a number of investigations of protein and nucleic acid sequences.
我们将氨基酸序列模式的应用[科恩,F.E.,阿巴班内尔,R.M.,昆茨,I.D.,&弗莱泰里克,R.J.(1983年)《生物化学》22卷,4894 - 4904页]扩展到球状蛋白质中转角的识别。该方法采用保守策略,结合分层搜索(最强模式优先)和长度依赖性屏蔽,在一组已知结构的蛋白质测试集上实现了高精度(95%)。将相同程序应用于同源家族的成功率为90%。文中还提出了一些直接的改进建议以提高预测能力。用Lisp编写的计算机程序提供了一种通用的模式识别语言,非常适合对蛋白质和核酸序列进行多种研究。