Holm L, Sander C
Protein Design Group, European Molecular Biology Laboratory, Heidelberg, Germany.
Proteins. 1994 Jul;19(3):165-73. doi: 10.1002/prot.340190302.
The number of protein structures known in atomic detail has increased from one in 1960 (Kendrew, J.C., Strandberg, B.E., Hart, R.G., Davies, D.R., Phillips, D.C., Shore, V.C. Nature (London) 185:422-427, 1960) to more than 1000 in 1994. The rate at which new structures are being published exceeds one a day as a result of recent advances in protein engineering, crystallography, and spectroscopy. More and more frequently, a newly determined structure is similar in fold to a known one, even when no sequence similarity is detectable. A new generation of computer algorithms has now been developed that allows routine comparison of a protein structure with the database of all known structures. Such structure database searches are already used daily and they are beginning to rival sequence database searches as a tool for discovering biologically interesting relationships.
已知原子细节的蛋白质结构数量已从1960年的1个(肯德鲁,J.C.,斯特兰德伯格,B.E.,哈特,R.G.,戴维斯,D.R.,菲利普斯,D.C.,肖尔,V.C.《自然》(伦敦)185:422 - 427,1960)增加到1994年的1000多个。由于蛋白质工程、晶体学和光谱学的最新进展,新结构的发表速度超过每天1个。越来越频繁地,即使在检测不到序列相似性的情况下,新确定的结构在折叠方式上也与已知结构相似。现在已经开发出新一代计算机算法,能够将蛋白质结构与所有已知结构的数据库进行常规比较。这种结构数据库搜索已经每天都在使用,并且作为发现生物学上有趣关系的工具,它们开始与序列数据库搜索相媲美。