Pusey Marc L, Liu Zhi-Jie, Tempel Wolfram, Praissman Jeremy, Lin Dawei, Wang Bi-Cheng, Gavira José A, Ng Joseph D
Biophysics SD48, NASA/MSFC Huntsville, AL 35812, USA.
Prog Biophys Mol Biol. 2005 Jul;88(3):359-86. doi: 10.1016/j.pbiomolbio.2004.07.011.
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful of structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high-rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain "low-hanging fruit" protein structures. We review the practical aspects of today's high-throughput manipulations and discuss the challenges in fast pace protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits. Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG).
结构基因组中心和联盟的共同目标是尽快解析出众多源自已知基因组序列的重组蛋白的三维结构。由于X射线晶体学是获取大分子原子分辨率的首要方法,因此限制步骤在于获得可用于结构测定的蛋白质晶体。近年来已开发出高通量方法,使用自动化设备、商业化试剂盒和整合方案来快速克隆、表达、纯化、结晶并确定蛋白质基因产物的三维结构。然而,与纯化的蛋白质总数相比,大多数结构基因组团队获得的蛋白质结构的平均数量非常低。随着从生命三界的不同生物体中获得越来越多的完整基因组序列,仅研究那些能够结晶且其结构易于获得的蛋白质。因此,数量惊人的基因组蛋白仍未得到研究。在高通量过程的时代,分子生物学、蛋白质化学和结晶方面的传统方法因自动化和流水线操作而黯然失色。蛋白质晶体和结构的高速生产需求阻碍了在实验执行中使用更具智慧的策略和创造性方法。蛋白质化学和结晶方面的基本原理和个人经验仅被最少地利用以获得“低垂之果”的蛋白质结构。我们回顾了当今高通量操作的实际情况,并讨论了快速蛋白质结晶中的挑战以及晶体学工具。从低通量策略中获得的信息可以改进结构基因组流水线,这可能有助于我们获取更丰硕的成果。本文报道了东南结构基因组学协作组(SECSG)在该领域的最新进展实例。