Page Rebecca, Grzechnik Slawomir K, Canaves Jaume M, Spraggon Glen, Kreusch Andreas, Kuhn Peter, Stevens Raymond C, Lesley Scott A
Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA.
Acta Crystallogr D Biol Crystallogr. 2003 Jun;59(Pt 6):1028-37. doi: 10.1107/s0907444903007790. Epub 2003 May 23.
As the field of structural genomics continues to grow and new technologies are developed, novel strategies are needed to efficiently crystallize large numbers of protein targets, thus increasing output, not just throughput [Chayen & Saridakis (2002). Acta Cryst. D58, 921-927]. One strategy, developed for the high-throughput structure determination of the Thermotoga maritima proteome, is to quickly determine which proteins have a propensity for crystal formation followed by focused SeMet-incorporated protein crystallization attempts. This experimental effort has resulted in over 320 000 individual crystallization experiments. As such, it has provided one of the most extensive systematic data sets of commonly used crystallization conditions against a wide range of proteins to date. Analysis of this data shows that many of the original screening conditions are redundant, as all of the T. maritima proteins that crystallize readily could be identified using just 23% of the original conditions. It also shows that proteins that contain selenomethionine and are more extensively purified often crystallize in distinctly different conditions from those of their native less pure counterparts. Most importantly, it shows that the two-tiered strategy employed here is extremely successful for predicting which proteins will readily crystallize, as greater than 99% of the proteins identified as having a propensity to crystallize under non-optimal native conditions did so again as selenomethionine derivatives during the focused crystallization trials. This crystallization strategy can be adopted for both large-scale genomics programs and individual protein studies with multiple constructs and has the potential to significantly accelerate future crystallographic efforts.
随着结构基因组学领域不断发展以及新技术的开发,需要新的策略来高效地使大量蛋白质靶标结晶,从而提高产量,而不仅仅是通量[Chayen & Saridakis (2002). Acta Cryst. D58, 921 - 927]。为嗜热栖热菌蛋白质组的高通量结构测定开发的一种策略是,快速确定哪些蛋白质具有结晶倾向,随后进行聚焦的含硒代甲硫氨酸蛋白质结晶尝试。这项实验工作已产生超过32万个单独的结晶实验。因此,它提供了迄今为止针对广泛蛋白质的常用结晶条件最广泛的系统数据集之一。对该数据的分析表明,许多原始筛选条件是多余的,因为仅使用23%的原始条件就能鉴定出所有容易结晶的嗜热栖热菌蛋白质。分析还表明,含有硒代甲硫氨酸且纯化程度更高的蛋白质,其结晶条件通常与其天然的纯度较低的对应物明显不同。最重要的是,分析表明这里采用的两级策略在预测哪些蛋白质将容易结晶方面极其成功,因为在聚焦结晶试验期间,超过99%在非最佳天然条件下被鉴定为具有结晶倾向的蛋白质,作为硒代甲硫氨酸衍生物再次成功结晶。这种结晶策略可用于大规模基因组学计划以及对具有多种构建体的单个蛋白质的研究,并且有可能显著加速未来的晶体学研究工作。