Grabowski Marek, Niedzialkowska Ewa, Zimmerman Matthew D, Minor Wladek
Department of Molecular Physiology and Biological Physics, University of Virginia School of Medicine, 1340 Jefferson Park Avenue, Jordan Hall, Room 4223, Charlottesville, VA, 22908, USA.
Jerzy Haber Institute of Catalysis and Surface Chemistry, Polish Academy of Sciences, Niezapominajek 8, 30-239, Kraków, Poland.
J Struct Funct Genomics. 2016 Mar;17(1):1-16. doi: 10.1007/s10969-016-9201-5. Epub 2016 Mar 2.
The period 2000-2015 brought the advent of high-throughput approaches to protein structure determination. With overall funding on the order of $2 billion (in 2010 dollars), the structural genomics (SG) consortia established worldwide have developed pipelines for target selection, protein production, sample preparation, crystallization, and structure determination by X-ray crystallography and NMR. These efforts resulted in the determination of over 13,500 protein structures, mostly from unique protein families, and increased the structural coverage of the expanding protein universe. SG programs contributed over 4,400 publications to the scientific literature. The NIH-funded Protein Structure Initiatives alone have produced over 2,000 scientific publications, which to date have attracted more than 93,000 citations. Software and database developments that were necessary to handle high-throughput structure determination workflows have led to structures of better quality and improved integrity of the associated data. Organized and accessible data have a positive impact on the reproducibility of scientific experiments. Most of the experimental data generated by the SG centers are freely available to the community and have been utilized by scientists in various fields of research. SG projects have created, improved, streamlined, and validated many protocols for protein production and crystallization, data collection, and functional analysis, significantly benefiting biological and biomedical research.