Jaskolski Mariusz, Dauter Zbigniew, Shabalin Ivan G, Gilski Miroslaw, Brzezinski Dariusz, Kowiel Marcin, Rupp Bernhard, Wlodawer Alexander
Department of Crystallography, Faculty of Chemistry, A. Mickiewicz University, Poznan, Poland.
Center for Biocrystallographic Research, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland.
IUCrJ. 2021 Feb 9;8(Pt 2):238-256. doi: 10.1107/S2052252521001159. eCollection 2021 Mar 1.
The appearance at the end of 2019 of the new SARS-CoV-2 coronavirus led to an unprecedented response by the structural biology community, resulting in the rapid determination of many hundreds of structures of proteins encoded by the virus. As part of an effort to analyze and, if necessary, remediate these structures as deposited in the Protein Data Bank (PDB), this work presents a detailed analysis of 81 crystal structures of the main protease 3CL, an important target for the design of drugs against COVID-19. The structures of the unliganded enzyme and its complexes with a number of inhibitors were determined by multiple research groups using different experimental approaches and conditions; the resulting structures span 13 different polymorphs representing seven space groups. The structures of the enzyme itself, all determined by molecular replacement, are highly similar, with the exception of one polymorph with a different inter-domain orientation. However, a number of complexes with bound inhibitors were found to pose significant problems. Some of these could be traced to faulty definitions of geometrical restraints for ligands and to the general problem of a lack of such information in the PDB depositions. Several problems with ligand definition in the PDB itself were also noted. In several cases extensive corrections to the models were necessary to adhere to the evidence of the electron-density maps. Taken together, this analysis of a large number of structures of a single, medically important protein, all determined within less than a year using modern experimental tools, should be useful in future studies of other systems of high interest to the biomedical community.
2019年底新型SARS-CoV-2冠状病毒的出现引发了结构生物学界前所未有的反应,使得该病毒编码的数百种蛋白质结构得以迅速确定。作为对存于蛋白质数据库(PDB)中的这些结构进行分析并在必要时进行修正工作的一部分,本文对主要蛋白酶3CL的81个晶体结构进行了详细分析,3CL是抗COVID-19药物设计的一个重要靶点。多个研究小组使用不同的实验方法和条件确定了无配体酶及其与多种抑制剂复合物的结构;所得结构涵盖代表七个空间群的13种不同的多晶型。除了一个结构域间取向不同的多晶型外,所有通过分子置换确定的酶自身结构高度相似。然而,发现一些结合抑制剂的复合物存在重大问题。其中一些问题可追溯到配体几何约束定义错误以及PDB存档中普遍缺乏此类信息的问题。还指出了PDB本身在配体定义方面的几个问题。在一些情况下,需要对模型进行大量修正以符合电子密度图的证据。总之,对单个具有医学重要性的蛋白质的大量结构进行分析,所有这些结构均在不到一年的时间内使用现代实验工具确定,这对于生物医学领域其他备受关注的系统的未来研究应具有参考价值。