RCSB Protein Data Bank, Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854-8076.
Biopolymers. 2014 Jun;101(6):659-68. doi: 10.1002/bip.22434.
With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called "group" was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently.
随着蛋白质数据库 (PDB) 中分子数量和种类的不断积累,有时需要对其表示形式进行审查和改进。全球蛋白质数据库 (wwPDB) 合作伙伴定期更新结构数据表示的各个方面,以提高档案的完整性和一致性。这里描述的修复工作侧重于改进肽类抑制剂和抗生素分子的表示形式,以便于识别和分析。在 1000 多个 PDB 条目中识别出肽类抑制剂或抗生素,对其进行系统审查,并表示为具有聚合物序列的肽或单个成分。对于大多数单成分分子,其肽类成分被捕获在称为子成分序列的新表示形式中。为了表示由多个聚合物和非聚合物成分组成的复杂肽类抗生素和抑制剂,开发了一个新的概念,称为“组”。此外,还开发了一个参考字典,其中包含有关这些肽类分子的详细信息,以帮助注释、识别和分析。基于在这种修复中获得的经验,开发了指南、程序和工具,以准确一致地注释包含肽类抑制剂和抗生素的新沉积。