The School of Pharmacy, University College London, London, UK.
J Biol Chem. 2021 Jan-Jun;296:100553. doi: 10.1016/j.jbc.2021.100553. Epub 2021 Mar 17.
The determination of the double helical structure of DNA in 1953 remains the landmark event in the development of modern biological and biomedical science. This structure has also been the starting point for the determination of some 2000 DNA crystal structures in the subsequent 68 years. Their structural diversity has extended to the demonstration of sequence-dependent local structure in duplex DNA, to DNA bending in short and long sequences and in the DNA wound round the nucleosome, and to left-handed duplex DNAs. Beyond the double helix itself, in circumstances where DNA sequences are or can be induced to unwind from being duplex, a wide variety of topologies and forms can exist. Quadruplex structures, based on four-stranded cores of stacked G-quartets, are prevalent though not randomly distributed in the human and other genomes and can play roles in transcription, translation, and replication. Yet more complex folds can result in DNAs with extended tertiary structures and enzymatic/catalytic activity. The Protein Data Bank is the depository of all these structures, and the resource where structures can be critically examined and validated, as well as compared one with another to facilitate analysis of conformational and base morphology features. This review will briefly survey the major structural classes of DNAs and illustrate their significance, together with some examples of how the use of the Protein Data Bank by for example, data mining, has illuminated DNA structural concepts.
1953 年 DNA 双螺旋结构的确立仍然是现代生物和生物医学科学发展中的标志性事件。在随后的 68 年中,这一结构也成为了大约 2000 个 DNA 晶体结构确定的起点。它们的结构多样性已经扩展到证明双链 DNA 中序列依赖性的局部结构、短序列和长序列中的 DNA 弯曲以及 DNA 缠绕在核小体上,以及左手双链 DNA。除了双螺旋本身之外,在 DNA 序列被或可以诱导从双链解开的情况下,存在着各种各样的拓扑结构和形式。四链体结构基于堆叠的 G-四联体的四线核心,在人类和其他基因组中普遍存在,但并非随机分布,并且可以在转录、翻译和复制中发挥作用。更复杂的折叠可以导致具有扩展的三级结构和酶/催化活性的 DNA。蛋白质数据库是所有这些结构的存储库,也是可以对其进行严格检查和验证的资源,以及可以相互比较以促进构象和碱基形态特征分析的资源。这篇综述将简要地考察 DNA 的主要结构类别,并说明它们的重要性,同时还将举例说明如何通过例如数据挖掘等方式使用蛋白质数据库来阐明 DNA 结构概念。