Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
Acc Chem Res. 2021 Jan 5;54(1):46-56. doi: 10.1021/acs.accounts.0c00431. Epub 2020 Dec 21.
The story of the non-duplex DNA form known as the G-quadruplex (G4) has traversed a winding path. From initial skepticism followed by debate to a surge in interest, the G4 story intertwines many threads. Starting with computational predictions of a gene regulatory role, which now include epigenetic functions, our group was involved in many of these advances along with many other laboratories. Following a brief background, set in the latter half of the last century when the concept of the G4 as a structure took ground, here we account the developments. This is through a lens that though focused on our groups' research presents work from many other groups that played significant roles. Together these provide a broad perspective to the G4 story. Initially we were intrigued on seeing potential G4 (pG4)-forming sequences, then known to be found primarily at the telomeres and immunoglobin switch regions, occurring throughout the genome and being particularly prevalent in promoters of bacteria. We further observed that pG4s were not only prevalent but also conserved through evolution in promoters of human, chimpanzee, mouse and rat genomes. This was between 2005 and 2007. Encouraged by these partly and partly in response to the view held by many that genome-wide presence of G4s were genomic "accidents", the focus shifted to seeking experimental evidence.In the next year, 2008, two independent findings showed promise. First, on treating human cancer cells with G4-binding ligands, we observed widespread change in gene expression. Second, our search for the missing G4-specific transcription factor, without which, importantly, G4s in promoters posed only half the story, yielded results. We determined how NM23-H2 (also known as NME2 or NDPK-B) interacts with G4s and how interaction of NM23-H2 with a G4 in the promoter of the oncogene was important for regulation of transcription. NM23-H2, and subsequently many other similar factors discovered by multiple groups, is possibly giving shape to what might be the "G4-transcriptome". Later, a close look at NM23-H2-G4 interaction in regulation of the human reverse transcriptase gene () revealed the role of G4s in local epigenetic modifications. Meanwhile work from others showed how G4s impact histone modifications following replication. Together these show the intrinsic role of DNA sequence, through formation of DNA structure, in epigenetics.More recent work, however, was waiting to reveal aspects that tend to bring forth a completely new understanding of G4s. We observed that the telomere-repeat-binding-factor-2 (TRF2), known canonically to be telomere-associated, binds extensively outside telomeres throughout the genome. Moreover, a large fraction of the non-telomeric TRF2 sites comprise G4s. Second, the extent of non-telomeric TRF2 binding at promoters was dependent on telomere length. Thereby TRF2-induced epigenetic gene regulation was telomere-dependent. Together these implicate underlying connections that show signs of addressing an intriguing unanswered question that takes us back to the beginning: Why are G4s prevalent in two distinct regions, the telomeres and gene promoters?
非双螺旋 DNA 形式的 G-四链体 (G4) 的故事经历了曲折。从最初的怀疑到后来的争论,再到兴趣的激增,G4 的故事交织着许多线索。从最初对基因调控作用的计算预测开始,现在包括表观遗传功能,我们的小组与许多其他实验室一起参与了这些进展。在简要介绍了上个世纪后半叶 G4 作为一种结构的概念得到确立的背景之后,我们在这里讲述了这些进展。这是通过一个视角来呈现的,虽然聚焦于我们小组的研究,但也展示了许多其他小组的工作,这些工作发挥了重要作用。这些共同为 G4 的故事提供了一个广泛的视角。最初,我们对看到潜在的 G4(pG4)形成序列感到好奇,这些序列最初主要存在于端粒和免疫球蛋白开关区,现在在整个基因组中都有发现,并且在细菌启动子中尤为普遍。我们还观察到,pG4 不仅普遍存在,而且在人类、黑猩猩、小鼠和大鼠基因组的启动子中通过进化得到了保守。这是在 2005 年至 2007 年之间。部分受到这些观察结果的鼓舞,部分是为了回应许多人的观点,即基因组范围内 G4 的存在是基因组“意外”,我们的注意力转向寻找实验证据。次年,2008 年,两项独立的发现带来了希望。首先,在用 G4 结合配体处理人类癌细胞时,我们观察到广泛的基因表达变化。其次,我们寻找缺失的 G4 特异性转录因子的工作取得了成果,没有这个转录因子,重要的是,启动子中的 G4 只占一半的故事。我们确定了 NM23-H2(也称为 NME2 或 NDPK-B)如何与 G4 相互作用,以及 NM23-H2 与致癌基因启动子中的 G4 相互作用如何调节转录。NM23-H2,以及随后许多其他类似的因子被多个小组发现,可能正在形成所谓的“G4 转录组”。后来,我们仔细观察了 NM23-H2-G4 相互作用在调节人类逆转录酶基因 () 中的作用,揭示了 G4 在局部表观遗传修饰中的作用。与此同时,其他人的工作表明 G4 如何影响复制后的组蛋白修饰。这些共同表明,通过 DNA 结构的形成,DNA 序列在表观遗传学中具有内在作用。然而,最近的工作正在等待揭示一些方面,这些方面往往会带来对 G4 的完全新的理解。我们观察到,端粒重复结合因子-2(TRF2),通常与端粒相关,在整个基因组中端粒以外的广泛区域结合。此外,非端粒 TRF2 结合位点的很大一部分由 G4 组成。其次,启动子处非端粒 TRF2 结合的程度依赖于端粒的长度。因此,TRF2 诱导的表观遗传基因调控依赖于端粒。这些共同暗示了潜在的联系,表明正在解决一个有趣的未回答的问题,这个问题让我们回到了起点:为什么 G4 会在两个截然不同的区域,端粒和基因启动子中普遍存在?