Suppr超能文献

人类Y染色体无缝组装中G-四链体形成序列的完整分析。

Complete analysis of G-quadruplex forming sequences in the gapless assembly of human chromosome Y.

作者信息

Dobrovolná Michaela, Mergny Jean-Louis, Brázda Václav

机构信息

Institute of Biophysics, Czech Academy of Sciences, Královopolská 135, 612 65, Brno, Czech Republic; Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00, Brno, Czech Republic.

Institute of Biophysics, Czech Academy of Sciences, Královopolská 135, 612 65, Brno, Czech Republic; Laboratoire D'Optique et Biosciences, Ecole Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, 91120, Palaiseau, France.

出版信息

Biochimie. 2025 Feb;229:49-57. doi: 10.1016/j.biochi.2024.10.007. Epub 2024 Oct 9.

Abstract

Recent advancements have finally delivered a complete human genome assembly, including the elusive Y chromosome. This accomplishment closes a significant knowledge gap. Prior efforts were hampered by challenges in sequencing repetitive DNA structures such as direct and inverted repeats. We used the G4Hunter algorithm to analyze the presence of G-quadruplex forming sequences (G4s) within the current human reference genome (GRCh38) and the new telomere-to-telomere (T2T) Y chromosome assemblies. This analysis served a dual purpose: identifying the location of potential G4s within the genomes and exploring their association with functionally annotated sequences. Compared to GRCh38, the T2T assembly exhibited a significantly higher prevalence of G-quadruplex forming sequences. Notably, these repeats were abundantly located around precursor RNA, exons, genes, and within protein binding sites. This remarkable co-occurrence of G4-forming sequences with these critical regulatory regions suggests their role in fundamental DNA regulation processes. Our findings indicate that the current human reference genome significantly underestimated the number of G4s, potentially overlooking their functional importance.

摘要

最近的进展终于完成了一个完整的人类基因组组装,包括难以捉摸的Y染色体。这一成就填补了一个重大的知识空白。先前的努力受到测序重复DNA结构(如正向和反向重复)挑战的阻碍。我们使用G4Hunter算法分析了当前人类参考基因组(GRCh38)和新的端粒到端粒(T2T)Y染色体组装中形成G-四链体的序列(G4s)的存在情况。该分析有双重目的:确定基因组中潜在G4s的位置,并探索它们与功能注释序列的关联。与GRCh38相比,T2T组装中形成G-四链体的序列的发生率显著更高。值得注意的是,这些重复序列大量位于前体RNA、外显子、基因周围以及蛋白质结合位点内。G4形成序列与这些关键调控区域的显著共现表明它们在基本DNA调控过程中的作用。我们的研究结果表明,当前的人类参考基因组大大低估了G4s的数量,可能忽略了它们的功能重要性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验