Department of Genetic Medicine and Development, University of Geneva Medical Faculty, 1211 Geneva, Switzerland.
Foundation Campus Biotech, 1202 Geneva, Switzerland.
Genome Res. 2022 Apr;32(4):599-607. doi: 10.1101/gr.275350.121. Epub 2022 Mar 31.
The complete, ungapped sequence of the short arms of human acrocentric chromosomes (SAACs) is still unknown almost 20 years after the near completion of the Human Genome Project. Yet these short arms of Chromosomes 13, 14, 15, 21, and 22 contain the ribosomal DNA (rDNA) genes, which are of paramount importance for human biology. The sequences of SAACs show an extensive variation in the copy number of the various repetitive elements, the full extent of which is currently unknown. In addition, the full spectrum of repeated sequences, their organization, and the low copy number functional elements are also unknown. The Telomere-to-Telomere (T2T) Project using mainly long-read sequence technology has recently completed the assembly of the genome from a hydatidiform mole, CHM13, and has thus established a baseline reference for further studies on the organization, variation, functional annotation, and impact in human disorders of all the previously unknown genomic segments, including the SAACs. The publication of the initial results of the T2T Project will update and improve the reference genome for a better understanding of the evolution and function of the human genome.
近 20 年来,人类着丝粒染色体短臂(SAACs)的完整无缺口序列仍然未知,尽管人类基因组计划已接近完成。然而,这些染色体 13、14、15、21 和 22 的短臂包含核糖体 DNA(rDNA)基因,这些基因对人类生物学至关重要。SAACs 的序列显示出各种重复元件的拷贝数存在广泛的变异,目前尚不完全清楚其全部程度。此外,重复序列的全貌、它们的组织以及低拷贝数功能元件也未知。主要使用长读序列技术的端粒到端粒(T2T)项目最近完成了一个葡萄胎 CHM13 基因组的组装,从而为进一步研究所有以前未知的基因组片段(包括 SAACs)的组织、变异、功能注释以及在人类疾病中的影响建立了一个基线参考。T2T 项目的初步结果的发表将更新和改进参考基因组,以更好地理解人类基因组的进化和功能。