Suppr超能文献

人类 Y 染色体的完整序列。

The complete sequence of a human Y chromosome.

机构信息

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Oxford Nanopore Technologies Inc., Oxford, UK.

出版信息

Nature. 2023 Sep;621(7978):344-354. doi: 10.1038/s41586-023-06457-y. Epub 2023 Aug 23.

Abstract

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a previous assembly of the CHM13 genome and mapped available population variation, clinical variants and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes.

摘要

人类 Y 染色体因其复杂的重复结构而难以测序和组装,其中包括长回文序列、串联重复和片段重复。因此,GRCh38 参考序列中缺失了超过一半的 Y 染色体,它仍然是最后一个完成的人类染色体。在这里,端粒到端粒(T2T)联盟展示了来自 HG002 基因组的人类 Y 染色体的完整 62,460,029 碱基对序列(T2T-Y),该序列纠正了 GRCh38-Y 中的多个错误,并向参考序列添加了超过 3000 万个碱基对,展示了 TSPY、DAZ 和 RBMY 基因家族的完整扩增子结构;41 个额外的蛋白质编码基因,主要来自 TSPY 家族;以及异染色质 Yq12 区域中人类卫星 1 和 3 块的交替模式。我们将 T2T-Y 与之前的 CHM13 基因组组装相结合,并将可用的群体变异、临床变异和功能基因组学数据映射到该序列中,从而为所有 24 个人类染色体生成了一个完整且全面的参考序列。

相似文献

1
The complete sequence of a human Y chromosome.
Nature. 2023 Sep;621(7978):344-354. doi: 10.1038/s41586-023-06457-y. Epub 2023 Aug 23.
2
A complete reference genome improves analysis of human genetic variation.
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
3
The complete sequence of a human genome.
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
4
Genomic characterization of large heterochromatic gaps in the human genome assembly.
PLoS Comput Biol. 2014 May 15;10(5):e1003628. doi: 10.1371/journal.pcbi.1003628. eCollection 2014 May.
5
Centromere reference models for human chromosomes X and Y satellite arrays.
Genome Res. 2014 Apr;24(4):697-707. doi: 10.1101/gr.159624.113. Epub 2014 Feb 5.
8
Telomere-to-telomere assembly of a complete human X chromosome.
Nature. 2020 Sep;585(7823):79-84. doi: 10.1038/s41586-020-2547-7. Epub 2020 Jul 14.
9
10
The origin and evolution of human ampliconic gene families and ampliconic structure.
Genome Res. 2007 Apr;17(4):441-50. doi: 10.1101/gr.5734907. Epub 2006 Dec 21.

引用本文的文献

1
DNA methylation influences human centromere positioning and function.
Nat Genet. 2025 Sep 4. doi: 10.1038/s41588-025-02324-w.
2
Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows.
Cancers (Basel). 2025 Jul 26;17(15):2481. doi: 10.3390/cancers17152481.
4
Efficiency of Learned Indexes on Genome Spectra.
bioRxiv. 2025 Jul 14:2025.07.10.664199. doi: 10.1101/2025.07.10.664199.
5
Detecting Foldback Artifacts in Long Reads.
bioRxiv. 2025 Jul 18:2025.07.15.664946. doi: 10.1101/2025.07.15.664946.
6
Long-read sequencing of trios reveals increased germline and postzygotic mutation rates in repetitive DNA.
bioRxiv. 2025 Jul 19:2025.07.18.665621. doi: 10.1101/2025.07.18.665621.
7
Pangenome discovery of missing autism variants.
medRxiv. 2025 Jul 22:2025.07.21.25331932. doi: 10.1101/2025.07.21.25331932.
8
An X-linked sex determination mechanism in cannabis and hop.
bioRxiv. 2025 Jul 24:2024.12.09.627636. doi: 10.1101/2024.12.09.627636.
9
A draft UAE-based Arab pangenome reference.
Nat Commun. 2025 Jul 24;16(1):6747. doi: 10.1038/s41467-025-61645-w.
10
Complex genetic variation in nearly complete human genomes.
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09140-6.

本文引用的文献

1
A draft human pangenome reference.
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
2
Jasmine and Iris: population-scale structural variant comparison and analysis.
Nat Methods. 2023 Mar;20(3):408-417. doi: 10.1038/s41592-022-01753-3. Epub 2023 Jan 19.
3
Fast and accurate mapping of long reads to complete genome assemblies with VerityMap.
Genome Res. 2022 Nov-Dec;32(11-12):2107-2118. doi: 10.1101/gr.276871.122. Epub 2022 Nov 15.
4
Semi-automated assembly of high-quality diploid human reference genomes.
Nature. 2022 Nov;611(7936):519-531. doi: 10.1038/s41586-022-05325-5. Epub 2022 Oct 19.
5
Whole-genome sequence and assembly of the Javan gibbon (Hylobates moloch).
J Hered. 2023 Mar 16;114(1):35-43. doi: 10.1093/jhered/esac043.
6
A Map of 3' DNA Transduction Variants Mediated by Non-LTR Retroelements on 3202 Human Genomes.
Biology (Basel). 2022 Jul 8;11(7):1032. doi: 10.3390/biology11071032.
7
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
9
Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders.
Cell. 2022 May 26;185(11):1986-2005.e26. doi: 10.1016/j.cell.2022.04.017. Epub 2022 May 6.
10
A classical revival: Human satellite DNAs enter the genomics era.
Semin Cell Dev Biol. 2022 Aug;128:2-14. doi: 10.1016/j.semcdb.2022.04.012. Epub 2022 Apr 27.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验