• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

完整的参考基因组提高了人类遗传变异分析的能力。

A complete reference genome improves analysis of human genetic variation.

机构信息

Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.

Department of Biology, Johns Hopkins University, Baltimore, MD, USA.

出版信息

Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.

DOI:10.1126/science.abl3533
PMID:35357935
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9336181/
Abstract

Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pairs of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for clinical and functional study. We show how this reference universally improves read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively. We identify hundreds of thousands of variants per sample in previously unresolved regions, showcasing the promise of the T2T-CHM13 reference for evolutionary and biomedical discovery. Simultaneously, this reference eliminates tens of thousands of spurious variants per sample, including reduction of false positives in 269 medically relevant genes by up to a factor of 12. Because of these improvements in variant discovery coupled with population and functional genomic resources, T2T-CHM13 is positioned to replace GRCh38 as the prevailing reference for human genetics.

摘要

与之前的版本相比,端粒到端粒 CHM13 基因组增加了近 2 亿个碱基对的序列,纠正了数千个结构错误,并为人类基因组的最复杂区域解锁,以进行临床和功能研究。我们展示了这种参考如何普遍提高了对分别使用短读长和长读长测序的 3202 个和 17 个全球多样化样本的读取映射和变异调用。我们在以前无法解决的区域中为每个样本识别出数十万种变体,展示了 T2T-CHM13 参考在进化和生物医学发现方面的潜力。同时,该参考消除了每个样本中数万种虚假变体,包括在 269 个与医学相关的基因中,将假阳性减少了高达 12 倍。由于在变体发现方面的这些改进,以及人群和功能基因组资源,T2T-CHM13 有望取代 GRCh38 成为人类遗传学的主要参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/e8a1e83b449b/nihms-1814268-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/b436dcf67258/nihms-1814268-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/e3dec2e1d477/nihms-1814268-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/a26394c45062/nihms-1814268-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/5d64d69cb74a/nihms-1814268-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/e8a1e83b449b/nihms-1814268-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/b436dcf67258/nihms-1814268-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/e3dec2e1d477/nihms-1814268-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/a26394c45062/nihms-1814268-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/5d64d69cb74a/nihms-1814268-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd2b/9336181/e8a1e83b449b/nihms-1814268-f0006.jpg

相似文献

1
A complete reference genome improves analysis of human genetic variation.完整的参考基因组提高了人类遗传变异分析的能力。
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
2
The complete sequence of a human Y chromosome.人类 Y 染色体的完整序列。
Nature. 2023 Sep;621(7978):344-354. doi: 10.1038/s41586-023-06457-y. Epub 2023 Aug 23.
3
Improved sequence mapping using a complete reference genome and lift-over.使用完整参考基因组和提升操作提高序列比对。
Nat Methods. 2024 Jan;21(1):41-49. doi: 10.1038/s41592-023-02069-6. Epub 2023 Nov 30.
4
Localizing unmapped sequences with families to validate the Telomere-to-Telomere assembly and identify new hotspots for genetic diversity.利用家族将未映射的序列本地化,以验证端粒到端粒组装并确定新的遗传多样性热点。
Genome Res. 2023 Oct;33(10):1734-1746. doi: 10.1101/gr.277175.122. Epub 2023 Oct 25.
5
FixItFelix: improving genomic analysis by fixing reference errors.FixItFelix:通过修复参考错误来改进基因组分析。
Genome Biol. 2023 Feb 21;24(1):31. doi: 10.1186/s13059-023-02863-7.
6
Semi-automated assembly of high-quality diploid human reference genomes.半自动组装高质量的二倍体人类参考基因组。
Nature. 2022 Nov;611(7936):519-531. doi: 10.1038/s41586-022-05325-5. Epub 2022 Oct 19.
7
Epigenetic patterns in a complete human genome.人类全基因组中的表观遗传模式。
Science. 2022 Apr;376(6588):eabj5089. doi: 10.1126/science.abj5089. Epub 2022 Apr 1.
8
Sequencing and de novo assembly of 150 genomes from Denmark as a population reference.丹麦 150 个个体基因组的测序和从头组装作为一个群体参考。
Nature. 2017 Aug 3;548(7665):87-91. doi: 10.1038/nature23264. Epub 2017 Jul 26.
9
The benefit of a complete reference genome for cancer structural variant analysis.完整参考基因组在癌症结构变异分析中的益处。
medRxiv. 2024 Mar 18:2024.03.15.24304369. doi: 10.1101/2024.03.15.24304369.
10
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.

引用本文的文献

1
Finding easy regions for short-read variant calling from pangenome data.从泛基因组数据中寻找易于进行短读变异检测的区域。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf103.
2
Structural Variants: Mechanisms, Mapping, and Interpretation in Human Genetics.结构变异:人类遗传学中的机制、定位与解读
Genes (Basel). 2025 Jul 29;16(8):905. doi: 10.3390/genes16080905.
3
GenomePAM directs PAM characterization and engineering of CRISPR-Cas nucleases using mammalian genome repeats.GenomePAM利用哺乳动物基因组重复序列指导CRISPR-Cas核酸酶的PAM鉴定和工程改造。

本文引用的文献

1
Jasmine and Iris: population-scale structural variant comparison and analysis.茉莉花和虹膜:人群规模结构变异比较与分析。
Nat Methods. 2023 Mar;20(3):408-417. doi: 10.1038/s41592-022-01753-3. Epub 2023 Jan 19.
2
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
3
Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies.
Nat Biomed Eng. 2025 Aug 13. doi: 10.1038/s41551-025-01464-y.
4
Finding easy regions for short-read variant calling from pangenome data.从泛基因组数据中寻找易于进行短读变异检测的区域。
ArXiv. 2025 Aug 8:arXiv:2507.03718v2.
5
Phased genome assemblies and pangenome graphs of human populations of Japan and Saudi Arabia.日本和沙特阿拉伯人群的阶段性基因组组装和泛基因组图谱。
Sci Data. 2025 Aug 12;12(1):1316. doi: 10.1038/s41597-025-05652-y.
6
Segmental duplication-mediated rearrangements alter the landscape of mouse genomes.节段性重复介导的重排改变了小鼠基因组的格局。
bioRxiv. 2025 Jul 22:2025.07.18.665526. doi: 10.1101/2025.07.18.665526.
7
Laboratory considerations for GRCh37 to GRCh38 reference genome transition: A laboratory quality assurance bulletin of the American College of Medical Genetics and Genomics (ACMG).从GRCh37到GRCh38参考基因组转换的实验室考量:美国医学遗传学与基因组学学会(ACMG)的实验室质量保证公告
Genet Med Open. 2025 Jul 24;3:103433. doi: 10.1016/j.gimo.2025.103433. eCollection 2025.
8
Complex genetic variation in nearly complete human genomes.近乎完整的人类基因组中的复杂遗传变异。
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09140-6.
9
Human-specific gene expansions contribute to brain evolution.人类特有的基因扩增促进大脑进化。
Cell. 2025 Jul 18. doi: 10.1016/j.cell.2025.06.037.
10
Joint processing of long- and short-read sequencing data with deep learning improves variant calling.利用深度学习对长读长和短读长测序数据进行联合处理可改善变异检测。
Cell Rep Methods. 2025 Jul 21;5(7):101107. doi: 10.1016/j.crmeth.2025.101107. Epub 2025 Jul 15.
追求完美:端粒到端粒基因组组装的验证和优化策略。
Nat Methods. 2022 Jun;19(6):687-695. doi: 10.1038/s41592-022-01440-3. Epub 2022 Mar 31.
4
From telomere to telomere: The transcriptional and epigenetic state of human repeat elements.从端粒到端粒:人类重复元件的转录和表观遗传状态。
Science. 2022 Apr;376(6588):eabk3112. doi: 10.1126/science.abk3112. Epub 2022 Apr 1.
5
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
6
Segmental duplications and their variation in a complete human genome.人类全基因组中的串联重复序列及其变异。
Science. 2022 Apr;376(6588):eabj6965. doi: 10.1126/science.abj6965. Epub 2022 Apr 1.
7
Complete genomic and epigenetic maps of human centromeres.人类着丝粒的完整基因组和表观基因组图谱。
Science. 2022 Apr;376(6588):eabl4178. doi: 10.1126/science.abl4178. Epub 2022 Apr 1.
8
Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space.与美国国立人类基因组研究所(NHGRI)基因组数据科学分析、可视化和信息学实验室空间一起颠覆基因组学数据共享模式。
Cell Genom. 2022 Jan 12;2(1). doi: 10.1016/j.xgen.2021.100085. Epub 2022 Jan 13.
9
Curated variation benchmarks for challenging medically relevant autosomal genes.针对具有挑战性的医学相关常染色体基因的精选变异基准。
Nat Biotechnol. 2022 May;40(5):672-680. doi: 10.1038/s41587-021-01158-1. Epub 2022 Feb 7.
10
Failure to Detect Mutations in U2AF1 due to Changes in the GRCh38 Reference Sequence.未能检测到 U2AF1 突变,原因是 GRCh38 参考序列发生变化。
J Mol Diagn. 2022 Mar;24(3):219-223. doi: 10.1016/j.jmoldx.2021.10.013. Epub 2022 Jan 15.