• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

近乎完整的人类基因组中的复杂遗传变异。

Complex genetic variation in nearly complete human genomes.

作者信息

Logsdon Glennis A, Ebert Peter, Audano Peter A, Loftus Mark, Porubsky David, Ebler Jana, Yilmaz Feyza, Hallast Pille, Prodanov Timofey, Yoo DongAhn, Paisie Carolyn A, Harvey William T, Zhao Xuefang, Martino Gianni V, Henglin Mir, Munson Katherine M, Rabbani Keon, Chin Chen-Shan, Gu Bida, Ashraf Hufsah, Scholz Stephan, Austine-Orimoloye Olanrewaju, Balachandran Parithi, Bonder Marc Jan, Cheng Haoyu, Chong Zechen, Crabtree Jonathan, Gerstein Mark, Guethlein Lisbeth A, Hasenfeld Patrick, Hickey Glenn, Hoekzema Kendra, Hunt Sarah E, Jensen Matthew, Jiang Yunzhe, Koren Sergey, Kwon Youngjun, Li Chong, Li Heng, Li Jiaqi, Norman Paul J, Oshima Keisuke K, Paten Benedict, Phillippy Adam M, Pollock Nicholas R, Rausch Tobias, Rautiainen Mikko, Song Yuwei, Söylev Arda, Sulovari Arvis, Surapaneni Likhitha, Tsapalou Vasiliki, Zhou Weichen, Zhou Ying, Zhu Qihui, Zody Michael C, Mills Ryan E, Devine Scott E, Shi Xinghua, Talkowski Michael E, Chaisson Mark J P, Dilthey Alexander T, Konkel Miriam K, Korbel Jan O, Lee Charles, Beck Christine R, Eichler Evan E, Marschall Tobias

机构信息

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Department of Genetics, Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

出版信息

Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09140-6.

DOI:10.1038/s41586-025-09140-6
PMID:40702183
Abstract

Diverse sets of complete human genomes are required to construct a pangenome reference and to understand the extent of complex structural variation. Here we sequence 65 diverse human genomes and build 130 haplotype-resolved assemblies (median continuity of 130 Mb), closing 92% of all previous assembly gaps and reaching telomere-to-telomere status for 39% of the chromosomes. We highlight complete sequence continuity of complex loci, including the major histocompatibility complex (MHC), SMN1/SMN2, NBPF8 and AMY1/AMY2, and fully resolve 1,852 complex structural variants. In addition, we completely assemble and validate 1,246 human centromeres. We find up to 30-fold variation in α-satellite higher-order repeat array length and characterize the pattern of mobile element insertions into α-satellite higher-order repeat arrays. Although most centromeres predict a single site of kinetochore attachment, epigenetic analysis suggests the presence of two hypomethylated regions for 7% of centromeres. Combining our data with the draft pangenome reference significantly enhances genotyping accuracy from short-read data, enabling whole-genome inference to a median quality value of 45. Using this approach, 26,115 structural variants per individual are detected, substantially increasing the number of structural variants now amenable to downstream disease association studies.

摘要

构建泛基因组参考图谱并了解复杂结构变异的程度需要多样化的完整人类基因组集合。在此,我们对65个多样化的人类基因组进行了测序,并构建了130个单倍型解析的基因组组装序列(中位连续性为130 Mb),填补了此前所有组装缺口的92%,39%的染色体达到了端粒到端粒的完整状态。我们强调了复杂基因座的完整序列连续性,包括主要组织相容性复合体(MHC)、SMN1/SMN2、NBPF8和AMY1/AMY2,并完全解析了1852个复杂结构变异。此外,我们完全组装并验证了1246个人类着丝粒。我们发现α卫星高阶重复序列阵列长度存在高达30倍的变异,并对插入α卫星高阶重复序列阵列的移动元件插入模式进行了表征。尽管大多数着丝粒预测有一个单一的动粒附着位点,但表观遗传分析表明,7%的着丝粒存在两个低甲基化区域。将我们的数据与泛基因组参考草图相结合,显著提高了短读长数据的基因分型准确性,使全基因组推断的中位质量值达到45。使用这种方法,每个个体可检测到26115个结构变异,大大增加了目前适用于下游疾病关联研究的结构变异数量。

相似文献

1
Complex genetic variation in nearly complete human genomes.近乎完整的人类基因组中的复杂遗传变异。
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09140-6.
2
Complex genetic variation in nearly complete human genomes.近乎完整的人类基因组中的复杂遗传变异。
bioRxiv. 2024 Sep 25:2024.09.24.614721. doi: 10.1101/2024.09.24.614721.
3
Long-read genomics reveal extensive nuclear-specific evolution and allele-specific expression in a dikaryotic fungus.长读长基因组学揭示了双核真菌中广泛的核特异性进化和等位基因特异性表达。
Genome Res. 2025 Jun 2;35(6):1364-1376. doi: 10.1101/gr.280359.124.
4
Precise Identification of Higher-Order Repeats (HORs) in T2T-CHM13 Assembly of Human Chromosome 21-Novel 52mer HOR and Failures of Hg38 Assembly.人类21号染色体T2T-CHM13组装中高阶重复序列(HORs)的精确鉴定——新型52聚体HOR及Hg38组装的失败
Genes (Basel). 2025 Jul 27;16(8):885. doi: 10.3390/genes16080885.
5
Chromosome-level haplotype-resolved genome assembly provides insights into the highly heterozygous genome of Italian ryegrass (Lolium multiflorum Lam.).染色体水平单倍型解析的基因组组装为多花黑麦草(Lolium multiflorum Lam.)高度杂合的基因组提供了见解。
Plant Genome. 2025 Sep;18(3):e70079. doi: 10.1002/tpg2.70079.
6
Verkko2 integrates proximity-ligation data with long-read De Bruijn graphs for efficient telomere-to-telomere genome assembly, phasing, and scaffolding.Verkko2将邻近连接数据与长读长德布鲁因图相结合,以实现高效的端粒到端粒基因组组装、定相和支架搭建。
Genome Res. 2025 Jun 12. doi: 10.1101/gr.280383.124.
7
Human de novo mutation rates from a four-generation pedigree reference.基于一个四代家系参考得出的人类新生突变率。
Nature. 2025 Apr 23. doi: 10.1038/s41586-025-08922-2.
8
De novo Genome Assembly Using Long Reads and Chromosome Conformation Capture.使用长读长和染色体构象捕获进行从头基因组组装
Methods Mol Biol. 2025;2935:1-27. doi: 10.1007/978-1-0716-4583-3_1.
9
The telomere-to-telomere gapless genome of grass carp provides insights for genetic improvement.草鱼的端粒到端粒无间隙基因组为遗传改良提供了见解。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf059.
10
A familial, telomere-to-telomere reference for human mutation and recombination from a four-generation pedigree.来自一个四代家系的人类突变和重组的全基因组端粒到端粒参考序列。
bioRxiv. 2024 Aug 5:2024.08.05.606142. doi: 10.1101/2024.08.05.606142.

引用本文的文献

1
The future of pharmaceuticals: Artificial intelligence in drug discovery and development.制药的未来:药物研发中的人工智能
J Pharm Anal. 2025 Aug;15(8):101248. doi: 10.1016/j.jpha.2025.101248. Epub 2025 Feb 26.
2
Pangenome discovery of missing autism variants.自闭症缺失变异体的泛基因组发现。
medRxiv. 2025 Jul 22:2025.07.21.25331932. doi: 10.1101/2025.07.21.25331932.
3
Structural variation in 1,019 diverse humans based on long-read sequencing.基于长读长测序的1019名不同个体的结构变异

本文引用的文献

1
Structural variation in 1,019 diverse humans based on long-read sequencing.基于长读长测序的1019名不同个体的结构变异
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09290-7.
2
Human de novo mutation rates from a four-generation pedigree reference.基于一个四代家系参考得出的人类新生突变率。
Nature. 2025 Apr 23. doi: 10.1038/s41586-025-08922-2.
3
Complete sequencing of ape genomes.猿类基因组的完整测序。
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09290-7.
4
Population differences of chromosome 22q11.2 duplication structure predispose differentially to microdeletion and inversion.22号染色体q11.2区域重复结构的人群差异对微缺失和倒位具有不同的易感性。
bioRxiv. 2025 Jul 7:2025.07.04.662981. doi: 10.1101/2025.07.04.662981.
5
A T2T-CHM13 recombination map and globally diverse haplotype reference panel improves phasing and imputation.一个端粒到端粒的人类基因组参考组装(T2T-CHM13)重组图谱和全球多样化单倍型参考面板改善了定相和填充。
bioRxiv. 2025 Feb 28:2025.02.24.639687. doi: 10.1101/2025.02.24.639687.
6
ScatTR: Estimating the Size of Long Tandem Repeat Expansions from Short-Reads.ScatTR:从短读长估计长串联重复序列的扩增大小。
bioRxiv. 2025 Feb 20:2025.02.15.638440. doi: 10.1101/2025.02.15.638440.
7
Structural variation, selection, and diversification of the gene family from the human pangenome.人类泛基因组中基因家族的结构变异、选择与多样化
bioRxiv. 2025 Feb 5:2025.02.04.636496. doi: 10.1101/2025.02.04.636496.
8
GREGoR: Accelerating Genomics for Rare Diseases.GREGoR:加速罕见病基因组学研究
ArXiv. 2024 Dec 18:arXiv:2412.14338v1.
9
A personalized multi-platform assessment of somatic mosaicism in the human frontal cortex.人类额叶皮质体细胞镶嵌现象的个性化多平台评估
bioRxiv. 2024 Dec 21:2024.12.18.629274. doi: 10.1101/2024.12.18.629274.
10
Identification and annotation of centromeric hypomethylated regions with Centromere Dip Region (CDR)-Finder.使用着丝粒双区域(CDR)查找器对着丝粒低甲基化区域进行识别和注释。
bioRxiv. 2024 Nov 4:2024.11.01.621587. doi: 10.1101/2024.11.01.621587.
Nature. 2025 May;641(8062):401-418. doi: 10.1038/s41586-025-08816-3. Epub 2025 Apr 9.
4
Structural polymorphism and diversity of human segmental duplications.人类节段性重复序列的结构多态性与多样性
Nat Genet. 2025 Feb;57(2):390-401. doi: 10.1038/s41588-024-02051-8. Epub 2025 Jan 8.
5
Identification and annotation of centromeric hypomethylated regions with CDR-Finder.使用CDR-Finder对着丝粒低甲基化区域进行鉴定和注释。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae733.
6
Reconstruction of the human amylase locus reveals ancient duplications seeding modern-day variation.人类淀粉酶基因座的重建揭示了古代重复事件引发了现代的变异。
Science. 2024 Nov 22;386(6724):eadn0609. doi: 10.1126/science.adn0609.
7
Graphasing: phasing diploid genome assembly graphs with single-cell strand sequencing.Graphasing:利用单细胞测序进行二倍体基因组组装图谱的相位分析。
Genome Biol. 2024 Oct 10;25(1):265. doi: 10.1186/s13059-024-03409-1.
8
Impact and characterization of serial structural variations across humans and great apes.人类和大型类人猿中连续结构变异的影响和特征。
Nat Commun. 2024 Sep 13;15(1):8007. doi: 10.1038/s41467-024-52027-9.
9
Recurrent evolution and selection shape structural diversity at the amylase locus.淀粉酶基因座结构多样性的反复进化和选择。
Nature. 2024 Oct;634(8034):617-625. doi: 10.1038/s41586-024-07911-1. Epub 2024 Sep 4.
10
High-resolution African HLA resource uncovers HLA-DRB1 expression effects underlying vaccine response.高分辨率非洲 HLA 资源揭示了疫苗反应背后的 HLA-DRB1 表达效应。
Nat Med. 2024 May;30(5):1384-1394. doi: 10.1038/s41591-024-02944-5. Epub 2024 May 13.