• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

定义和编目泛基因组图谱中的变异体。

Defining and cataloging variants in pangenome graphs.

作者信息

Nowbandegani Pouria Salehi, Zhang Shenghan, Hu Haoyang, Li Heng, O'Connor Luke J

机构信息

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

出版信息

bioRxiv. 2025 Aug 4:2025.08.04.668502. doi: 10.1101/2025.08.04.668502.

DOI:10.1101/2025.08.04.668502
PMID:40799530
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12340789/
Abstract

Structural variation causes some human haplotypes to align poorly with the linear reference genome, leading to 'reference bias'. A pangenome reference graph could ameliorate this bias by relating a sample to multiple reference assemblies. However, this approach requires a new definition of a 'genetic variant.' We introduce a definition of pangenome variants and a method, , to identify them. Our approach involves a pangenome which includes all nodes (sequences) of the pangenome graph, but only a subset of its edges; non-reference edges are . Our variants are biallelic and have well-defined positions. Analyzing the Minigraph-Cactus draft human pangenome reference graph, we identified 29.6 million genetic variants. Most variants (99.2%) are small, and most small variants (73.9%) are SNPs. 3.5 million variants (11.7%) have a reference allele which is not on GRCh38; these variants are difficult to detect without a pangenome reference, or with existing pangenome-based approaches. They tend to be embedded within tangled, multiallelic regions. We analyze two medically relevant regions, around the HLA-A and RHD genes, identifying thousands of small variants embedded within several large insertions, deletions, and inversions. We release an open-source software tool together with a VCF variant catalogue.

摘要

结构变异导致一些人类单倍型与线性参考基因组的比对效果不佳,从而产生“参考偏差”。泛基因组参考图通过将样本与多个参考组装进行关联,可改善这种偏差。然而,这种方法需要对“基因变异”进行新的定义。我们引入了泛基因组变异的定义以及一种识别它们的方法。我们的方法涉及一个泛基因组,它包含泛基因组图的所有节点(序列),但只包含其边的一个子集;非参考边是……我们的变异是双等位基因的,并且具有明确的位置。通过分析Minigraph-Cactus人类泛基因组参考草图,我们识别出了2960万个基因变异。大多数变异(99.2%)是小变异,并且大多数小变异(73.9%)是单核苷酸多态性(SNP)。350万个变异(11.7%)具有不在GRCh38上的参考等位基因;如果没有泛基因组参考,或者使用现有的基于泛基因组的方法,这些变异很难被检测到。它们往往嵌入在复杂的多等位基因区域内。我们分析了两个与医学相关的区域,即HLA - A和RHD基因周围,识别出了数千个嵌入在几个大的插入、缺失和倒位中的小变异。我们发布了一个开源软件工具以及一个VCF变异目录。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/2564a1c22bee/nihpp-2025.08.04.668502v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/ee9ba195fbc4/nihpp-2025.08.04.668502v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/d87e9cce8c58/nihpp-2025.08.04.668502v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/eb30c4ce66e7/nihpp-2025.08.04.668502v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/2564a1c22bee/nihpp-2025.08.04.668502v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/ee9ba195fbc4/nihpp-2025.08.04.668502v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/d87e9cce8c58/nihpp-2025.08.04.668502v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/eb30c4ce66e7/nihpp-2025.08.04.668502v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8203/12340789/2564a1c22bee/nihpp-2025.08.04.668502v1-f0004.jpg

相似文献

1
Defining and cataloging variants in pangenome graphs.定义和编目泛基因组图谱中的变异体。
bioRxiv. 2025 Aug 4:2025.08.04.668502. doi: 10.1101/2025.08.04.668502.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Pangenome graph construction from genome alignments with Minigraph-Cactus.基于 Minigraph-Cactus 的基因组比对构建泛基因组图谱。
Nat Biotechnol. 2024 Apr;42(4):663-673. doi: 10.1038/s41587-023-01793-w. Epub 2023 May 10.
4
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
8
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
9
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
10
Sexual Harassment and Prevention Training性骚扰与预防培训

本文引用的文献

1
Building pangenome graphs.构建泛基因组图谱。
Nat Methods. 2024 Nov;21(11):2008-2012. doi: 10.1038/s41592-024-02430-3. Epub 2024 Oct 21.
2
The GIAB genomic stratifications resource for human reference genomes.GIAB 基因组分层资源用于人类参考基因组。
Nat Commun. 2024 Oct 19;15(1):9029. doi: 10.1038/s41467-024-53260-y.
3
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
4
Pangenome graph construction from genome alignments with Minigraph-Cactus.基于 Minigraph-Cactus 的基因组比对构建泛基因组图谱。
Nat Biotechnol. 2024 Apr;42(4):663-673. doi: 10.1038/s41587-023-01793-w. Epub 2023 May 10.
5
Pangenome Graphs.泛基因组图谱。
Annu Rev Genomics Hum Genet. 2020 Aug 31;21:139-162. doi: 10.1146/annurev-genom-120219-080406. Epub 2020 May 26.
6
Variation graph toolkit improves read mapping by representing genetic variation in the reference.变异图谱工具包通过表示参考中的遗传变异来提高读映射质量。
Nat Biotechnol. 2018 Oct;36(9):875-879. doi: 10.1038/nbt.4227. Epub 2018 Aug 20.
7
Accurate detection of complex structural variations using single-molecule sequencing.利用单分子测序技术准确检测复杂结构变异。
Nat Methods. 2018 Jun;15(6):461-468. doi: 10.1038/s41592-018-0001-7. Epub 2018 Apr 30.
8
Superbubbles, Ultrabubbles, and Cacti.超级气泡、超气泡与仙人掌。
J Comput Biol. 2018 Jul;25(7):649-663. doi: 10.1089/cmb.2017.0251. Epub 2018 Feb 20.
9
Bandage: interactive visualization of de novo genome assemblies.绷带:从头基因组组装的交互式可视化
Bioinformatics. 2015 Oct 15;31(20):3350-2. doi: 10.1093/bioinformatics/btv383. Epub 2015 Jun 22.
10
Toward simplifying and accurately formulating fragment assembly.朝着简化和精确制定片段组装的方向。
J Comput Biol. 1995 Summer;2(2):275-90. doi: 10.1089/cmb.1995.2.275.