Pangenome graph construction from genome alignments with Minigraph-Cactus.

Affiliations

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.

Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.

Publication information

Nat Biotechnol. 2024 Apr;42(4):663-673. doi: 10.1038/s41587-023-01793-w. Epub 2023 May 10.

Abstract

Pangenome references address biases of reference genomes by storing a representative set of diverse haplotypes and their alignment, usually as a graph. Alternate alleles determined by variant callers can be used to construct pangenome graphs, but advances in long-read sequencing are leading to widely available, high-quality phased assemblies. Constructing a pangenome graph directly from assemblies, as opposed to variant calls, leverages the graph's ability to represent variation at different scales. Here we present the Minigraph-Cactus pangenome pipeline, which creates pangenomes directly from whole-genome alignments, and demonstrate its ability to scale to 90 human haplotypes from the Human Pangenome Reference Consortium. The method builds graphs containing all forms of genetic variation while allowing use of current mapping and genotyping tools. We measure the effect of the quality and completeness of reference genomes used for analysis within the pangenomes and show that using the CHM13 reference from the Telomere-to-Telomere Consortium improves the accuracy of our methods. We also demonstrate construction of a Drosophila melanogaster pangenome.
