Suppr超能文献

构建某一群体的泛基因组参考图谱。

Building a pan-genome reference for a population.

作者信息

Nguyen Ngan, Hickey Glenn, Zerbino Daniel R, Raney Brian, Earl Dent, Armstrong Joel, Kent W James, Haussler David, Paten Benedict

机构信息

1 Center for Biomolecular Science and Engineering, University of California , Santa Cruz, California.

出版信息

J Comput Biol. 2015 May;22(5):387-401. doi: 10.1089/cmb.2014.0146. Epub 2015 Jan 7.

Abstract

A reference genome is a high quality individual genome that is used as a coordinate system for the genomes of a population, or genomes of closely related subspecies. Given a set of genomes partitioned by homology into alignment blocks we formalize the problem of ordering and orienting the blocks such that the resulting ordering maximally agrees with the underlying genomes' ordering and orientation, creating a pan-genome reference ordering. We show this problem is NP-hard, but also demonstrate, empirically and within simulations, the performance of heuristic algorithms based upon a cactus graph decomposition to find locally maximal solutions. We describe an extension of our Cactus software to create a pan-genome reference for whole genome alignments, and demonstrate how it can be used to create novel genome browser visualizations using human variation data as a test. In addition, we test the use of a pan-genome for describing variations and as a reference for read mapping.

摘要

参考基因组是一个高质量的个体基因组,用作群体基因组或密切相关亚种基因组的坐标系统。给定一组通过同源性划分为比对块的基因组,我们将比对块排序和定向的问题形式化,使得得到的排序最大程度地与基础基因组的排序和定向一致,从而创建一个泛基因组参考排序。我们证明这个问题是NP难问题,但也通过实验和模拟展示了基于仙人掌图分解的启发式算法寻找局部最大解的性能。我们描述了Cactus软件的扩展,用于为全基因组比对创建泛基因组参考,并展示了如何使用人类变异数据作为测试来创建新颖的基因组浏览器可视化。此外,我们测试了使用泛基因组来描述变异以及作为读取映射的参考。

相似文献

1
Building a pan-genome reference for a population.构建某一群体的泛基因组参考图谱。
J Comput Biol. 2015 May;22(5):387-401. doi: 10.1089/cmb.2014.0146. Epub 2015 Jan 7.
2
Cactus: Algorithms for genome multiple sequence alignment.仙人掌:基因组多重序列比对算法。
Genome Res. 2011 Sep;21(9):1512-28. doi: 10.1101/gr.123356.111. Epub 2011 Jun 10.
5
Coordinates and intervals in graph-based reference genomes.基于图谱的参考基因组中的坐标和区间
BMC Bioinformatics. 2017 May 18;18(1):263. doi: 10.1186/s12859-017-1678-9.
7
Multiple whole-genome alignments without a reference organism.无参考生物体的多个全基因组比对
Genome Res. 2009 Apr;19(4):682-9. doi: 10.1101/gr.081778.108. Epub 2009 Jan 28.
9
Cactus graphs for genome comparisons.用于基因组比较的仙人掌图。
J Comput Biol. 2011 Mar;18(3):469-81. doi: 10.1089/cmb.2010.0252.
10
Superbubbles, Ultrabubbles, and Cacti.超级气泡、超气泡与仙人掌。
J Comput Biol. 2018 Jul;25(7):649-663. doi: 10.1089/cmb.2017.0251. Epub 2018 Feb 20.

引用本文的文献

1
Reference-agnostic representation and visualization of pan-genomes.泛基因组的无参表示和可视化。
BMC Bioinformatics. 2021 Oct 16;22(1):502. doi: 10.1186/s12859-021-04424-w.
6
Estimating Pangenomes with Roary.用 Roary 估计泛基因组。
Mol Biol Evol. 2020 Mar 1;37(3):933-939. doi: 10.1093/molbev/msz284.
8
Characterizing the Major Structural Variant Alleles of the Human Genome.人类基因组主要结构变异等位基因的特征。
Cell. 2019 Jan 24;176(3):663-675.e19. doi: 10.1016/j.cell.2018.12.019. Epub 2019 Jan 17.
10
Whole-Genome Alignment and Comparative Annotation.全基因组比对和注释。
Annu Rev Anim Biosci. 2019 Feb 15;7:41-64. doi: 10.1146/annurev-animal-020518-115005. Epub 2018 Oct 31.

本文引用的文献

1
The UCSC Genome Browser database: extensions and updates 2013.UCSC 基因组浏览器数据库:扩展和更新 2013 年版
Nucleic Acids Res. 2013 Jan;41(Database issue):D64-9. doi: 10.1093/nar/gks1048. Epub 2012 Nov 15.
3
Cactus: Algorithms for genome multiple sequence alignment.仙人掌:基因组多重序列比对算法。
Genome Res. 2011 Sep;21(9):1512-28. doi: 10.1101/gr.123356.111. Epub 2011 Jun 10.
4
A user's guide to the encyclopedia of DNA elements (ENCODE).DNA 元件百科全书(ENCODE)使用指南
PLoS Biol. 2011 Apr;9(4):e1001046. doi: 10.1371/journal.pbio.1001046. Epub 2011 Apr 19.
5
Cactus graphs for genome comparisons.用于基因组比较的仙人掌图。
J Comput Biol. 2011 Mar;18(3):469-81. doi: 10.1089/cmb.2010.0252.
6
The GENCODE exome: sequencing the complete human exome.GENCODE 外显子组:对完整人类外显子组进行测序。
Eur J Hum Genet. 2011 Jul;19(7):827-31. doi: 10.1038/ejhg.2011.28. Epub 2011 Mar 2.
8
How and why chromosome inversions evolve.染色体倒位的发生机制和原因。
PLoS Biol. 2010 Sep 28;8(9):e1000501. doi: 10.1371/journal.pbio.1000501.
10
Genetic map refinement using a comparative genomic approach.
J Comput Biol. 2009 Oct;16(10):1475-86. doi: 10.1089/cmb.2009.0094.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验