Graph accordance of next-generation sequence assemblies.

Affiliation

The Genome Institute, Washington University School of Medicine, 4444 Forest Park Avenue, St Louis, MO 63108, USA.

Publication information

Bioinformatics. 2012 Jan 1;28(1):13-6. doi: 10.1093/bioinformatics/btr588. Epub 2011 Oct 23.

DOI: 10.1093/bioinformatics/btr588
PMID: 22025481
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3244760/
Abstract

MOTIVATION

No individual assembly algorithm addresses all the known limitations of assembling short-length sequences. Reduced overall contig length is the major problem that limits the use of these assemblies. We describe an algorithm that takes advantage of different assembly algorithms or sequencing platforms to improve the quality of next-generation sequence (NGS) assemblies.

RESULTS

The algorithm is implemented as a graph accordance assembly (GAA) program. The algorithm constructs an accordance graph to capture the mapping information between the target and query assemblies. Based on the accordance graph, the contigs or scaffolds of the target assembly can be extended, merged or bridged together. Extra constraints, including gap sizes, mate pairs, scaffold order and orientation, are explored to enforce those accordance operations in the correct context. We applied GAA to various chicken NGS assemblies and the results demonstrate improved contiguity statistics and higher genome and gene coverage.
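
As a rough illustration of the idea of linking target contigs through their mappings onto a query assembly, the following Python sketch builds edges between target contigs that align next to each other on the same query contig. It is not the GAA implementation (which is written in Perl): the `Hit` record, the `build_accordance_edges` function, and the `max_gap` threshold are hypothetical names introduced here, and orientation, mate pairs and scaffold order are ignored for brevity.

```python
from collections import defaultdict
from typing import NamedTuple


class Hit(NamedTuple):
    """One alignment of a target contig onto a query contig (hypothetical record)."""
    target: str   # target contig name
    query: str    # query contig name
    q_start: int  # 0-based start on the query contig
    q_end: int    # end (exclusive) on the query contig


def build_accordance_edges(hits, max_gap=1000):
    """Link target contigs that map next to each other on the same query contig.

    Returns (left_target, right_target, query, gap) tuples. A negative gap
    means the two target contigs overlap on the query (a merge candidate);
    a non-negative gap means query sequence could bridge them.
    max_gap is an assumed cutoff, not a value taken from the paper.
    """
    by_query = defaultdict(list)
    for hit in hits:
        by_query[hit.query].append(hit)

    edges = []
    for query, group in by_query.items():
        group.sort(key=lambda h: h.q_start)
        # Compare each hit with its immediate neighbour along the query.
        for left, right in zip(group, group[1:]):
            gap = right.q_start - left.q_end
            if left.target != right.target and gap <= max_gap:
                edges.append((left.target, right.target, query, gap))
    return edges


# Made-up example data, for illustration only.
hits = [
    Hit("t1", "q1", 0, 500),
    Hit("t2", "q1", 450, 900),    # overlaps t1 by 50 bp on q1
    Hit("t3", "q1", 1200, 1800),  # starts 300 bp after t2 on q1
    Hit("t4", "q2", 0, 400),
    Hit("t5", "q2", 5000, 5600),  # too far from t4 under max_gap=1000
]
edges = build_accordance_edges(hits)
for edge in edges:
    print(edge)
```

In this toy input, t1 and t2 come out as a merge candidate and t2 and t3 as a bridge candidate, while t4 and t5 stay unlinked. A fuller treatment would also check the constraints the abstract lists (gap sizes, mate pairs, scaffold order and orientation) before acting on an edge.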

AVAILABILITY

GAA is implemented in object-oriented Perl and is available at http://sourceforge.net/projects/gaa-wugi/.

CONTACT

lye@genome.wustl.edu
