BOSS: a novel scaffolding algorithm based on an optimized scaffold graph.

Affiliations

School of Information Science and Engineering, Central South University, ChangSha 410083, China.

College of Computer Science and Technology, Henan Polytechnic University, JiaoZuo 454000, China.

Publication information

Bioinformatics. 2017 Jan 15;33(2):169-176. doi: 10.1093/bioinformatics/btw597. Epub 2016 Sep 14.

DOI:10.1093/bioinformatics/btw597
PMID:27634951
Abstract

MOTIVATION

Scaffolding, which aims to determine the orientations and order of fragmented contigs, is an essential step of assembly pipelines and can make assembly results more complete. Most existing scaffolding tools adopt scaffold graph approaches. However, due to repetitive regions in the genome, sequencing errors and uneven sequencing depth, constructing an accurate scaffold graph is still a challenging task.

RESULTS

In this paper, we present a novel algorithm (called BOSS), which employs paired reads for scaffolding. To construct a scaffold graph, BOSS utilizes the distribution of insert size to decide whether an edge between two vertices (contigs) should be added and how that edge should be weighted. Moreover, BOSS adopts an iterative strategy to detect spurious edges whose removal can guarantee no contradictions in the scaffold graph. Based on the constructed scaffold graph, BOSS employs a heuristic algorithm to sort vertices (contigs) and then generates scaffolds. The experimental results demonstrate that BOSS produces more satisfactory scaffolds than other popular scaffolding tools on real sequencing data from four genomes.
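The idea of using the insert-size distribution to decide whether to add an edge can be illustrated with a minimal sketch. This is not BOSS's actual implementation, which the abstract does not specify. The `Link` representation, the median-based gap estimate, the thresholds `min_links` and `max_dev`, and the weight definition are all assumptions made for illustration.

```python
from dataclasses import dataclass
from statistics import median
from typing import List, Optional, Tuple


@dataclass
class Link:
    """One read pair joining contig A to contig B (hypothetical representation)."""
    left_tail: int   # bases from the first read's start to the end of contig A
    right_head: int  # bases from the start of contig B to the second read's end


def estimate_gap(links: List[Link], insert_mean: float) -> float:
    """Median of the gaps implied by each pair, using
    insert size = left_tail + gap + right_head."""
    return median(insert_mean - (l.left_tail + l.right_head) for l in links)


def candidate_edge(
    links: List[Link],
    insert_mean: float,
    insert_sd: float,
    min_links: int = 3,    # assumed minimum number of supporting pairs
    max_dev: float = 3.0,  # assumed tolerance, in standard deviations
) -> Optional[Tuple[float, float]]:
    """Return (gap, weight) if the pairs support an edge, otherwise None."""
    if len(links) < min_links:
        return None
    gap = estimate_gap(links, insert_mean)
    # A pair is consistent when the insert size it implies under the
    # estimated gap lies within max_dev SDs of the library mean.
    consistent = [
        l for l in links
        if abs(l.left_tail + gap + l.right_head - insert_mean) <= max_dev * insert_sd
    ]
    if len(consistent) < min_links:
        return None
    # Assumed weight: fraction of linking pairs that agree with the gap.
    weight = len(consistent) / len(links)
    return gap, weight
```

In this sketch, three pairs that each span 450 bases of contig sequence in a library with mean insert size 500 imply a gap of 50 and yield an edge with full weight. A fourth pair that disagrees strongly lowers the weight without changing the median-based gap estimate.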

AVAILABILITY AND IMPLEMENTATION

BOSS is publicly available for download at https://github.com/bioinfomaticsCSU/BOSS.

CONTACT

jxwang@mail.csu.edu.cn

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

1
BOSS: a novel scaffolding algorithm based on an optimized scaffold graph.
Bioinformatics. 2017 Jan 15;33(2):169-176. doi: 10.1093/bioinformatics/btw597. Epub 2016 Sep 14.
2
SCOP: a novel scaffolding algorithm based on contig classification and optimization.
Bioinformatics. 2019 Apr 1;35(7):1142-1150. doi: 10.1093/bioinformatics/bty773.
3
ScaffMatch: scaffolding algorithm based on maximum weight matching.
Bioinformatics. 2015 Aug 15;31(16):2632-8. doi: 10.1093/bioinformatics/btv211. Epub 2015 Apr 17.
4
SLR: a scaffolding algorithm based on long reads and contig classification.
BMC Bioinformatics. 2019 Oct 30;20(1):539. doi: 10.1186/s12859-019-3114-9.
5
SLHSD: hybrid scaffolding method based on short and long reads.
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad169.
6
SGTK: a toolkit for visualization and assessment of scaffold graphs.
Bioinformatics. 2019 Jul 1;35(13):2303-2305. doi: 10.1093/bioinformatics/bty956.
7
EPGA: de novo assembly using the distributions of reads and insert size.
Bioinformatics. 2015 Mar 15;31(6):825-33. doi: 10.1093/bioinformatics/btu762. Epub 2014 Nov 17.
8
A comprehensive evaluation of assembly scaffolding tools.
Genome Biol. 2014 Mar 3;15(3):R42. doi: 10.1186/gb-2014-15-3-r42.
9
CSAR: a contig scaffolding tool using algebraic rearrangements.
Bioinformatics. 2018 Jan 1;34(1):109-111. doi: 10.1093/bioinformatics/btx543.
10
ISEA: Iterative Seed-Extension Algorithm for De Novo Assembly Using Paired-End Information and Insert Size Distribution.
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jul-Aug;14(4):916-925. doi: 10.1109/TCBB.2016.2550433. Epub 2016 Apr 5.

Cited by

1
Comparative genomics of endophytic fungi Apiospora malaysiana with related ascomycetes indicates adaptation attuned to lifestyle choices with potential sustainable cellulolytic activity.
DNA Res. 2025 May 28;32(3). doi: 10.1093/dnares/dsaf011.
2
Maptcha: an efficient parallel workflow for hybrid genome scaffolding.
BMC Bioinformatics. 2024 Aug 8;25(1):263. doi: 10.1186/s12859-024-05878-4.
3
Genome Analysis Coupled With Transcriptomics Reveals the Reduced Fitness of a Hot Spring Cyanobacterium UU774 Under Exogenous Nitrogen Supplement.
Front Microbiol. 2022 Jul 1;13:909289. doi: 10.3389/fmicb.2022.909289. eCollection 2022.
4
LROD: An Overlap Detection Algorithm for Long Reads Based on k-mer Distribution.
Front Genet. 2020 Jul 29;11:632. doi: 10.3389/fgene.2020.00632. eCollection 2020.
5
MAC: Merging Assemblies by Using Adjacency Algebraic Model and Classification.
Front Genet. 2020 Jan 31;10:1396. doi: 10.3389/fgene.2019.01396. eCollection 2019.
6
LRScaf: improving draft genomes using long noisy reads.
BMC Genomics. 2019 Dec 9;20(1):955. doi: 10.1186/s12864-019-6337-2.
7
SLR: a scaffolding algorithm based on long reads and contig classification.
BMC Bioinformatics. 2019 Oct 30;20(1):539. doi: 10.1186/s12859-019-3114-9.
8
Linking De Novo Assembly Results with Long DNA Reads Using the dnaasm-link Application.
Biomed Res Int. 2019 Apr 11;2019:7847064. doi: 10.1155/2019/7847064. eCollection 2019.
9
A Sequence-Based Novel Approach for Quality Evaluation of Third-Generation Sequencing Reads.
Genes (Basel). 2019 Jan 14;10(1):44. doi: 10.3390/genes10010044.
10
Fast-SG: an alignment-free algorithm for hybrid assembly.
Gigascience. 2018 May 1;7(5). doi: 10.1093/gigascience/giy048.