• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CAMSA:一种用于支架组件比较分析和合并的工具。

CAMSA: a tool for comparative analysis and merging of scaffold assemblies.

作者信息

Aganezov Sergey S, Alekseyev Max A

机构信息

Princeton University, 35 Olden St., Princeton, 08450, NJ, USA.

ITMO University, 49 Kronverksky Pr., St. Petersburg, 197101, Russia.

出版信息

BMC Bioinformatics. 2017 Dec 6;18(Suppl 15):496. doi: 10.1186/s12859-017-1919-y.

DOI:10.1186/s12859-017-1919-y
PMID:29244014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5731503/
Abstract

BACKGROUND

Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there exists a number of methods for reconstruction of the genome from its scaffolds, utilizing various computational and wet-lab techniques, they often can produce only partial error-prone scaffold assemblies. It therefore becomes important to compare and merge scaffold assemblies produced by different methods, thus combining their advantages and highlighting present conflicts for further investigation. These tasks may be labor intensive if performed manually.

RESULTS

We present CAMSA-a tool for comparative analysis and merging of two or more given scaffold assemblies. The tool (i) creates an extensive report with several comparative quality metrics; (ii) constructs the most confident merged scaffold assembly; and (iii) provides an interactive framework for a visual comparative analysis of the given assemblies. Among the CAMSA features, only scaffold merging can be evaluated in comparison to existing methods. Namely, it resembles the functionality of assembly reconciliation tools, although their primary targets are somewhat different. Our evaluations show that CAMSA produces merged assemblies of comparable or better quality than existing assembly reconciliation tools while being the fastest in terms of the total running time.

CONCLUSIONS

CAMSA addresses the current deficiency of tools for automated comparison and analysis of multiple assemblies of the same set scaffolds. Since there exist numerous methods and techniques for scaffold assembly, identifying similarities and dissimilarities across assemblies produced by different methods is beneficial both for the developers of scaffold assembly algorithms and for the researchers focused on improving draft assemblies of specific organisms.

摘要

背景

尽管近期在基因组测序和组装方面取得了进展,但目前许多可用的组装基因组仍处于草图形式。此类草图基因组由大量基因组片段(支架)组成,这些片段在基因组中的位置和方向尚不清楚。虽然存在多种利用各种计算和湿实验室技术从支架重建基因组的方法,但它们往往只能产生部分容易出错的支架组装。因此,比较和合并不同方法产生的支架组装变得很重要,这样可以结合它们的优点并突出当前的冲突以便进一步研究。如果手动执行这些任务可能会很费力。

结果

我们展示了CAMSA——一种用于对两个或更多给定支架组装进行比较分析和合并的工具。该工具(i)创建一份包含多个比较质量指标的详细报告;(ii)构建最可靠的合并支架组装;(iii)提供一个交互式框架用于对给定组装进行可视化比较分析。在CAMSA的功能中,只有支架合并可以与现有方法进行比较评估。也就是说,它类似于组装协调工具的功能,尽管它们的主要目标略有不同。我们的评估表明,CAMSA产生的合并组装质量与现有组装协调工具相当或更好,同时在总运行时间方面是最快的。

结论

CAMSA解决了当前用于自动比较和分析同一组支架的多个组装的工具的不足。由于存在众多用于支架组装的方法和技术,识别不同方法产生的组装之间的异同对于支架组装算法的开发者和专注于改进特定生物体草图组装的研究人员都有益处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/418f8a8908b2/12859_2017_1919_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/04d9993c3af8/12859_2017_1919_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/2308e65f4cfd/12859_2017_1919_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/d568345be4bd/12859_2017_1919_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/00675d597675/12859_2017_1919_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/418f8a8908b2/12859_2017_1919_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/04d9993c3af8/12859_2017_1919_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/2308e65f4cfd/12859_2017_1919_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/d568345be4bd/12859_2017_1919_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/00675d597675/12859_2017_1919_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cec9/5731503/418f8a8908b2/12859_2017_1919_Fig5_HTML.jpg

相似文献

1
CAMSA: a tool for comparative analysis and merging of scaffold assemblies.CAMSA:一种用于支架组件比较分析和合并的工具。
BMC Bioinformatics. 2017 Dec 6;18(Suppl 15):496. doi: 10.1186/s12859-017-1919-y.
2
A comparative evaluation of genome assembly reconciliation tools.基因组组装比对工具的比较评估
Genome Biol. 2017 May 18;18(1):93. doi: 10.1186/s13059-017-1213-3.
3
Assembly reconciliation.装配核对
Bioinformatics. 2008 Jan 1;24(1):42-5. doi: 10.1093/bioinformatics/btm542. Epub 2007 Dec 5.
4
Comparison of three assembly strategies for a heterozygous seedless grapevine genome assembly.三种组装策略用于组装一个杂合无核葡萄基因组的比较。
BMC Genomics. 2018 Jan 17;19(1):57. doi: 10.1186/s12864-018-4434-2.
5
CAR: contig assembly of prokaryotic draft genomes using rearrangements.CAR:利用重排对原核生物草图基因组进行重叠群组装。
BMC Bioinformatics. 2014 Nov 28;15(1):381. doi: 10.1186/s12859-014-0381-3.
6
dnAQET: a framework to compute a consolidated metric for benchmarking quality of de novo assemblies.dnAQET:一种用于计算从头组装质量基准测试综合指标的框架。
BMC Genomics. 2019 Sep 11;20(1):706. doi: 10.1186/s12864-019-6070-x.
7
GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations.基因组 QC:基因组组装和基因结构注释的质量评估工具。
BMC Genomics. 2020 Mar 2;21(1):193. doi: 10.1186/s12864-020-6568-2.
8
Positional bias in variant calls against draft reference assemblies.针对草图参考基因组组装的变异位点调用中的位置偏差。
BMC Genomics. 2017 Mar 28;18(1):263. doi: 10.1186/s12864-017-3637-2.
9
GAM-NGS: genomic assemblies merger for next generation sequencing.GAM-NGS:用于下一代测序的基因组组装合并。
BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-14-S7-S6. Epub 2013 Apr 22.
10
SIS: a program to generate draft genome sequence scaffolds for prokaryotes.SIS:一个用于生成原核生物基因组序列草图支架的程序。
BMC Bioinformatics. 2012 May 14;13:96. doi: 10.1186/1471-2105-13-96.

引用本文的文献

1
Metagenomic and phylogenetic analyses reveal gene-level selection constrained by bacterial phylogeny, surrounding oxalate metabolism in the gut microbiota.宏基因组和系统发育分析揭示了受细菌系统发育限制的基因水平选择,这种选择围绕着肠道微生物群中的草酸代谢。
mSphere. 2025 Jun 25;10(6):e0091324. doi: 10.1128/msphere.00913-24. Epub 2025 May 13.
2
Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing.利用 RagTag 进行自动化组装支架,为高通量基因组编辑提升了一个新的番茄系统。
Genome Biol. 2022 Dec 15;23(1):258. doi: 10.1186/s13059-022-02823-7.
3
instaGRAAL: chromosome-level quality scaffolding of genomes using a proximity ligation-based scaffolder.

本文引用的文献

1
MaGuS: a tool for quality assessment and scaffolding of genome assemblies with Whole Genome Profiling™ Data.MaGuS:一种利用全基因组分析™数据进行基因组组装质量评估和搭建框架的工具。
BMC Bioinformatics. 2016 Mar 3;17:115. doi: 10.1186/s12859-016-0969-x.
2
Reconstruction of Ancestral Genomes in Presence of Gene Gain and Loss.存在基因获得与丢失情况下的祖先基因组重建
J Comput Biol. 2016 Mar;23(3):150-64. doi: 10.1089/cmb.2015.0160. Epub 2016 Feb 17.
3
Ancestral gene synteny reconstruction improves extant species scaffolding.祖先基因共线性重建改善现存物种的支架搭建。
instaGRAAL:基于邻近连接的支架的基因组染色体水平的高质量支架。
Genome Biol. 2020 Jun 18;21(1):148. doi: 10.1186/s13059-020-02041-z.
4
Evolutionary superscaffolding and chromosome anchoring to improve Anopheles genome assemblies.进化超级支架和染色体锚定以改进按蚊基因组组装。
BMC Biol. 2020 Jan 2;18(1):1. doi: 10.1186/s12915-019-0728-3.
BMC Genomics. 2015;16 Suppl 10(Suppl 10):S11. doi: 10.1186/1471-2164-16-S10-S11. Epub 2015 Oct 2.
4
Cytoscape.js: a graph theory library for visualisation and analysis.Cytoscape.js:一个用于可视化和分析的图论库。
Bioinformatics. 2016 Jan 15;32(2):309-11. doi: 10.1093/bioinformatics/btv557. Epub 2015 Sep 28.
5
Metassembler: merging and optimizing de novo genome assemblies.元组装器:合并和优化从头基因组组装
Genome Biol. 2015 Sep 24;16:207. doi: 10.1186/s13059-015-0764-4.
6
LINKS: Scalable, alignment-free scaffolding of draft genomes with long reads.链接:利用长读长对草图基因组进行可扩展的、无比对的支架搭建。
Gigascience. 2015 Aug 4;4:35. doi: 10.1186/s13742-015-0076-3. eCollection 2015.
7
FinisherSC: a repeat-aware tool for upgrading de novo assembly using long reads.FinisherSC:一种使用长读长进行从头组装升级的重复感知工具。
Bioinformatics. 2015 Oct 1;31(19):3207-9. doi: 10.1093/bioinformatics/btv280. Epub 2015 Jun 3.
8
ScaffMatch: scaffolding algorithm based on maximum weight matching.ScaffMatch:基于最大权重匹配的支架算法。
Bioinformatics. 2015 Aug 15;31(16):2632-8. doi: 10.1093/bioinformatics/btv211. Epub 2015 Apr 17.
9
ALLMAPS: robust scaffold ordering based on multiple maps.ALLMAPS:基于多图谱的稳健支架排序
Genome Biol. 2015 Jan 13;16(1):3. doi: 10.1186/s13059-014-0573-1.
10
Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes.蚊子基因组学。高度可进化的疟疾传播媒介:16种按蚊的基因组
Science. 2015 Jan 2;347(6217):1258522. doi: 10.1126/science.1258522. Epub 2014 Nov 27.