• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CGAL:计算基因组组装似然值。

CGAL: computing genome assembly likelihoods.

作者信息

Rahman Atif, Pachter Lior

出版信息

Genome Biol. 2013 Jan 29;14(1):R8. doi: 10.1186/gb-2013-14-1-r8.

DOI:10.1186/gb-2013-14-1-r8
PMID:23360652
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3663106/
Abstract

Assembly algorithms have been extensively benchmarked using simulated data so that results can be compared to ground truth. However, in de novo assembly, only crude metrics such as contig number and size are typically used to evaluate assembly quality. We present CGAL, a novel likelihood-based approach to assembly assessment in the absence of a ground truth. We show that likelihood is more accurate than other metrics currently used for evaluating assemblies, and describe its application to the optimization and comparison of assembly algorithms. Our methods are implemented in software that is freely available at http://bio.math.berkeley.edu/cgal/.

摘要

组装算法已经使用模拟数据进行了广泛的基准测试,以便将结果与真实情况进行比较。然而,在从头组装中,通常仅使用诸如重叠群数量和大小等粗略指标来评估组装质量。我们提出了CGAL,这是一种在没有真实情况的情况下进行组装评估的基于似然性的新方法。我们表明,似然性比目前用于评估组装的其他指标更准确,并描述了其在组装算法优化和比较中的应用。我们的方法在软件中实现,该软件可在http://bio.math.berkeley.edu/cgal/免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/e8dcc7054385/gb-2013-14-1-r8-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/8b0149d5f594/gb-2013-14-1-r8-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/40b7b010a035/gb-2013-14-1-r8-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/950163dc066f/gb-2013-14-1-r8-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/4b9509b3b74a/gb-2013-14-1-r8-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/2da866ade929/gb-2013-14-1-r8-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/9e93f5850d75/gb-2013-14-1-r8-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/52c3160f477b/gb-2013-14-1-r8-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/203b69587e0a/gb-2013-14-1-r8-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/bca149bf2790/gb-2013-14-1-r8-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/e8dcc7054385/gb-2013-14-1-r8-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/8b0149d5f594/gb-2013-14-1-r8-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/40b7b010a035/gb-2013-14-1-r8-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/950163dc066f/gb-2013-14-1-r8-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/4b9509b3b74a/gb-2013-14-1-r8-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/2da866ade929/gb-2013-14-1-r8-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/9e93f5850d75/gb-2013-14-1-r8-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/52c3160f477b/gb-2013-14-1-r8-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/203b69587e0a/gb-2013-14-1-r8-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/bca149bf2790/gb-2013-14-1-r8-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad9e/3663106/e8dcc7054385/gb-2013-14-1-r8-10.jpg

相似文献

1
CGAL: computing genome assembly likelihoods.CGAL:计算基因组组装似然值。
Genome Biol. 2013 Jan 29;14(1):R8. doi: 10.1186/gb-2013-14-1-r8.
2
SWALO: scaffolding with assembly likelihood optimization.SWALO:具有装配可能性优化的脚手架
Nucleic Acids Res. 2021 Nov 18;49(20):e117. doi: 10.1093/nar/gkab717.
3
De novo likelihood-based measures for comparing genome assemblies.用于比较基因组组装的基于从头似然性的度量
BMC Res Notes. 2013 Aug 22;6:334. doi: 10.1186/1756-0500-6-334.
4
Assembly reconciliation.装配核对
Bioinformatics. 2008 Jan 1;24(1):42-5. doi: 10.1093/bioinformatics/btm542. Epub 2007 Dec 5.
5
SuRankCo: supervised ranking of contigs in de novo assemblies.SuRankCo:从头组装中重叠群的监督排序
BMC Bioinformatics. 2015 Jul 30;16:240. doi: 10.1186/s12859-015-0644-7.
6
OSLay: optimal syntenic layout of unfinished assemblies.OSLay:未完成组装的最优共线性布局
Bioinformatics. 2007 Jul 1;23(13):1573-9. doi: 10.1093/bioinformatics/btm153. Epub 2007 Apr 26.
7
Short read fragment assembly of bacterial genomes.细菌基因组的短读片段组装
Genome Res. 2008 Feb;18(2):324-30. doi: 10.1101/gr.7088808. Epub 2007 Dec 14.
8
LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.LR_Gapcloser:一种基于平铺路径的缺口闭合器,它使用长读长来完成基因组组装。
Gigascience. 2019 Jan 1;8(1):giy157. doi: 10.1093/gigascience/giy157.
9
Heterozygous genome assembly via binary classification of homologous sequence.通过同源序列的二元分类进行杂合基因组组装。
BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S5. doi: 10.1186/1471-2105-16-S7-S5. Epub 2015 Apr 23.
10
CISA: contig integrator for sequence assembly of bacterial genomes.CISA:用于细菌基因组序列组装的 contig 整合器。
PLoS One. 2013;8(3):e60843. doi: 10.1371/journal.pone.0060843. Epub 2013 Mar 28.

引用本文的文献

1
Sama: a contig assembler with correctness guarantee.Sama:一种具有正确性保证的重叠群组装程序。
Algorithms Mol Biol. 2025 Jun 3;20(1):9. doi: 10.1186/s13015-025-00280-y.
2
ScatTR: Estimating the Size of Long Tandem Repeat Expansions from Short-Reads.ScatTR:从短读长估计长串联重复序列的扩增大小。
bioRxiv. 2025 Feb 20:2025.02.15.638440. doi: 10.1101/2025.02.15.638440.
3
Theoretical Analysis of Sequencing Bioinformatics Algorithms and Beyond.测序生物信息学算法及其他方面的理论分析

本文引用的文献

1
Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。
Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.
2
Feature-by-feature--evaluating de novo sequence assembly.逐特征评估从头序列组装。
PLoS One. 2012;7(2):e31002. doi: 10.1371/journal.pone.0031002. Epub 2012 Feb 3.
3
GAGE: A critical evaluation of genome assemblies and assembly algorithms.盖奇:基因组组装和算法的关键评估。
Commun ACM. 2023 Jul;66(7):118-125. doi: 10.1145/3571723. Epub 2023 Jun 22.
4
Identification of errors in draft genome assemblies at single-nucleotide resolution for quality assessment and improvement.以单核苷酸分辨率识别基因组草案组装中的错误,以进行质量评估和改进。
Nat Commun. 2023 Oct 17;14(1):6556. doi: 10.1038/s41467-023-42336-w.
5
Whole-genome sequencing in medicinal plants: current progress and prospect.药用植物的全基因组测序:当前进展与展望
Sci China Life Sci. 2024 Feb;67(2):258-273. doi: 10.1007/s11427-022-2375-y. Epub 2023 Oct 12.
6
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
7
Perplexity: evaluating transcript abundance estimation in the absence of ground truth.困惑度:在缺乏真实对照的情况下评估转录本丰度估计
Algorithms Mol Biol. 2022 Mar 25;17(1):6. doi: 10.1186/s13015-022-00214-y.
8
SWALO: scaffolding with assembly likelihood optimization.SWALO:具有装配可能性优化的脚手架
Nucleic Acids Res. 2021 Nov 18;49(20):e117. doi: 10.1093/nar/gkab717.
9
Lateral Gene Transfer Shapes Diversity of spp.横向基因转移塑造了[物种名称]的多样性
Front Cell Infect Microbiol. 2020 Jun 23;10:293. doi: 10.3389/fcimb.2020.00293. eCollection 2020.
10
Toward a more holistic method of genome assembly assessment.迈向更全面的基因组组装评估方法。
BMC Bioinformatics. 2020 Jul 6;21(Suppl 4):249. doi: 10.1186/s12859-020-3382-4.
Genome Res. 2012 Mar;22(3):557-67. doi: 10.1101/gr.131383.111. Epub 2012 Jan 6.
4
Assemblathon 1: a competitive assessment of de novo short read assembly methods.Assemblathon 1:从头开始的短读序列组装方法的竞争性评估。
Genome Res. 2011 Dec;21(12):2224-41. doi: 10.1101/gr.126599.111. Epub 2011 Sep 16.
5
Mauve assembly metrics.淡紫色组装指标。
Bioinformatics. 2011 Oct 1;27(19):2756-7. doi: 10.1093/bioinformatics/btr451. Epub 2011 Aug 2.
6
Comparative studies of de novo assembly tools for next-generation sequencing technologies.新一代测序技术从头组装工具的比较研究。
Bioinformatics. 2011 Aug 1;27(15):2031-7. doi: 10.1093/bioinformatics/btr319. Epub 2011 Jun 2.
7
Comparing de novo genome assembly: the long and short of it.从头开始比较基因组组装:长与短。
PLoS One. 2011 Apr 29;6(4):e19175. doi: 10.1371/journal.pone.0019175.
8
A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies.新一代测序技术中从头基因组组装软件工具的实用比较。
PLoS One. 2011 Mar 14;6(3):e17915. doi: 10.1371/journal.pone.0017915.
9
Improving RNA-Seq expression estimates by correcting for fragment bias.通过纠正片段偏倚来提高 RNA-Seq 表达估计。
Genome Biol. 2011;12(3):R22. doi: 10.1186/gb-2011-12-3-r22. Epub 2011 Mar 16.
10
Limitations of next-generation genome sequence assembly.下一代基因组序列组装的局限性。
Nat Methods. 2011 Jan;8(1):61-5. doi: 10.1038/nmeth.1527. Epub 2010 Nov 21.