Versatile genome assembly evaluation with QUAST-LG.

Affiliation

Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia.

Publication information

Bioinformatics. 2018 Jul 1;34(13):i142-i150. doi: 10.1093/bioinformatics/bty266.

DOI:10.1093/bioinformatics/bty266
PMID:29949969
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6022658/
Abstract

MOTIVATION

The emergence of high-throughput sequencing technologies revolutionized genomics in the early 2000s. The next revolution came with the era of long-read sequencing. These technological advances, along with novel computational approaches, became the next step towards automatic pipelines capable of assembling nearly complete mammalian-size genomes.

RESULTS

In this manuscript, we demonstrate the performance of state-of-the-art genome assembly software on six eukaryotic datasets sequenced using different technologies. To evaluate the results, we developed QUAST-LG, a tool that compares large genomic de novo assemblies against reference sequences and computes relevant quality metrics. Since genomes generally cannot be reconstructed completely due to complex repeat patterns and low-coverage regions, we introduce the concept of an upper bound assembly for a given genome and set of reads, and compute theoretical limits on assembly correctness and completeness. Using QUAST-LG, we show how close the assemblies are to the theoretical optimum, and how far this optimum is from the finished reference.
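As a rough illustration of the kind of quality metric an assembly evaluator computes, the sketch below calculates N50 and NG50, two contiguity statistics that QUAST-family reports include. It is a minimal sketch under stated assumptions, not QUAST-LG's implementation: the function names `nx`, `n50`, and `ng50` and the toy contig lengths are hypothetical, and the abstract itself does not name specific metrics.

```python
from typing import Sequence


def nx(lengths: Sequence[int], total: int, fraction: float = 0.5) -> int:
    """Largest length L such that contigs of length >= L cover at least
    fraction * total bases. Returns 0 if the contigs never reach that target."""
    target = fraction * total
    covered = 0
    # Walk contigs from longest to shortest, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        covered += length
        if covered >= target:
            return length
    return 0


def n50(lengths: Sequence[int]) -> int:
    """N50: the threshold is half of the total assembly length."""
    return nx(lengths, sum(lengths))


def ng50(lengths: Sequence[int], genome_size: int) -> int:
    """NG50: the threshold is half of the (known or estimated) genome size."""
    return nx(lengths, genome_size)


if __name__ == "__main__":
    contigs = [100, 80, 60, 40, 20]  # hypothetical contig lengths in bp
    print(n50(contigs))        # half of 300 is 150, reached at the 80 bp contig
    print(ng50(contigs, 500))  # half of 500 is 250, reached at the 40 bp contig
```

Because NG50 is normalized by genome size rather than assembly size, it stays comparable across assemblies of the same genome even when they differ in total length.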

AVAILABILITY AND IMPLEMENTATION

http://cab.spbu.ru/software/quast-lg.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9059/6022658/51a2d9eff887/bty266f1.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9059/6022658/e3e704acb438/bty266f2.jpg

Similar articles

1
Versatile genome assembly evaluation with QUAST-LG.
Bioinformatics. 2018 Jul 1;34(13):i142-i150. doi: 10.1093/bioinformatics/bty266.
2
QUAST: quality assessment tool for genome assemblies.
Bioinformatics. 2013 Apr 15;29(8):1072-5. doi: 10.1093/bioinformatics/btt086. Epub 2013 Feb 19.
3
Icarus: visualizer for de novo assembly evaluation.
Bioinformatics. 2016 Nov 1;32(21):3321-3323. doi: 10.1093/bioinformatics/btw379. Epub 2016 Jul 4.
4
Subset selection of high-depth next generation sequencing reads for de novo genome assembly using MapReduce framework.
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2164-16-S12-S9. Epub 2015 Dec 9.
5
FinisherSC: a repeat-aware tool for upgrading de novo assembly using long reads.
Bioinformatics. 2015 Oct 1;31(19):3207-9. doi: 10.1093/bioinformatics/btv280. Epub 2015 Jun 3.
6
scanPAV: a pipeline for extracting presence-absence variations in genome pairs.
Bioinformatics. 2018 Sep 1;34(17):3022-3024. doi: 10.1093/bioinformatics/bty189.
7
A spectral algorithm for fast de novo layout of uncorrected long nanopore reads.
Bioinformatics. 2017 Oct 15;33(20):3188-3194. doi: 10.1093/bioinformatics/btx370.
8
Telescoper: de novo assembly of highly repetitive regions.
Bioinformatics. 2012 Sep 15;28(18):i311-i317. doi: 10.1093/bioinformatics/bts399.
9
Benchmarking of long-read sequencing, assemblers and polishers for yeast genome.
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac146.
10
SQUAT: a Sequencing Quality Assessment Tool for data quality assessments of genome assemblies.
BMC Genomics. 2019 Apr 18;19(Suppl 9):238. doi: 10.1186/s12864-019-5445-3.

Cited by

1
Multi-platform metagenomic characterization of the microbial community during spontaneous cacao fermentation.
Front Bioeng Biotechnol. 2025 Aug 26;13:1630515. doi: 10.3389/fbioe.2025.1630515. eCollection 2025.
2
Genome analysis of in Norway, 2016-2023, reveals shifting epidemiology in the wake of the COVID-19 pandemic.
Microb Genom. 2025 Sep;11(9). doi: 10.1099/mgen.0.001479.
3
Emergence of a carbapenem-resistant atypical uropathogenic Escherichia coli clone as an increasing cause of urinary tract infection.
Nat Commun. 2025 Sep 2;16(1):8200. doi: 10.1038/s41467-025-63477-0.
4
Unique plastisphere viromes with habitat-dependent potential for modulating global methane cycle.
Nat Commun. 2025 Aug 29;16(1):8098. doi: 10.1038/s41467-025-63215-6.
5
Bacteremia Caused by a Putative Novel Species in the Genus : A Case Report and Genomic Analysis.
Life (Basel). 2025 Aug 3;15(8):1227. doi: 10.3390/life15081227.
6
Surveillance and Characterization of Vancomycin-Resistant and Vancomycin-Variable Enterococci in a Hospital Setting.
Antibiotics (Basel). 2025 Aug 4;14(8):795. doi: 10.3390/antibiotics14080795.
7
Discovery of New Everninomicin Analogs from a Marine-Derived sp. by Metabolomics and Genomics Approaches.
Mar Drugs. 2025 Jul 31;23(8):316. doi: 10.3390/md23080316.
8
hitchhikes on gliding colonies of .
ISME Commun. 2025 Jul 16;5(1):ycaf118. doi: 10.1093/ismeco/ycaf118. eCollection 2025 Jan.
9
Comprehensive genomic analysis of Xenorhabdus bovienii strain MEL2.2.
PLoS One. 2025 Aug 25;20(8):e0331132. doi: 10.1371/journal.pone.0331132. eCollection 2025.
10
First Assembly of a Draft Genome of the Critically Endangered Northern Muriqui (, Primates, Atelidae) Including Non-Invasive Genotyping Strategies for the Species.
Ecol Evol. 2025 Aug 19;15(8):e71356. doi: 10.1002/ece3.71356. eCollection 2025 Aug.

References

1
Assembly of long, error-prone reads using repeat graphs.
Nat Biotechnol. 2019 May;37(5):540-546. doi: 10.1038/s41587-019-0072-8. Epub 2019 Apr 1.
2
SvABA: genome-wide detection of structural variants and indels by local assembly.
Genome Res. 2018 Apr;28(4):581-591. doi: 10.1101/gr.221028.117. Epub 2018 Mar 13.
3
MUMmer4: A fast and versatile genome alignment system.
PLoS Comput Biol. 2018 Jan 26;14(1):e1005944. doi: 10.1371/journal.pcbi.1005944. eCollection 2018 Jan.
4
Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.
Nat Methods. 2017 Nov;14(11):1063-1071. doi: 10.1038/nmeth.4458. Epub 2017 Oct 2.
5
KMC 3: counting and manipulating k-mer statistics.
Bioinformatics. 2017 Sep 1;33(17):2759-2761. doi: 10.1093/bioinformatics/btx304.
6
Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
Genome Res. 2017 May;27(5):722-736. doi: 10.1101/gr.215087.116. Epub 2017 Mar 15.
7
ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter.
Genome Res. 2017 May;27(5):768-777. doi: 10.1101/gr.214346.116. Epub 2017 Feb 23.
8
Fast and accurate de novo genome assembly from long uncorrected reads.
Genome Res. 2017 May;27(5):737-746. doi: 10.1101/gr.214270.116. Epub 2017 Jan 18.
9
Assembly of long error-prone reads using de Bruijn graphs.
Proc Natl Acad Sci U S A. 2016 Dec 27;113(52):E8396-E8405. doi: 10.1073/pnas.1604560113. Epub 2016 Dec 12.
10
KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies.
Bioinformatics. 2017 Feb 15;33(4):574-576. doi: 10.1093/bioinformatics/btw663.