SASpector: analysis of missing genomic regions in draft genomes of prokaryotes.

Affiliations

Department of Microbial and Molecular Systems, KU Leuven, 3001 Leuven, Belgium.

Department of Biosystems, KU Leuven, 3001 Leuven, Belgium.

Publication information

Bioinformatics. 2022 May 13;38(10):2920-2921. doi: 10.1093/bioinformatics/btac208.

DOI:10.1093/bioinformatics/btac208
PMID:35561201
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9113259/
Abstract

SUMMARY

Missing regions in short-read assemblies of prokaryote genomes are often attributed to biases in sequencing technologies and to repetitive elements, the former resulting in low sequencing coverage of certain loci and the latter in unresolved loops in the de novo assembly graph. We developed SASpector, a command-line tool that compares short-read assemblies (draft genomes) to their corresponding closed assemblies and extracts missing regions to analyze them at the sequence and functional level. SASpector makes it possible to benchmark the need for resolved genomes, can be integrated into pipelines to control the quality of assemblies, and could be used for comparative investigations of missingness in assemblies for which both short-read and long-read data are available in the public databases.
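The core idea of extracting missing regions can be illustrated with a minimal Python sketch. This is not SASpector's implementation: the abstract does not describe how the draft is aligned to the closed assembly, so the sketch assumes the aligned blocks are already available as reference coordinates. The function names (`merge_intervals`, `missing_regions`, `gc_content`) and the toy sequence are hypothetical and exist only for illustration.

```python
from typing import List, Tuple

# Half-open [start, end) intervals in 0-based reference coordinates (an assumption).
Interval = Tuple[int, int]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or adjacent intervals into a sorted, disjoint list."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extend the previous interval when the new one touches or overlaps it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def missing_regions(ref_length: int, aligned: List[Interval]) -> List[Interval]:
    """Return the reference intervals not covered by any aligned draft block."""
    gaps: List[Interval] = []
    cursor = 0  # first reference position not yet known to be covered
    for start, end in merge_intervals(aligned):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < ref_length:
        gaps.append((cursor, ref_length))
    return gaps


def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy closed reference and hypothetical aligned blocks from a draft assembly.
reference = "ATGCGCGCATATATATGGCCGGCCTTAA"
aligned_blocks = [(0, 8), (6, 10), (16, 24)]

gaps = missing_regions(len(reference), aligned_blocks)
for start, end in gaps:
    region = reference[start:end]
    print(f"missing {start}-{end}: {region} (GC={gc_content(region):.2f})")
```

In this toy case the uncovered stretches are AT-rich, the kind of sequence-level property that an analysis of missing regions might report. A real comparison would work from whole-genome alignments and annotated features rather than hand-written intervals.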

AVAILABILITY AND IMPLEMENTATION

SASpector is available at https://github.com/LoGT-KULeuven/SASpector. The tool is implemented in Python3 and available through pip and Docker (0mician/saspector).

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e36/9113259/ef98c08ccbcc/btac208f1.jpg

Similar articles

1
SASpector: analysis of missing genomic regions in draft genomes of prokaryotes.
Bioinformatics. 2022 May 13;38(10):2920-2921. doi: 10.1093/bioinformatics/btac208.
2
ARBitR: an overlap-aware genome assembly scaffolder for linked reads.
Bioinformatics. 2021 Aug 9;37(15):2203-2205. doi: 10.1093/bioinformatics/btaa975.
3
MsPAC: a tool for haplotype-phased structural variant detection.
Bioinformatics. 2020 Feb 1;36(3):922-924. doi: 10.1093/bioinformatics/btz618.
4
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.
BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.
5
ntJoin: Fast and lightweight assembly-guided scaffolding using minimizer graphs.
Bioinformatics. 2020 Jun 1;36(12):3885-3887. doi: 10.1093/bioinformatics/btaa253.
6
HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly.
Bioinformatics. 2017 Aug 15;33(16):2577-2579. doi: 10.1093/bioinformatics/btx220.
7
Versatile genome assembly evaluation with QUAST-LG.
Bioinformatics. 2018 Jul 1;34(13):i142-i150. doi: 10.1093/bioinformatics/bty266.
8
Figbird: a probabilistic method for filling gaps in genome assemblies.
Bioinformatics. 2022 Aug 2;38(15):3717-3724. doi: 10.1093/bioinformatics/btac404.
9
ARCS: scaffolding genome drafts with linked reads.
Bioinformatics. 2018 Mar 1;34(5):725-731. doi: 10.1093/bioinformatics/btx675.
10
NextPolish: a fast and efficient genome polishing tool for long-read assembly.
Bioinformatics. 2020 Apr 1;36(7):2253-2255. doi: 10.1093/bioinformatics/btz891.

Cited by

1
Bioprospecting of 101 facultative rumen bacterial isolates through comprehensive genome analysis.
Mol Biol Rep. 2025 Feb 27;52(1):265. doi: 10.1007/s11033-025-10291-y.
