• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

细菌基因组测序:使用开源工具进行测序、从头组装和快速分析。

Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.

机构信息

Institute of Technology, Tartu University, Nooruse 1, Tartu 50411, Estonia.

出版信息

BMC Genomics. 2013 Apr 1;14:211. doi: 10.1186/1471-2164-14-211.

DOI:10.1186/1471-2164-14-211
PMID:23547799
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3618134/
Abstract

BACKGROUND

De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (<450 bps), which are presumed to aid in the analysis of uncharacterized genomes. The array of tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom.

RESULTS

The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes.

CONCLUSIONS

Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize unknown bacteria with modest effort.

摘要

背景

对以前未被描述的微生物进行从头基因组测序,有可能通过深入了解功能能力和生物多样性,为微生物基因组学开辟新的前沿。直到最近,罗氏 454 焦磷酸测序仍是从头组装的首选 NGS 方法,因为它生成了数十万条长读长(<450 bp),这些读长被认为有助于分析未被描述的基因组。用于处理 NGS 数据的工具套件越来越多是免费和开源的,并且经常因其高质量和在促进学术自由方面的作用而被采用。

结果

对 Alcanivorax borkumensis 基因组进行焦磷酸测序的错误率导致数千个插入和缺失被人为地引入到完成的基因组中。尽管覆盖率很高(~30 倍),但它并没有允许参考基因组完全被映射。来自有错误的区域的reads 质量低、覆盖度低或缺失。参考映射的主要缺陷是通过低于 100%的一致性将人为的 indels 引入到 contigs 中,并由于人为的终止密码子而导致基因调用分散。没有组装程序能够执行与参考映射相当的从头组装。自动化注释工具在参考映射和从头草案基因组上的表现相似,并注释了从头组装的草案基因组中大多数 CDS。

结论

用于 NGS 数据组装和注释的免费和开源软件(FOSS)工具正在迅速发展,以提供更少计算工作量的准确结果。可用性不是高优先级,这些工具目前不允许在没有人工干预的情况下处理数据。尽管如此,基因组组装程序现在可以轻松地将中等短读长组装成长 contigs(>97-98%的基因组覆盖率)。焦磷酸测序技术的一个显著缺陷是碱基对调用的质量以及在同一核苷酸位置处单读长之间的冲突碱基对。尽管如此,使用未完成且仍然碎片化为数十个 contigs 的草稿全基因组仍然可以让人们以适度的努力来描述未知细菌。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef9e/3618134/b880610a5012/1471-2164-14-211-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef9e/3618134/1c2861be3843/1471-2164-14-211-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef9e/3618134/b880610a5012/1471-2164-14-211-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef9e/3618134/1c2861be3843/1471-2164-14-211-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef9e/3618134/b880610a5012/1471-2164-14-211-2.jpg

相似文献

1
Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.细菌基因组测序:使用开源工具进行测序、从头组装和快速分析。
BMC Genomics. 2013 Apr 1;14:211. doi: 10.1186/1471-2164-14-211.
2
NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly.NeatFreq:用于从头序列组装的无参考数据缩减和覆盖度归一化
BMC Bioinformatics. 2014 Nov 19;15(1):357. doi: 10.1186/s12859-014-0357-3.
3
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
4
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
5
Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.单轮循环器:从短读长和长读长测序数据中解析细菌基因组组装结果
PLoS Comput Biol. 2017 Jun 8;13(6):e1005595. doi: 10.1371/journal.pcbi.1005595. eCollection 2017 Jun.
6
Evaluation and Validation of Assembling Corrected PacBio Long Reads for Microbial Genome Completion via Hybrid Approaches.通过混合方法组装校正后的PacBio长读段以完成微生物基因组的评估与验证
PLoS One. 2015 Dec 7;10(12):e0144305. doi: 10.1371/journal.pone.0144305. eCollection 2015.
7
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies.利用短读长读测序技术进行尿液细菌的从头杂交基因组组装,生成完整基因组。
J Vis Exp. 2021 Aug 20(174). doi: 10.3791/62872.
8
A combined de novo assembly approach increases the quality of prokaryotic draft genomes.一种联合从头组装方法提高了原核草案基因组的质量。
Folia Microbiol (Praha). 2022 Oct;67(5):801-810. doi: 10.1007/s12223-022-00980-7. Epub 2022 Jun 6.
9
GAPPadder: a sensitive approach for closing gaps on draft genomes with short sequence reads.GAPPadder:一种使用短序列读长来闭合草图基因组缺口的灵敏方法。
BMC Genomics. 2019 Jun 6;20(Suppl 5):426. doi: 10.1186/s12864-019-5703-4.
10
GapFiller: a de novo assembly approach to fill the gap within paired reads.GapFiller:一种从头开始的组装方法,用于填补配对读取中的缺口。
BMC Bioinformatics. 2012;13 Suppl 14(Suppl 14):S8. doi: 10.1186/1471-2105-13-S14-S8. Epub 2012 Sep 7.

引用本文的文献

1
Acquisition of plasmids from Shiga toxin-producing Escherichia coli strains had low or neutral fitness cost on commensal E. coli.从产志贺毒素大肠杆菌菌株中获得质粒对共生大肠杆菌的适应性代价较低或无影响。
Braz J Microbiol. 2024 Jun;55(2):1297-1304. doi: 10.1007/s42770-024-01269-2. Epub 2024 Feb 24.
2
Molecular source attribution.分子溯源
PLoS Comput Biol. 2022 Nov 17;18(11):e1010649. doi: 10.1371/journal.pcbi.1010649. eCollection 2022 Nov.
3
Genome-Wide Searching Single Nucleotide-Polymorphisms (SNPs) and SNPs-Targeting a Multiplex Primer for Identification of Common Serotypes.

本文引用的文献

1
Performance comparison of benchtop high-throughput sequencing platforms.桌面高通量测序平台的性能比较。
Nat Biotechnol. 2012 May;30(5):434-9. doi: 10.1038/nbt.2198.
2
Complete genome sequence of the fish pathogen Flavobacterium branchiophilum.鱼类病原体嗜纤维菌的全基因组序列。
Appl Environ Microbiol. 2011 Nov;77(21):7656-62. doi: 10.1128/AEM.05625-11. Epub 2011 Sep 16.
3
RATT: Rapid Annotation Transfer Tool.RATT:快速注释转移工具。
全基因组搜索单核苷酸多态性(SNP)以及用于鉴定常见血清型的靶向SNP的多重引物
Pathogens. 2022 Sep 21;11(10):1075. doi: 10.3390/pathogens11101075.
4
Black pepper and tarragon essential oils suppress the lipolytic potential and the type II secretion system of P. psychrophila KM02.黑胡椒和龙蒿精油抑制了嗜冷菌 KM02 的脂肪酶活性和 II 型分泌系统。
Sci Rep. 2022 Mar 31;12(1):5487. doi: 10.1038/s41598-022-09311-9.
5
Development of Single Nucleotide Polymorphism (SNP)-Based Triplex PCR Marker for Serotype-specific Detection.用于血清型特异性检测的基于单核苷酸多态性(SNP)的三重PCR标记物的开发
Pathogens. 2022 Jan 19;11(2):115. doi: 10.3390/pathogens11020115.
6
Hybrid Atypical Enteropathogenic and Extraintestinal (aEPEC/ExPEC) BA1250 Strain: A Draft Genome.混合型非典型肠道致病性和肠外致病性(aEPEC/ExPEC)BA1250菌株:基因组草图
Pathogens. 2021 Apr 14;10(4):475. doi: 10.3390/pathogens10040475.
7
Evolution of Microbial Genomics: Conceptual Shifts over a Quarter Century.微生物基因组学的演变:二十五年来的概念转变。
Trends Microbiol. 2021 Jul;29(7):582-592. doi: 10.1016/j.tim.2021.01.005. Epub 2021 Feb 1.
8
Using affinity propagation clustering for identifying bacterial clades and subclades with whole-genome sequences of Francisella tularensis.使用亲和传播聚类分析鉴定土拉弗朗西斯菌全基因组序列中的细菌进化枝和亚进化枝。
PLoS Negl Trop Dis. 2020 Sep 29;14(9):e0008018. doi: 10.1371/journal.pntd.0008018. eCollection 2020 Sep.
9
A Novel Strain UTNGt21O Isolated from Wild Fruit: Genome Sequence and Characterization of a Peptide with Highly Inhibitory Potential toward Gram-Negative Bacteria.从野生水果中分离出的新型菌株UTNGt21O:对革兰氏阴性菌具有高度抑制潜力的肽的基因组序列与特性分析
Foods. 2020 Sep 5;9(9):1242. doi: 10.3390/foods9091242.
10
Genomic comparison of diverse Salmonella serovars isolated from swine.从猪身上分离的不同血清型沙门氏菌的基因组比较。
PLoS One. 2019 Nov 1;14(11):e0224518. doi: 10.1371/journal.pone.0224518. eCollection 2019.
Nucleic Acids Res. 2011 May;39(9):e57. doi: 10.1093/nar/gkq1268. Epub 2011 Feb 8.
4
Genome (re-)annotation and open-source annotation pipelines.基因组(重新)注释与开源注释流程。
Microb Biotechnol. 2010 Jul;3(4):362-9. doi: 10.1111/j.1751-7915.2010.00191.x.
5
Using comparative genome analysis to identify problems in annotated microbial genomes.利用比较基因组分析鉴定注释微生物基因组中的问题。
Microbiology (Reading). 2010 Jul;156(Pt 7):1909-1917. doi: 10.1099/mic.0.033811-0. Epub 2010 Apr 29.
6
Finishing genomes with limited resources: lessons from an ensemble of microbial genomes.有限资源下的基因组完成:来自微生物基因组集成的经验教训。
BMC Genomics. 2010 Apr 16;11:242. doi: 10.1186/1471-2164-11-242.
7
Assembly algorithms for next-generation sequencing data.下一代测序数据的组装算法。
Genomics. 2010 Jun;95(6):315-27. doi: 10.1016/j.ygeno.2010.03.001. Epub 2010 Mar 6.
8
Genome reannotation of Escherichia coli CFT073 with new insights into virulence.对具有新毒力见解的大肠杆菌 CFT073 进行基因组重新注释。
BMC Genomics. 2009 Nov 22;10:552. doi: 10.1186/1471-2164-10-552.
9
The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.《基因组在线数据库(GOLD)》2009 年报告:基因组和宏基因组项目及其相关元数据的现状。
Nucleic Acids Res. 2010 Jan;38(Database issue):D346-54. doi: 10.1093/nar/gkp848. Epub 2009 Nov 13.
10
Novel features of the polysaccharide-digesting gliding bacterium Flavobacterium johnsoniae as revealed by genome sequence analysis.基因组序列分析揭示了多糖消化滑行细菌黄杆菌的新特征。
Appl Environ Microbiol. 2009 Nov;75(21):6864-75. doi: 10.1128/AEM.01495-09. Epub 2009 Aug 28.