工业相关细菌自养乙醇梭菌的全基因组序列及人工注释

Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium.

作者信息

Humphreys Christopher M, McLean Samantha, Schatschneider Sarah, Millat Thomas, Henstra Anne M, Annan Florence J, Breitkopf Ronja, Pander Bart, Piatek Pawel, Rowe Peter, Wichlacz Alexander T, Woods Craig, Norman Rupert, Blom Jochen, Goesman Alexander, Hodgman Charlie, Barrett David, Thomas Neil R, Winzer Klaus, Minton Nigel P

机构信息

BBSRC/EPSRC Synthetic Biology Research Centre, School of Life Sciences, University of Nottingham, Nottingham, NG7 2RD, UK.

School of Pharmacy, University of Nottingham, Nottingham, NG7 2RD, UK.

出版信息

BMC Genomics. 2015 Dec 21;16:1085. doi: 10.1186/s12864-015-2287-5.

DOI:10.1186/s12864-015-2287-5

PMID:26692227

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4687164/

Abstract

BACKGROUND

Clostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value gaseous components into cellular building blocks and industrially relevant products via the action of the reductive acetyl-CoA (Wood-Ljungdahl) pathway. Current research efforts are focused on the enhancement and extension of product formation in this organism via synthetic biology approaches. However, crucial to metabolic modelling and directed pathway engineering is a reliable and comprehensively annotated genome sequence.

RESULTS

We performed next generation sequencing using Illumina MiSeq technology on the DSM10061 strain of Clostridium autoethanogenum and observed 243 single nucleotide discrepancies when compared to the published finished sequence (NCBI: GCA_000484505.1), with 59.1 % present in coding regions. These variations were confirmed by Sanger sequencing and subsequent analysis suggested that the discrepancies were sequencing errors in the published genome not true single nucleotide polymorphisms. This was corroborated by the observation that over 90 % occurred within homopolymer regions of greater than 4 nucleotides in length. It was also observed that many genes containing these sequencing errors were annotated in the published closed genome as encoding proteins containing frameshift mutations (18 instances) or were annotated despite the coding frame containing stop codons, which if genuine, would severely hinder the organism's ability to survive. Furthermore, we have completed a comprehensive manual curation to reduce errors in the annotation that occur through serial use of automated annotation pipelines in related species. As a result, different functions were assigned to gene products or previous functional annotations rejected because of missing evidence in various occasions.

CONCLUSIONS

We present a revised manually curated full genome sequence for Clostridium autoethanogenum DSM10061, which provides reliable information for genome-scale models that rely heavily on the accuracy of annotation, and represents an important step towards the manipulation and metabolic modelling of this industrially relevant acetogen.

摘要

背景

自养乙醇梭菌是一种产乙酸细菌，能够利用合成气中的C1气体生产高价值的商品化学品和生物燃料。这种常见的工业废气可作为该细菌的唯一能量和碳源，该细菌通过还原性乙酰辅酶A（伍德-Ljungdahl）途径的作用，将低价值的气态成分转化为细胞组成成分和具有工业相关性的产品。目前的研究工作集中在通过合成生物学方法增强和扩展该生物体中的产物形成。然而，对于代谢建模和定向途径工程而言，可靠且注释全面的基因组序列至关重要。

结果

我们使用Illumina MiSeq技术对自养乙醇梭菌DSM10061菌株进行了下一代测序，与已发表的完整序列（NCBI：GCA_000484505.1）相比，观察到243个单核苷酸差异，其中59.1%存在于编码区域。这些变异通过桑格测序得到证实，随后的分析表明这些差异是已发表基因组中的测序错误，而非真正的单核苷酸多态性。超过90%的差异出现在长度大于4个核苷酸的同聚物区域这一观察结果证实了这一点。还观察到许多包含这些测序错误的基因在已发表的封闭基因组中被注释为编码含有移码突变的蛋白质（18例），或者尽管编码框中含有终止密码子仍被注释，如果这些是真实的，将严重阻碍该生物体的生存能力。此外，我们完成了全面的人工校正，以减少在相关物种中连续使用自动注释管道时出现的注释错误。结果，不同的功能被赋予基因产物，或者由于各种情况下缺乏证据而拒绝了先前的功能注释。

结论

我们提供了一份经过人工校正的自养乙醇梭菌DSM10061修订全基因组序列，该序列为严重依赖注释准确性的基因组规模模型提供了可靠信息，并且代表了朝着对这种具有工业相关性的产乙酸菌进行操作和代谢建模迈出的重要一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/4687164/066cabd1aaed/12864_2015_2287_Fig1_HTML.jpg

相似文献

Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium.

BMC Genomics. 2015 Dec 21;16:1085. doi: 10.1186/s12864-015-2287-5.

Reconstruction of Acetogenesis Pathway Using Short-Read Sequencing of Clostridium aceticum Genome.

J Nanosci Nanotechnol. 2015 May;15(5):3852-61. doi: 10.1166/jnn.2015.9537.

Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia.

Biotechnol Biofuels. 2014 Mar 21;7:40. doi: 10.1186/1754-6834-7-40. eCollection 2014.

Manual curation and reannotation of the genomes of Clostridium difficile 630Δerm and C. difficile 630.

J Med Microbiol. 2017 Mar;66(3):286-293. doi: 10.1099/jmm.0.000427.

Enhanced whole genome sequence and annotation of Clostridium stercorarium DSM8532T using RNA-seq transcriptomics and high-throughput proteomics.

BMC Genomics. 2014 Jul 7;15(1):567. doi: 10.1186/1471-2164-15-567.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

Required Gene Set for Autotrophic Growth of .

Appl Environ Microbiol. 2022 Apr 12;88(7):e0247921. doi: 10.1128/aem.02479-21. Epub 2022 Mar 14.

Formate-Dependent Acetogenic Utilization of Glucose by the Fecal Acetogen .

Appl Environ Microbiol. 2020 Nov 10;86(23). doi: 10.1128/AEM.01870-20.

Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies.

Sci Data. 2015 Apr 14;2:150014. doi: 10.1038/sdata.2015.14. eCollection 2015.

Complete genome sequence of a malodorant-producing acetogen, Clostridium scatologenes ATCC 25775(T).

J Biotechnol. 2015 Oct 20;212:19-20. doi: 10.1016/j.jbiotec.2015.07.013. Epub 2015 Jul 22.

引用本文的文献

Reverse-Engineered Gas-Fermenting Acetogen Strains Recover Enhanced Phenotypes From Autotrophic Adaptive Laboratory Evolution.

Microb Biotechnol. 2025 Aug;18(8):e70208. doi: 10.1111/1751-7915.70208.

Evaluation of Clostridium autoethanogenum protein as a new protein source for broiler chickens in replacement of soybean meal.

Anim Biosci. 2024 Jul;37(7):1236-1245. doi: 10.5713/ab.23.0419. Epub 2024 Apr 1.

Recent progress in engineering to synthesize the biochemicals and biocommodities.

Synth Syst Biotechnol. 2023 Dec 15;9(1):19-25. doi: 10.1016/j.synbio.2023.12.001. eCollection 2024 Mar.

Pleiotropic Regulator GssR Positively Regulates Autotrophic Growth of Gas-Fermenting .

Microorganisms. 2023 Jul 31;11(8):1968. doi: 10.3390/microorganisms11081968.

Base editing enables duplex point mutagenesis in at the price of numerous off-target mutations.

Front Bioeng Biotechnol. 2023 Jul 10;11:1211197. doi: 10.3389/fbioe.2023.1211197. eCollection 2023.

Deletion of genes linked to the C-fixing gene cluster affects growth, by-products, and proteome of .

Front Bioeng Biotechnol. 2023 May 15;11:1167892. doi: 10.3389/fbioe.2023.1167892. eCollection 2023.

Effects of Fishmeal Replacement by Protein Meal on Cholesterol Bile Acid Metabolism, Antioxidant Capacity, Hepatic and Intestinal Health of Pearl Gentian Grouper ( ♀ × ♂).

Animals (Basel). 2023 Mar 18;13(6):1090. doi: 10.3390/ani13061090.

Dietary Effect of Protein on Growth, Intestinal Histology and Flesh Lipid Metabolism of Largemouth Bass () Based on Metabolomics.

Metabolites. 2022 Nov 9;12(11):1088. doi: 10.3390/metabo12111088.

Replacement of dietary fish meal with meal on growth performance, intestinal amino acids transporters, protein metabolism and hepatic lipid metabolism of juvenile turbot ( L.).

Front Physiol. 2022 Aug 24;13:981750. doi: 10.3389/fphys.2022.981750. eCollection 2022.

isopropanol production native plasmid pCA replicon.

Front Bioeng Biotechnol. 2022 Aug 5;10:932363. doi: 10.3389/fbioe.2022.932363. eCollection 2022.

本文引用的文献

Genome Wide Re-Annotation of Caldicellulosiruptor saccharolyticus with New Insights into Genes Involved in Biomass Degradation and Hydrogen Production.

PLoS One. 2015 Jul 21;10(7):e0133183. doi: 10.1371/journal.pone.0133183. eCollection 2015.

Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies.

Sci Data. 2015 Apr 14;2:150014. doi: 10.1038/sdata.2015.14. eCollection 2015.

Nanopore-based fourth-generation DNA sequencing technology.

Genomics Proteomics Bioinformatics. 2015 Feb;13(1):4-16. doi: 10.1016/j.gpb.2015.01.009. Epub 2015 Mar 2.

Update on RefSeq microbial genomes resources.

Nucleic Acids Res. 2015 Jan;43(Database issue):D599-605. doi: 10.1093/nar/gku1062. Epub 2014 Dec 15.

Development of microorganisms for cellulose-biofuel consolidated bioprocessings: metabolic engineers' tricks.

Comput Struct Biotechnol J. 2012 Nov 8;3:e201210007. doi: 10.5936/csbj.201210007. eCollection 2012.

Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia.

Biotechnol Biofuels. 2014 Mar 21;7:40. doi: 10.1186/1754-6834-7-40. eCollection 2014.

InterProScan 5: genome-scale protein function classification.

Bioinformatics. 2014 May 1;30(9):1236-40. doi: 10.1093/bioinformatics/btu031. Epub 2014 Jan 21.

Microbial phylogenetic profiling with the Pacific Biosciences sequencing platform.

Microbiome. 2013 Mar 4;1(1):10. doi: 10.1186/2049-2618-1-10.

Pfam: the protein families database.

Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223. Epub 2013 Nov 27.

Reducing assembly complexity of microbial genomes with single-molecule sequencing.

Genome Biol. 2013;14(9):R101. doi: 10.1186/gb-2013-14-9-r101.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

工业相关细菌自养乙醇梭菌的全基因组序列及人工注释

Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献