REPARATION：核糖体谱分析辅助的细菌基因组（重新）注释

REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes.

作者信息

Ndah Elvis, Jonckheere Veronique, Giess Adam, Valen Eivind, Menschaert Gerben, Van Damme Petra

机构信息

VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium.

Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium.

出版信息

Nucleic Acids Res. 2017 Nov 16;45(20):e168. doi: 10.1093/nar/gkx758.

DOI:10.1093/nar/gkx758

PMID:28977509

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5714196/

Abstract

Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames.

摘要

原核生物基因组注释高度依赖自动化方法，因为人工注释无法跟上测序基因组呈指数级增长的速度。当前的自动化方法严重依赖序列组成，并且常常低估蛋白质组的复杂性。我们开发了核糖体剖析辅助（重新）注释（REPARATION），这是一种从头开始的机器学习算法，它利用核糖体剖析（Ribo-seq）的实验性蛋白质合成证据来描绘细菌中已翻译的开放阅读框（ORF），而不依赖于基因组注释（https://github.com/Biobix/REPARATION）。REPARATION评估基因组中所有可能的ORF，并基于生长曲线模型估计最小阈值以筛选假阳性ORF。我们将REPARATION应用于三种已注释的细菌物种，以获得它们翻译图谱的更全面映射，以支持实验数据。在所有情况下，我们都鉴定出了数百个新的（小）ORF，包括先前注释的ORF的变体，并且REPARATION预测所有注释的蛋白质编码ORF（及其变体）中有超过70%会被翻译。我们的预测得到了匹配的质谱蛋白质组学数据、序列组成和保守性分析的支持。REPARATION的独特之处在于，它利用实验性翻译证据在细菌基因组中内在地进行从头ORF描绘，而不考虑与开放阅读框相关的序列特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a918/5714196/d5aac59ce795/gkx758fig1.jpg

相似文献

REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes.

Nucleic Acids Res. 2017 Nov 16;45(20):e168. doi: 10.1093/nar/gkx758.

Common and phylogenetically widespread coding for peptides by bacterial small RNAs.

BMC Genomics. 2017 Jul 21;18(1):553. doi: 10.1186/s12864-017-3932-y.

smORFer: a modular algorithm to detect small ORFs in prokaryotes.

Nucleic Acids Res. 2021 Sep 7;49(15):e89. doi: 10.1093/nar/gkab477.

RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria.

Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab549.

Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium.

BMC Genomics. 2011 Aug 25;12:433. doi: 10.1186/1471-2164-12-433.

Identification of Translation Start Sites in Bacterial Genomes.

Methods Mol Biol. 2021;2252:27-55. doi: 10.1007/978-1-0716-1150-0_2.

OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes.

Nucleic Acids Res. 2019 Jan 8;47(D1):D403-D410. doi: 10.1093/nar/gky936.

Detecting actively translated open reading frames in ribosome profiling data.

Nat Methods. 2016 Feb;13(2):165-70. doi: 10.1038/nmeth.3688. Epub 2015 Dec 14.

GeneLook: a novel ab initio gene identification system suitable for automated annotation of prokaryotic sequences.

Gene. 2005 Feb 14;346:115-25. doi: 10.1016/j.gene.2004.10.018. Epub 2005 Jan 26.

De novo annotation and characterization of the translatome with ribosome profiling data.

Nucleic Acids Res. 2018 Jun 1;46(10):e61. doi: 10.1093/nar/gky179.

引用本文的文献

Investigation of the global translational response to oxidative stress in the model archaeon reveals untranslated small RNAs with ribosome occupancy.

bioRxiv. 2025 Jul 13:2025.04.08.647799. doi: 10.1101/2025.04.08.647799.

De novo gene birth and the conundrum of ORFan genes in bacteria.

Genome Res. 2025 Aug 1;35(8):1679-1688. doi: 10.1101/gr.280157.124.

Unraveling N-Terminal Proteoform Interactomes via Multiplexed Recombineering in Salmonella.

Methods Mol Biol. 2025;2953:81-102. doi: 10.1007/978-1-0716-4694-6_6.

Complementary Ribo-seq approaches map the translatome and provide a small protein census in the foodborne pathogen Campylobacter jejuni.

Nat Commun. 2025 Mar 30;16(1):3078. doi: 10.1038/s41467-025-58329-w.

Bioprospecting of culturable marine biofilm bacteria for novel antimicrobial peptides.

Imeta. 2024 Oct 17;3(6):e244. doi: 10.1002/imt2.244. eCollection 2024 Dec.

Uncovering the small proteome of Methanosarcina mazei using Ribo-seq and peptidomics under different nitrogen conditions.

Nat Commun. 2024 Oct 6;15(1):8659. doi: 10.1038/s41467-024-53008-8.

Proteins à la carte: riboproteogenomic exploration of bacterial N-terminal proteoform expression.

mBio. 2024 Apr 10;15(4):e0033324. doi: 10.1128/mbio.00333-24. Epub 2024 Mar 21.

Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges.

ACS Omega. 2024 Feb 19;9(9):9921-9945. doi: 10.1021/acsomega.3c05913. eCollection 2024 Mar 5.

ORFeus: a computational method to detect programmed ribosomal frameshifts and other non-canonical translation events.

BMC Bioinformatics. 2023 Dec 13;24(1):471. doi: 10.1186/s12859-023-05602-8.

Exposing the small protein load of bacterial life.

FEMS Microbiol Rev. 2023 Nov 1;47(6). doi: 10.1093/femsre/fuad063.

本文引用的文献

Identification of Unannotated Small Genes in .

G3 (Bethesda). 2017 Mar 10;7(3):983-989. doi: 10.1534/g3.116.036939.

Comparative survey of the relative impact of mRNA features on local ribosome profiling read density.

Nat Commun. 2016 Oct 4;7:12915. doi: 10.1038/ncomms12915.

Estimation of ribosome profiling performance and reproducibility at various levels of resolution.

Biol Direct. 2016 May 10;11:24. doi: 10.1186/s13062-016-0127-4.

Redefining the Translational Status of 80S Monosomes.

Cell. 2016 Feb 11;164(4):757-69. doi: 10.1016/j.cell.2016.01.003.

RiboGalaxy: A browser based platform for the alignment, analysis and visualization of ribosome profiling data.

RNA Biol. 2016;13(3):316-9. doi: 10.1080/15476286.2016.1141862. Epub 2016 Jan 29.

Clarifying the Translational Pausing Landscape in Bacteria by Ribosome Profiling.

Cell Rep. 2016 Feb 2;14(4):686-694. doi: 10.1016/j.celrep.2015.12.073. Epub 2016 Jan 14.

Detecting actively translated open reading frames in ribosome profiling data.

Nat Methods. 2016 Feb;13(2):165-70. doi: 10.1038/nmeth.3688. Epub 2015 Dec 14.

A Regression-Based Analysis of Ribosome-Profiling Data Reveals a Conserved Complexity to Mammalian Translation.

Mol Cell. 2015 Dec 3;60(5):816-827. doi: 10.1016/j.molcel.2015.11.013.

sORFs.org: a repository of small ORFs identified by ribosome profiling.

Nucleic Acids Res. 2016 Jan 4;44(D1):D324-9. doi: 10.1093/nar/gkv1175. Epub 2015 Nov 2.

The use of duplex-specific nuclease in ribosome profiling and a user-friendly software package for Ribo-seq data analysis.

RNA. 2015 Oct;21(10):1731-45. doi: 10.1261/rna.052548.115. Epub 2015 Aug 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

REPARATION：核糖体谱分析辅助的细菌基因组（重新）注释

REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

REPARATION：核糖体谱分析辅助的细菌基因组（重新）注释

REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献