揭示细菌基因组中小蛋白的隐藏宇宙。

Unraveling the hidden universe of small proteins in bacterial genomes.

机构信息

EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.

Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.

出版信息

Mol Syst Biol. 2019 Feb 22;15(2):e8290. doi: 10.15252/msb.20188290.

Abstract

Identification of small open reading frames (smORFs) encoding small proteins (≤ 100 amino acids; SEPs) is a challenge in the fields of genome annotation and protein discovery. Here, by combining a novel bioinformatics tool (RanSEPs) with "-omics" approaches, we were able to describe 109 bacterial small ORFomes. Predictions were first validated by performing an exhaustive search of SEPs present in proteome via mass spectrometry, which illustrated the limitations of shotgun approaches. Then, RanSEPs predictions were validated and compared with other tools using proteomic datasets from different bacterial species and SEPs from the literature. We found that up to 16 ± 9% of proteins in an organism could be classified as SEPs. Integration of RanSEPs predictions with transcriptomics data showed that some annotated non-coding RNAs could in fact encode for SEPs. A functional study of SEPs highlighted an enrichment in the membrane, translation, metabolism, and nucleotide-binding categories. Additionally, 9.7% of the SEPs included a N-terminus predicted signal peptide. We envision RanSEPs as a tool to unmask the hidden universe of small bacterial proteins.

摘要

鉴定编码小于 100 个氨基酸的小蛋白质(SEP)的小开放阅读框(smORF)是基因组注释和蛋白质发现领域的一个挑战。在这里,我们通过将一种新的生物信息学工具(RanSEPs)与“组学”方法相结合,成功地描述了 109 个细菌的小 ORF 组。首先,通过对通过质谱法检测到的蛋白质组中存在的 SEPs 进行穷尽搜索,对预测结果进行了验证,这说明了 shotgun 方法的局限性。然后,使用来自不同细菌物种的蛋白质组数据集和文献中的 SEPs 对 RanSEPs 预测结果进行了验证和比较。我们发现,一个生物体中多达 16 ± 9%的蛋白质可以被归类为 SEPs。将 RanSEPs 预测结果与转录组数据相结合表明,一些注释的非编码 RNA 实际上可以编码 SEPs。对 SEPs 的功能研究表明,它们在膜、翻译、代谢和核苷酸结合等类别中富集。此外,9.7%的 SEPs 包含一个预测的 N 端信号肽。我们设想 RanSEPs 是一种揭示隐藏的小细菌蛋白质宇宙的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7828/6385055/ee9777064852/MSB-15-e8290-g003.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索