A proteogenomic analysis of Shigella flexneri using 2D LC-MALDI TOF/TOF.

Affiliation

State Key Laboratory for Molecular Virology and Genetic Engineering, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, PR China.

Publication information

BMC Genomics. 2011 Oct 28;12:528. doi: 10.1186/1471-2164-12-528.

DOI:10.1186/1471-2164-12-528
PMID:22032405
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3219829/
Abstract

BACKGROUND

New strategies for high-throughput sequencing are constantly appearing, leading to a great increase in the number of completely sequenced genomes. Unfortunately, computational genome annotation is out of step with this progress. Thus, the accurate annotation of these genomes has become a bottleneck of knowledge acquisition.

RESULTS

We exploited a proteogenomic approach to improve conventional genome annotation by integrating proteomic data with genomic information. Using Shigella flexneri 2a as a model, we identified a total of 823 proteins, including 187 hypothetical proteins. Among them, three annotated ORFs were extended upstream through comprehensive analysis against an in-house N-terminal extension database. Two genes, which could not be translated to their full length because of stop codon 'mutations' induced by genome sequencing errors, were revised and annotated as fully functional genes. Above all, seven new ORFs were discovered, which were not predicted in S. flexneri 2a str. 301 by any other annotation approach. The transcripts of four novel ORFs were confirmed by RT-PCR assay. Additionally, most of these novel ORFs were overlapping genes, some even nested within the coding region of other known genes.
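
The abstract does not describe how the N-terminal extension database was built. As a rough illustration only, one common way to generate upstream-extension candidates is to walk back from an annotated start codon, one codon at a time, until the nearest in-frame stop codon. The Python sketch below assumes a forward-strand DNA string and 0-based coordinates; the function names `upstream_extension_start` and `extended_region` are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: not the authors' pipeline.
# Assumes forward-strand DNA, 0-based coordinates, standard stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def upstream_extension_start(genome: str, annotated_start: int) -> int:
    """Return the index of the first codon after the nearest in-frame
    upstream stop codon, or the earliest in-frame position if none exists."""
    pos = annotated_start
    # Step back one codon at a time, staying in the annotated reading frame.
    while pos - 3 >= 0:
        codon = genome[pos - 3:pos].upper()
        if codon in STOP_CODONS:
            break  # the extension cannot cross a stop codon
        pos -= 3
    return pos


def extended_region(genome: str, annotated_start: int, annotated_end: int) -> str:
    """Return the annotated ORF plus its maximal in-frame upstream extension.
    annotated_end is exclusive."""
    start = upstream_extension_start(genome, annotated_start)
    return genome[start:annotated_end]
```

Translating each extended region and appending it to the search database would let peptides that map upstream of an annotated start support an N-terminal extension. A real pipeline would also need to handle the reverse strand and alternative bacterial start codons, which this sketch omits.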

CONCLUSIONS

Our findings demonstrate that current Shigella genome annotation methods are not perfect and need to be improved. Apart from the validation of predicted genes at the protein level, the additional features of proteogenomic tools include revision of annotation errors and discovery of novel ORFs. The complementary dataset could provide more targets for those interested in Shigella to perform functional studies.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b988/3219829/d1b30c9dbb33/1471-2164-12-528-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b988/3219829/aa4f17a773da/1471-2164-12-528-1.jpg
