蛋白质基因组学：蛋白质组学在基因组注释中需要发挥的作用

Proteogenomics: needs and roles to be filled by proteomics in genome annotation.

作者信息

Ansong Charles, Purvine Samuel O, Adkins Joshua N, Lipton Mary S, Smith Richard D

机构信息

Biological Sciences Division, Pacific Northwest National Laboratory, P.O. Box 999/K8-98, Richland, WA 99352, USA.

出版信息

Brief Funct Genomic Proteomic. 2008 Jan;7(1):50-62. doi: 10.1093/bfgp/eln010. Epub 2008 Mar 10.

DOI:10.1093/bfgp/eln010

PMID:18334489

Abstract

While genome sequencing efforts reveal the basic building blocks of life, a genome sequence alone is insufficient for elucidating biological function. Genome annotation--the process of identifying genes and assigning function to each gene in a genome sequence--provides the means to elucidate biological function from sequence. Current state-of-the-art high-throughput genome annotation uses a combination of comparative (sequence similarity data) and non-comparative (ab initio gene prediction algorithms) methods to identify protein-coding genes in genome sequences. Because approaches used to validate the presence of predicted protein-coding genes are typically based on expressed RNA sequences, they cannot independently and unequivocally determine whether a predicted protein-coding gene is translated into a protein. With the ability to directly measure peptides arising from expressed proteins, high-throughput liquid chromatography-tandem mass spectrometry-based proteomics approaches can be used to verify coding regions of a genomic sequence. Here, we highlight several ways in which high-throughput tandem mass spectrometry-based proteomics can improve the quality of genome annotations and suggest that it could be efficiently applied during the gene calling process so that the improvements are propagated through the subsequent functional annotation process.

摘要

虽然基因组测序工作揭示了生命的基本组成部分，但仅靠基因组序列不足以阐明生物学功能。基因组注释——识别基因并为基因组序列中的每个基因赋予功能的过程——提供了从序列阐明生物学功能的方法。当前最先进的高通量基因组注释使用比较（序列相似性数据）和非比较（从头基因预测算法）方法的组合来识别基因组序列中的蛋白质编码基因。由于用于验证预测的蛋白质编码基因存在的方法通常基于表达的RNA序列，因此它们不能独立且明确地确定预测的蛋白质编码基因是否被翻译成蛋白质。基于高通量液相色谱 - 串联质谱的蛋白质组学方法能够直接测量由表达的蛋白质产生的肽段，可用于验证基因组序列的编码区域。在这里，我们强调了基于高通量串联质谱的蛋白质组学可以提高基因组注释质量的几种方式，并表明它可以在基因识别过程中有效应用，以便这些改进在随后的功能注释过程中得以延续。

相似文献

Proteogenomics: needs and roles to be filled by proteomics in genome annotation.蛋白质基因组学：蛋白质组学在基因组注释中需要发挥的作用

Brief Funct Genomic Proteomic. 2008 Jan;7(1):50-62. doi: 10.1093/bfgp/eln010. Epub 2008 Mar 10.

Gene model detection using mass spectrometry.使用质谱法进行基因模型检测。

Methods Mol Biol. 2010;604:137-44. doi: 10.1007/978-1-60761-444-9_10.

Proteogenomics.蛋白质基因组学。

Proteomics. 2011 Feb;11(4):620-30. doi: 10.1002/pmic.201000615. Epub 2011 Jan 18.

A perfect genome annotation is within reach with the proteomics and genomics alliance.蛋白质组学和基因组学联盟有望实现完美的基因组注释。

Curr Opin Microbiol. 2009 Jun;12(3):292-300. doi: 10.1016/j.mib.2009.03.005. Epub 2009 May 4.

Subproteomic tools to increase genome annotation complexity.用于增加基因组注释复杂性的亚蛋白质组学工具。

Proteomics. 2008 Oct;8(20):4209-13. doi: 10.1002/pmic.200800226.

Mass spectrometry at the interface of proteomics and genomics.蛋白质组学与基因组学交叉领域的质谱分析

Mol Biosyst. 2011 Feb;7(2):284-91. doi: 10.1039/c0mb00168f. Epub 2010 Oct 21.

Genome annotation of Anopheles gambiae using mass spectrometry-derived data.利用质谱衍生数据对冈比亚按蚊进行基因组注释。

BMC Genomics. 2005 Sep 19;6:128. doi: 10.1186/1471-2164-6-128.

Whole genome searching with shotgun proteomic data: applications for genome annotation.利用鸟枪法蛋白质组学数据进行全基因组搜索：在基因组注释中的应用

J Proteome Res. 2008 Jan;7(1):80-8. doi: 10.1021/pr070198n. Epub 2007 Dec 7.

C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression.秀丽隐杆线虫开放阅读框文库版本1.1：基因组注释的实验验证及蛋白质组规模蛋白质表达资源

Nat Genet. 2003 May;34(1):35-41. doi: 10.1038/ng1140.

Integrating alternative splicing detection into gene prediction.将可变剪接检测整合到基因预测中。

BMC Bioinformatics. 2005 Feb 10;6:25. doi: 10.1186/1471-2105-6-25.

引用本文的文献

Metaproteogenomic Profile of a Mesopelagic Adenylylsulfate Reductase: Course-Based Discovery Using the Ocean Protein Portal.中层腺苷硫酸盐还原酶的宏蛋白质组学特征：基于海洋蛋白质门户的课程发现。

J Proteome Res. 2023 Sep 1;22(9):2871-2879. doi: 10.1021/acs.jproteome.3c00152. Epub 2023 Aug 22.

Quantitative Proteomics Using Isobaric Labeling: A Practical Guide.定量蛋白质组学使用等重标记：实用指南。

Genomics Proteomics Bioinformatics. 2021 Oct;19(5):689-706. doi: 10.1016/j.gpb.2021.08.012. Epub 2022 Jan 8.

The genetic proteome: Using genetics to inform the proteome of mycobacterial pathogens.遗传蛋白质组学：利用遗传学来了解分枝杆菌病原体的蛋白质组。

PLoS Pathog. 2021 Jan 7;17(1):e1009124. doi: 10.1371/journal.ppat.1009124. eCollection 2021 Jan.

Discovery and Longitudinal Evaluation of Candidate Biomarkers for Ischaemic Stroke by Mass Spectrometry-Based Proteomics.基于质谱的蛋白质组学对缺血性中风候选生物标志物的发现与纵向评估

Biomark Insights. 2017 Dec 20;12:1177271917749216. doi: 10.1177/1177271917749216. eCollection 2017.

Proteogenomic Analysis Greatly Expands the Identification of Proteins Related to Reproduction in the Apogamous Fern ssp. .蛋白质基因组学分析极大地扩展了对无融合生殖蕨类植物亚种中与生殖相关蛋白质的鉴定。

Front Plant Sci. 2017 Mar 22;8:336. doi: 10.3389/fpls.2017.00336. eCollection 2017.

Impact of Solar Radiation on Gene Expression in Bacteria.太阳辐射对细菌基因表达的影响

Proteomes. 2013 Jul 16;1(2):70-86. doi: 10.3390/proteomes1020070.

Evaluating the effect of database inflation in proteogenomic search on sensitive and reliable peptide identification.评估蛋白质基因组搜索中数据库膨胀对灵敏且可靠的肽段鉴定的影响。

BMC Genomics. 2016 Dec 22;17(Suppl 13):1031. doi: 10.1186/s12864-016-3327-5.

Proteomic analysis and translational perspective of hepatocellular carcinoma: Identification of diagnostic protein biomarkers by an onco-proteogenomics approach.肝细胞癌的蛋白质组学分析与转化前景：通过肿瘤蛋白质基因组学方法鉴定诊断性蛋白质生物标志物。

Kaohsiung J Med Sci. 2016 Nov;32(11):535-544. doi: 10.1016/j.kjms.2016.09.002. Epub 2016 Nov 4.

Proteomics in India: the clinical aspect.印度的蛋白质组学：临床方面。

Clin Proteomics. 2016 Nov 5;13:21. doi: 10.1186/s12014-016-9122-0. eCollection 2016.

Dual use of peptide mass spectra: Protein atlas and genome annotation.肽质谱的双重用途：蛋白质图谱与基因组注释。

Curr Plant Biol. 2015 May 1;2:21-24. doi: 10.1016/j.cpb.2015.02.001. Epub 2015 Apr 13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

蛋白质基因组学：蛋白质组学在基因组注释中需要发挥的作用

Proteogenomics: needs and roles to be filled by proteomics in genome annotation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献