Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations.

Author information

Merrihew Gennifer E, Davis Colleen, Ewing Brent, Williams Gary, Käll Lukas, Frewen Barbara E, Noble William Stafford, Green Phil, Thomas James H, MacCoss Michael J

Affiliation

University of Washington, Department of Genome Sciences, Seattle, Washington 98195, USA.

Publication information

Genome Res. 2008 Oct;18(10):1660-9. doi: 10.1101/gr.077644.108. Epub 2008 Jul 24.

Abstract

We describe a general mass spectrometry-based approach for gene annotation of any organism and demonstrate its effectiveness using the nematode Caenorhabditis elegans. We detected 6779 C. elegans proteins (67,047 peptides), including 384 that, although annotated in WormBase WS150, lacked cDNA or other prior experimental support. We also identified 429 new coding sequences that were unannotated in WS150. Nearly half (192/429) of the new coding sequences were confirmed with RT-PCR data. Thirty-three (approximately 8%) of the new coding sequences had been predicted to be pseudogenes, 151 (approximately 35%) reveal apparent errors in gene models, and 245 (57%) appear to be novel genes. In addition, we verified 6010 exon-exon splice junctions within existing WormBase gene models. Our work confirms that mass spectrometry is a powerful experimental tool for annotating sequenced genomes. In addition, the collection of identified peptides should facilitate future proteomics experiments targeted at specific proteins of interest.
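As a quick arithmetic check of the breakdown quoted in the abstract, the following sketch recomputes each share of the 429 new coding sequences from the stated counts. The variable and function names are illustrative assumptions and do not come from the paper; only the counts themselves are taken from the abstract.

```python
# Counts quoted in the abstract for the 429 new coding sequences.
# All names below are hypothetical labels chosen for this sketch.
new_cds_total = 429
categories = {
    "predicted pseudogene": 33,
    "gene model error": 151,
    "apparent novel gene": 245,
}
rt_pcr_confirmed = 192


def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# The three categories should partition the full set of new coding sequences.
assert sum(categories.values()) == new_cds_total

shares = {name: percent(n, new_cds_total) for name, n in categories.items()}
rt_pcr_share = percent(rt_pcr_confirmed, new_cds_total)

for name, share in shares.items():
    print(f"{name}: {share}%")
print(f"RT-PCR confirmed: {rt_pcr_share}%")
```

The three categories sum to exactly 429, and the recomputed shares agree with the rounded figures in the abstract (approximately 8%, approximately 35%, and 57%), with the RT-PCR-confirmed fraction somewhat under one half.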
