• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

低分子量蛋白质:后基因组研究面临的一项挑战。

Low molecular weight proteins: a challenge for post-genomic research.

作者信息

Rudd K E, Humphery-Smith I, Wasinger V C, Bairoch A

机构信息

Department of Biochemistry and Molecular Biology, University of Miami School of Medicine, FL 33101-6129, USA.

出版信息

Electrophoresis. 1998 Apr;19(4):536-44. doi: 10.1002/elps.1150190413.

DOI:10.1002/elps.1150190413
PMID:9588799
Abstract

The EcoGene project involves the examination of Escherichia coli K-12 DNA sequences and accompanying annotation in the public databases in order to refine the representation and prediction of the entire set of E. coli K-12 chromosomally encoded protein sequences. The results of this ongoing effort have been deposited in the SWISSPROT protein sequence database as sequencing of the E. coli genome has progressed to completion in recent years. Through this continuing research, we have discovered that the prediction of low molecular weight (small) proteins, arbitrarily defined as protein sequences < or = 150 amino acids (aa) in length, is problematic and requires special attention. We describe the small protein subset of EcoGene and the approach used to derive this subset from the complete E. coli genome sequence and database annotations. These E. coli proteins have helped to identify new small genes in other organisms and to identify conserved residues (motifs) using database searches and multiple alignments. Two thirds of the E. coli small proteins have not been characterized experimentally. The careful application of computer and laboratory methods to the analysis of small proteins is needed for accurate prediction, verification and characterization. The problem of accurate protein sequence identification is not limited to small proteins or to E. coli; these problems are encountered to varying degrees throughout all sequence databases.

摘要

EcoGene项目涉及对公共数据库中大肠杆菌K-12的DNA序列及相关注释进行研究,以便优化对大肠杆菌K-12染色体编码的整个蛋白质序列集的表示和预测。近年来,随着大肠杆菌基因组测序工作的逐步完成,这项正在进行的研究结果已存入SWISSPROT蛋白质序列数据库。通过这项持续的研究,我们发现对低分子量(小)蛋白质(任意定义为长度小于或等于150个氨基酸(aa)的蛋白质序列)的预测存在问题,需要特别关注。我们描述了EcoGene中的小蛋白质子集以及从完整的大肠杆菌基因组序列和数据库注释中获取该子集的方法。这些大肠杆菌蛋白质有助于在其他生物体中鉴定新的小基因,并通过数据库搜索和多重比对来鉴定保守残基(基序)。三分之二的大肠杆菌小蛋白质尚未经过实验表征。为了进行准确的预测、验证和表征,需要谨慎应用计算机和实验室方法来分析小蛋白质。准确的蛋白质序列鉴定问题不仅限于小蛋白质或大肠杆菌;在所有序列数据库中都会不同程度地遇到这些问题。

相似文献

1
Low molecular weight proteins: a challenge for post-genomic research.低分子量蛋白质:后基因组研究面临的一项挑战。
Electrophoresis. 1998 Apr;19(4):536-44. doi: 10.1002/elps.1150190413.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia coli K-12.比较大肠杆菌K-12基因组中编码蛋白质的预测特性和观察到的特性。
Electrophoresis. 1997 Aug;18(8):1259-313. doi: 10.1002/elps.1150180807.
4
CoSMoS: Conserved Sequence Motif Search in the proteome.CoSMoS:蛋白质组中保守序列基序搜索
BMC Bioinformatics. 2006 Jan 24;7:37. doi: 10.1186/1471-2105-7-37.
5
Microbial source tracking by DNA sequence analysis of the Escherichia coli malate dehydrogenase gene.通过大肠杆菌苹果酸脱氢酶基因的DNA序列分析进行微生物源追踪
J Microbiol Methods. 2006 Dec;67(3):507-26. doi: 10.1016/j.mimet.2006.04.026. Epub 2006 Sep 14.
6
Bacterial genome reengineering.细菌基因组改造
Methods Mol Biol. 2011;765:3-25. doi: 10.1007/978-1-61779-197-0_1.
7
A novel genetic island of meningitic Escherichia coli K1 containing the ibeA invasion gene (GimA): functional annotation and carbon-source-regulated invasion of human brain microvascular endothelial cells.一株含有ibeA侵袭基因(GimA)的脑膜炎致病性大肠杆菌K1的新型基因岛:功能注释及对人脑血管内皮细胞的碳源调控侵袭
Funct Integr Genomics. 2001 Sep;1(5):312-22. doi: 10.1007/s101420100039.
8
Evolution of prokaryotic subtilases: genome-wide analysis reveals novel subfamilies with different catalytic residues.原核枯草杆菌蛋白酶的进化:全基因组分析揭示具有不同催化残基的新亚家族。
Proteins. 2007 May 15;67(3):681-94. doi: 10.1002/prot.21290.
9
Only one of the two annotated Lactococcus lactis fabG genes encodes a functional beta-ketoacyl-acyl carrier protein reductase.在两个已注释的乳酸乳球菌fabG基因中,只有一个编码有功能的β-酮酰基-酰基载体蛋白还原酶。
Biochemistry. 2004 Sep 21;43(37):11782-9. doi: 10.1021/bi0487600.
10
Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity.利用序列-结构-功能范式对大肠杆菌基因组进行功能分析:鉴定具有谷氧还蛋白/硫氧还蛋白二硫键氧化还原酶活性的蛋白质。
J Mol Biol. 1998 Oct 2;282(4):703-11. doi: 10.1006/jmbi.1998.2061.

引用本文的文献

1
Enterohemorrhagic responds to gut microbiota metabolites by altering metabolism and activating stress responses.肠出血性大肠杆菌通过改变代谢和激活应激反应来响应肠道微生物群代谢物。
Gut Microbes. 2023 Jan-Dec;15(1):2190303. doi: 10.1080/19490976.2023.2190303.
2
Serratia marcescens DUF1471-Containing Protein SrfN Is Needed for Adaptation to Acid and Oxidative Stresses.含有 DUF1471 的粘质沙雷氏菌蛋白 SrfN 是适应酸和氧化应激所必需的。
mSphere. 2022 Dec 21;7(6):e0021222. doi: 10.1128/msphere.00212-22. Epub 2022 Oct 11.
3
Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
为了深入了解金黄色葡萄球菌中小蛋白的隐藏世界,我们采用了蛋白质基因组学方法。
PLoS Genet. 2021 Jun 1;17(6):e1009585. doi: 10.1371/journal.pgen.1009585. eCollection 2021 Jun.
4
Arginine-Rich Small Proteins with a Domain of Unknown Function, DUF1127, Play a Role in Phosphate and Carbon Metabolism of Agrobacterium tumefaciens.富含精氨酸的小蛋白,具有未知功能域 DUF1127,在根癌农杆菌的磷酸盐和碳代谢中发挥作用。
J Bacteriol. 2020 Oct 22;202(22). doi: 10.1128/JB.00309-20.
5
Small Proteome.小蛋白质组
EcoSal Plus. 2020 May;9(1). doi: 10.1128/ecosalplus.ESP-0031-2019.
6
Computational discovery and annotation of conserved small open reading frames in fungal genomes.计算发现和注释真菌基因组中的保守小开放阅读框。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):551. doi: 10.1186/s12859-018-2550-2.
7
Coordinated Hibernation of Transcriptional and Translational Apparatus during Growth Transition of Escherichia coli to Stationary Phase.大肠杆菌生长转变至稳定期过程中转录和翻译装置的协同休眠
mSystems. 2018 Sep 11;3(5). doi: 10.1128/mSystems.00057-18. eCollection 2018 Sep-Oct.
8
A Small Periplasmic Protein with a Hydrophobic C-Terminal Residue Enhances DegP Proteolysis as a Suicide Activator.一个具有疏水性 C 末端残基的小周质蛋白作为自杀激活剂增强 DegP 蛋白水解。
J Bacteriol. 2018 Jan 10;200(3). doi: 10.1128/JB.00519-17. Print 2018 Feb 1.
9
The KP1_4563 gene is regulated by the cAMP receptor protein and controls type 3 fimbrial function in Klebsiella pneumoniae NTUH-K2044.KP1_4563基因受环磷酸腺苷受体蛋白调控,并控制肺炎克雷伯菌NTUH-K2044中的3型菌毛功能。
PLoS One. 2017 Jul 21;12(7):e0180666. doi: 10.1371/journal.pone.0180666. eCollection 2017.
10
Identification of Unannotated Small Genes in .在……中未注释小基因的鉴定
G3 (Bethesda). 2017 Mar 10;7(3):983-989. doi: 10.1534/g3.116.036939.