• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

作为蛋白质功能预测器的敏感序列比较。

Sensitive sequence comparison as protein function predictor.

作者信息

Pawłowski K, Jaroszewski L, Rychlewski L, Godzik A

机构信息

Burnham Institute, La Jolla, CA 92037, USA.

出版信息

Pac Symp Biocomput. 2000:42-53. doi: 10.1142/9789814447331_0005.

DOI:10.1142/9789814447331_0005
PMID:10902155
Abstract

Protein function assignments based on postulated homology as recognized by high sequence similarity are used routinely in genome analysis. Improvements in sensitivity of sequence comparison algorithms got to the point, that proteins with previously undetectable sequence similarity, such as for instance 10-15% of identical residues, sometimes can be classified as similar. What is the relation between such proteins? Is it possible that they are homologous? What is the practical significance of detecting such similarities? A simplified analysis of the relation between sequence similarity and function similarity is presented here for the well-characterized proteins from the E. coli genome. Using a simple measure of functional similarity based on E.C. classification of enzymes, it is shown that it correlates well with sequence similarity measured by statistical significance of the alignment score. Proteins, similar by this standard, even in cases of low sequence identity, have a much larger chance of having similar function than the randomly chosen protein pairs. Interesting exceptions to these rules are discussed.

摘要

基于高序列相似性所识别的假定同源性进行蛋白质功能分配,在基因组分析中经常使用。序列比较算法灵敏度的提高达到了这样的程度,即具有先前无法检测到的序列相似性的蛋白质,例如10 - 15%的相同残基,有时可以被归类为相似。这些蛋白质之间有什么关系?它们有可能是同源的吗?检测到这种相似性的实际意义是什么?这里针对大肠杆菌基因组中特征明确的蛋白质,对序列相似性和功能相似性之间的关系进行了简化分析。使用基于酶的E.C.分类的简单功能相似性度量方法,结果表明它与通过比对分数的统计显著性测量的序列相似性密切相关。按照这个标准相似的蛋白质,即使在序列同一性较低的情况下,比起随机选择的蛋白质对,具有相似功能的可能性要大得多。文中讨论了这些规则的有趣例外情况。

相似文献

1
Sensitive sequence comparison as protein function predictor.作为蛋白质功能预测器的敏感序列比较。
Pac Symp Biocomput. 2000:42-53. doi: 10.1142/9789814447331_0005.
2
Widespread protein sequence similarities: origins of Escherichia coli genes.广泛的蛋白质序列相似性:大肠杆菌基因的起源
J Bacteriol. 1995 Mar;177(6):1585-8. doi: 10.1128/jb.177.6.1585-1588.1995.
3
Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.评估基因组学中的注释转移:通过传统分数和概率分数量化蛋白质序列、结构与功能之间的关系。
J Mol Biol. 2000 Mar 17;297(1):233-49. doi: 10.1006/jmbi.2000.3550.
4
Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment.自动化蛋白质序列数据库分类。I. 组成相似性搜索、局部相似性搜索和多序列比对的整合
Bioinformatics. 1998;14(2):164-73. doi: 10.1093/bioinformatics/14.2.164.
5
Using the FASTA program to search protein and DNA sequence databases.使用FASTA程序搜索蛋白质和DNA序列数据库。
Methods Mol Biol. 1994;25:365-89. doi: 10.1385/0-89603-276-0:365.
6
Including biological literature improves homology search.纳入生物学文献可改善同源性搜索。
Pac Symp Biocomput. 2001:374-83. doi: 10.1142/9789814447362_0037.
7
Improving the quality of twilight-zone alignments.提高暮光区对准的质量。
Protein Sci. 2000 Aug;9(8):1487-96. doi: 10.1110/ps.9.8.1487.
8
An integrated approach to the analysis and modeling of protein sequences and structures. II. On the relationship between sequence and structural similarity for proteins that are not obviously related in sequence.蛋白质序列与结构分析及建模的综合方法。II. 关于序列无明显关联的蛋白质的序列与结构相似性之间的关系。
J Mol Biol. 2000 Aug 18;301(3):679-89. doi: 10.1006/jmbi.2000.3974.
9
[Genome analysis on the basis of protein structures].基于蛋白质结构的基因组分析
Tanpakushitsu Kakusan Koso. 2001 Aug;46(11 Suppl):1496-503.
10
Effective protein sequence comparison.有效的蛋白质序列比较。
Methods Enzymol. 1996;266:227-58. doi: 10.1016/s0076-6879(96)66017-0.

引用本文的文献

1
Comparative analysis of organophosphate degrading enzymes from diverse species.
Bioinformation. 2010 Jul 6;5(2):67-72. doi: 10.6026/97320630005067.
2
Evolutionary innovations and the organization of protein functions in genotype space.进化创新与基因型空间中蛋白质功能的组织。
PLoS One. 2010 Nov 30;5(11):e14172. doi: 10.1371/journal.pone.0014172.
3
Quantitative assessment of relationship between sequence similarity and function similarity.序列相似性与功能相似性之间关系的定量评估。
BMC Genomics. 2007 Jul 9;8:222. doi: 10.1186/1471-2164-8-222.
4
Towards complete sets of farnesylated and geranylgeranylated proteins.迈向法尼基化和香叶基香叶基化蛋白质的完整集合。
PLoS Comput Biol. 2007 Apr 6;3(4):e66. doi: 10.1371/journal.pcbi.0030066. Epub 2007 Feb 23.
5
Protein-protein interactions more conserved within species than across species.蛋白质与蛋白质之间的相互作用在物种内部比在物种之间更为保守。
PLoS Comput Biol. 2006 Jul 21;2(7):e79. doi: 10.1371/journal.pcbi.0020079. Epub 2006 May 18.
6
Role-similarity based functional prediction in networked systems: application to the yeast proteome.网络系统中基于角色相似性的功能预测:应用于酵母蛋白质组
J R Soc Interface. 2005 Sep 22;2(4):327-33. doi: 10.1098/rsif.2005.0046.
7
The HHpred interactive server for protein homology detection and structure prediction.用于蛋白质同源性检测和结构预测的HHpred交互式服务器。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W244-8. doi: 10.1093/nar/gki408.
8
Structural genomics: computational methods for structure analysis.结构基因组学:用于结构分析的计算方法。
Protein Sci. 2003 Sep;12(9):1813-21. doi: 10.1110/ps.0242903.
9
Molecular analysis of the multiple GroEL proteins of Chlamydiae.衣原体多种热休克蛋白60的分子分析。
J Bacteriol. 2003 Mar;185(6):1958-66. doi: 10.1128/JB.185.6.1958-1966.2003.
10
Sequence conserved for subcellular localization.亚细胞定位的序列保守区。
Protein Sci. 2002 Dec;11(12):2836-47. doi: 10.1110/ps.0207402.