Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.

Affiliation

GlaxoSmithKline, Collegeville, PA, USA.

Publication information

J Comput Aided Mol Des. 2009 Nov;23(11):773-84. doi: 10.1007/s10822-009-9273-4. Epub 2009 Jun 20.

DOI: 10.1007/s10822-009-9273-4
PMID: 19543979
Abstract

Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well-characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullmann's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
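
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 8.0 Å distance cutoff, the one-coordinate-per-residue input, the function names, and the toy data are all assumptions made for illustration. The motif search is a plain backtracking matcher rather than Ullmann's algorithm with a graph index, and the empirical p-value is only one plausible way to compare a query's motif count with a background distribution.

```python
from itertools import combinations
from math import dist


def contact_graph(residues, cutoff=8.0):
    """Build a labeled proximity graph.

    residues: list of (residue_name, (x, y, z)) with one point per residue.
    cutoff:   assumed distance threshold in angstroms (hypothetical value).
    Returns (labels, adjacency), where adjacency maps index -> set of indices.
    """
    labels = [name for name, _ in residues]
    adj = {i: set() for i in range(len(residues))}
    for i, j in combinations(range(len(residues)), 2):
        if dist(residues[i][1], residues[j][1]) <= cutoff:
            adj[i].add(j)
            adj[j].add(i)
    return labels, adj


def has_motif(motif, graph):
    """Return True if the labeled motif occurs as a subgraph of graph.

    Simple backtracking: motif vertices are placed in index order, and each
    placement must preserve the residue label and every motif edge to an
    already placed vertex. Extra edges in the graph are allowed.
    """
    m_labels, m_adj = motif
    g_labels, g_adj = graph

    def extend(mapping):
        k = len(mapping)  # index of the next motif vertex to place
        if k == len(m_labels):
            return True
        for v in range(len(g_labels)):
            if v in mapping or g_labels[v] != m_labels[k]:
                continue
            if all(mapping[u] in g_adj[v] for u in m_adj[k] if u < k):
                if extend(mapping + [v]):
                    return True
        return False

    return extend([])


def empirical_p_value(query_hits, background_hits):
    """Share of background structures with at least as many motif hits
    as the query, with add-one smoothing so the value is never zero."""
    at_least = sum(1 for h in background_hits if h >= query_hits)
    return (at_least + 1) / (len(background_hits) + 1)


# Toy query structure with made-up coordinates (angstroms).
query = contact_graph([
    ("HIS", (0.0, 0.0, 0.0)),
    ("ASP", (5.0, 0.0, 0.0)),
    ("SER", (0.0, 5.0, 0.0)),
    ("GLY", (30.0, 0.0, 0.0)),
])

# Hypothetical family motif: SER-HIS and HIS-ASP contacts.
triad = (["SER", "HIS", "ASP"], {0: {1}, 1: {0, 2}, 2: {1}})

print(has_motif(triad, query))
print(empirical_p_value(5, [0, 1, 2, 5, 7]))
```

In a fuller version, one would count how many motifs from a family's library occur in the query and pass that count, together with the counts observed across a non-redundant background set, to `empirical_p_value`. A small value would suggest that the query carries more family-specific motifs than expected by chance.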

Similar articles

1
Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.
J Comput Aided Mol Des. 2009 Nov;23(11):773-84. doi: 10.1007/s10822-009-9273-4. Epub 2009 Jun 20.
2
Comparing graph representations of protein structure for mining family-specific residue-based packing motifs.
J Comput Biol. 2005 Jul-Aug;12(6):657-71. doi: 10.1089/cmb.2005.12.657.
3
Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications.
J Comput Aided Mol Des. 2009 Nov;23(11):785-97. doi: 10.1007/s10822-009-9277-0. Epub 2009 Jun 23.
4
Distance-based identification of structure motifs in proteins using constrained frequent subgraph mining.
Comput Syst Bioinformatics Conf. 2006:227-38.
5
Structure-based function inference using protein family-specific fingerprints.
Protein Sci. 2006 Jun;15(6):1537-43. doi: 10.1110/ps.062189906.
6
Functional neighbors: inferring relationships between nonhomologous protein families using family-specific packing motifs.
IEEE Trans Inf Technol Biomed. 2010 Sep;14(5):1137-43. doi: 10.1109/TITB.2010.2053550. Epub 2010 Jun 21.
7
Accurate classification of protein structural families using coherent subgraph analysis.
Pac Symp Biocomput. 2004:411-22. doi: 10.1142/9789812704856_0039.
8
Discovery of Functional Motifs from the Interface Region of Oligomeric Proteins Using Frequent Subgraph Mining.
IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1537-1549. doi: 10.1109/TCBB.2017.2756879. Epub 2017 Sep 26.
9
Scoring protein interaction decoys using exposed residues (SPIDER): a novel multibody interaction scoring function based on frequent geometric patterns of interfacial residues.
Proteins. 2012 Aug;80(9):2207-17. doi: 10.1002/prot.24110. Epub 2012 Jun 7.
10
Towards comprehensive structural motif mining for better fold annotation in the "twilight zone" of sequence dissimilarity.
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S46. doi: 10.1186/1471-2105-10-S1-S46.

Cited by

1
Modulating Glycoside Hydrolase Activity between Hydrolysis and Transfer Reactions Using an Evolutionary Approach.
Molecules. 2021 Oct 30;26(21):6586. doi: 10.3390/molecules26216586.
2
An amino acid code for irregular and mixed protein packing.
Proteins. 2015 Dec;83(12):2147-61. doi: 10.1002/prot.24929. Epub 2015 Oct 5.
3
Ballast: a ball-based algorithm for structural motifs.
