
A relational extension of the notion of motifs: application to the common 3D protein substructures searching problem.

Authors

Pisanti Nadia, Soldano Henry, Carpentier Mathilde, Pothier Joel

Affiliation

Dipartimento di Informatica, Largo B. Pontecorvo, Università di Pisa, Pisa, Italy.

Publication

J Comput Biol. 2009 Dec;16(12):1635-60. doi: 10.1089/cmb.2008.0019.

DOI: 10.1089/cmb.2008.0019
PMID: 20047489

Abstract

The geometrical configurations of atoms in protein structures can be viewed as approximate relations among them. Finding similar common substructures within a set of protein structures then belongs to a new class of problems that generalizes the problem of finding repeated motifs. The novelty lies in the addition of constraints on the motifs, expressed as relations that must hold between pairs of positions of the motifs. We hence call them relational motifs. For this class of problems, we present an algorithm that is a suitable extension of the KMR paradigm and, in particular, of KMRC, as it uses a degenerate alphabet. Our algorithm contains several improvements that become especially useful when, as is required for relational motifs, the inference is made by partially overlapping shorter motifs rather than by concatenating them. The efficiency, correctness, and completeness of the algorithm are ensured by several non-trivial properties that are proven in this paper. The algorithm has been applied in the important field of protein common 3D substructure searching. The implemented methods have also been tested on several examples of protein families, such as serine proteases, globins, and cytochromes P450. The detected motifs have been compared to those found by multiple structural alignment methods.
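
The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: `kmr_labels` is a plain KMR-style labelling on exact strings (no degenerate alphabet, so it is not KMRC), and `relations_agree` assumes, purely for illustration, that the relation between two motif positions is their Euclidean distance compared within a tolerance `tol`. Both function names and the tolerance parameter are hypothetical. The labelling step builds longer motifs from two shorter ones that may partially overlap, which is the inference style the abstract describes.

```python
from itertools import combinations
from math import dist


def kmr_labels(s, k):
    """Label each start position i so that two positions share a label
    exactly when the length-k substrings starting there are equal."""
    n = len(s)
    # Length-1 substrings: one label per distinct symbol.
    rank = {c: r for r, c in enumerate(sorted(set(s)))}
    labels = [rank[c] for c in s]
    length = 1
    while length < k:
        # Combine two known motifs of size `length`; when plain doubling
        # would overshoot k, shift by less so the two motifs overlap.
        step = min(length, k - length)
        new_len = length + step
        pairs = [(labels[i], labels[i + step]) for i in range(n - new_len + 1)]
        rank = {p: r for r, p in enumerate(sorted(set(pairs)))}
        labels = [rank[p] for p in pairs]
        length = new_len
    return labels[: max(n - k + 1, 0)]


def relations_agree(points_a, points_b, tol):
    """Assumed relational constraint: every pairwise distance among the
    positions of occurrence A matches the corresponding distance in
    occurrence B to within `tol`."""
    if len(points_a) != len(points_b):
        return False
    return all(
        abs(dist(points_a[i], points_a[j]) - dist(points_b[i], points_b[j])) <= tol
        for i, j in combinations(range(len(points_a)), 2)
    )


if __name__ == "__main__":
    # Positions 0 and 3 both start "abc", so they receive the same label.
    print(kmr_labels("abcabcab", 3))

    a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
    b = [(5.0, 5.0, 5.0), (6.0, 5.0, 5.0), (5.0, 6.0, 5.0)]  # a, translated
    print(relations_agree(a, b, tol=0.5))
```

Comparing pairwise distances rather than raw coordinates makes the check insensitive to translation and rotation of a substructure, which is one reason a relation between pairs of positions is a natural constraint for 3D motifs.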

Similar articles

1
A relational extension of the notion of motifs: application to the common 3D protein substructures searching problem.
J Comput Biol. 2009 Dec;16(12):1635-60. doi: 10.1089/cmb.2008.0019.
2
Progressive combinatorial algorithm for multiple structural alignments: application to distantly related proteins.
Proteins. 2004 May 1;55(2):436-54. doi: 10.1002/prot.10587.
3
MUSTA--a general, efficient, automated method for multiple structure alignment and detection of common motifs: application to proteins.
J Comput Biol. 2001;8(2):93-121. doi: 10.1089/106652701300312896.
4
MUSTANG: a multiple structural alignment algorithm.
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
5
SEGA: semiglobal graph alignment for structure-based protein comparison.
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1330-43. doi: 10.1109/TCBB.2011.35.
6
CAALIGN: a program for pairwise and multiple protein-structure alignment.
Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16.
7
Discovery of structural motifs using protein structural alphabets and 1D motif-finding methods.
Adv Exp Med Biol. 2010;680:117-23. doi: 10.1007/978-1-4419-5913-3_14.
8
Distance-based identification of structure motifs in proteins using constrained frequent subgraph mining.
Comput Syst Bioinformatics Conf. 2006:227-38.
9
DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment.
BMC Bioinformatics. 2005 Mar 22;6:66. doi: 10.1186/1471-2105-6-66.
10
Automated multiple structure alignment and detection of a common substructural motif.
Proteins. 2001 May 15;43(3):235-45. doi: 10.1002/prot.1034.

Cited by

1
Identification of local conformational similarity in structurally variable regions of homologous proteins using protein blocks.
PLoS One. 2011 Mar 18;6(3):e17826. doi: 10.1371/journal.pone.0017826.