• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质中三维共同子结构的成对和多重识别。

Pairwise and multiple identification of three-dimensional common substructures in proteins.

作者信息

Escalier V, Pothier J, Soldano H, Viari A

机构信息

Atelier de BioInformatique, Paris, France.

出版信息

J Comput Biol. 1998 Spring;5(1):41-56. doi: 10.1089/cmb.1998.5.41.

DOI:10.1089/cmb.1998.5.41
PMID:9541870
Abstract

In this paper, we present an algorithm to find three-dimensional substructures common to two or more molecules. The basic algorithm is devoted to pairwise structural comparison. Given two sets of atomic coordinates, it finds the largest subsets of atoms which are "similar" in the sense that all internal distances are approximately conserved. The basic idea of the algorithm is to recursively build subsets of increasing sizes, combining two sets of size k to build a set of size k + 1. The algorithm can be used "as is" for small molecules or local parts of proteins (about 30 atoms). When a high number of atoms is involved, we use a two step procedure. First we look for common "local" fragments by using the previous algorithm, and then we gather these fragments by using a Branch and Bound technique. We also extend the basic algorithm to perform multiple comparisons, by using one of the structures as a reference point (pivot) to which all other structures are compared. The solution is the largest subsets of atoms common to the pivot and at least q other structures. Although both algorithms are theoretically exponential in the number of atoms, experiments performed on biological data and using realistic parameters show that the solution is obtained within a few minutes. Finally, an application to the determination of the structural core of seven globins is presented.

摘要

在本文中,我们提出了一种算法,用于找出两个或多个分子共有的三维子结构。基本算法致力于成对结构比较。给定两组原子坐标,它会找到原子的最大子集,这些原子在所有内部距离大致保持不变的意义上是“相似的”。该算法的基本思想是递归地构建尺寸不断增大的子集,将两个尺寸为k的集合组合起来构建一个尺寸为k + 1的集合。该算法可直接用于小分子或蛋白质的局部区域(约30个原子)。当涉及大量原子时,我们采用两步法。首先,我们使用先前的算法寻找共同的“局部”片段,然后使用分支定界技术收集这些片段。我们还扩展了基本算法以进行多重比较,通过将其中一个结构用作参考点(枢轴),将所有其他结构与之比较。结果是枢轴与至少q个其他结构共有的原子的最大子集。尽管这两种算法在理论上随原子数量呈指数增长,但对生物数据进行的实验以及使用实际参数表明,几分钟内就能得到结果。最后,展示了该算法在确定七种球蛋白结构核心方面的应用。

相似文献

1
Pairwise and multiple identification of three-dimensional common substructures in proteins.蛋白质中三维共同子结构的成对和多重识别。
J Comput Biol. 1998 Spring;5(1):41-56. doi: 10.1089/cmb.1998.5.41.
2
MUSTA--a general, efficient, automated method for multiple structure alignment and detection of common motifs: application to proteins.MUSTA——一种用于多结构比对和共同基序检测的通用、高效、自动化方法:应用于蛋白质。
J Comput Biol. 2001;8(2):93-121. doi: 10.1089/106652701300312896.
3
MUSTANG: a multiple structural alignment algorithm.MUSTANG:一种多重结构比对算法。
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
4
Protein structure comparison by alignment of distance matrices.通过距离矩阵比对进行蛋白质结构比较。
J Mol Biol. 1993 Sep 5;233(1):123-38. doi: 10.1006/jmbi.1993.1489.
5
Finding an average core structure: application to the globins.寻找平均核心结构:在珠蛋白中的应用。
Proc Int Conf Intell Syst Mol Biol. 1994;2:19-27.
6
Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels.基于三级结构比较的多蛋白序列比对:全局和残基置信水平的赋值
Proteins. 1992 Oct;14(2):309-23. doi: 10.1002/prot.340140216.
7
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
8
A local alignment method for protein structure motifs.一种用于蛋白质结构基序的局部比对方法。
J Mol Biol. 1993 Oct 5;233(3):488-97. doi: 10.1006/jmbi.1993.1526.
9
Automated multiple structure alignment and detection of a common substructural motif.自动多结构比对及常见子结构基序的检测。
Proteins. 2001 May 15;43(3):235-45. doi: 10.1002/prot.1034.
10
CAALIGN: a program for pairwise and multiple protein-structure alignment.CAALIGN:一个用于蛋白质结构两两比对和多序列比对的程序。
Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16.

引用本文的文献

1
Estimation of protein function using template-based alignment of enzyme active sites.基于酶活性位点模板比对估算蛋白质功能。
BMC Bioinformatics. 2014 Mar 27;15:87. doi: 10.1186/1471-2105-15-87.
2
Ligand scaffold hopping combining 3D maximal substructure search and molecular similarity.配体骨架跃迁结合 3D 最大子结构搜索和分子相似性。
BMC Bioinformatics. 2009 Aug 11;10:245. doi: 10.1186/1471-2105-10-245.
3
Multiple structural alignment by secondary structures: algorithm and applications.基于二级结构的多重结构比对:算法与应用
Protein Sci. 2003 Nov;12(11):2492-507. doi: 10.1110/ps.03200603.