• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

几何相似子结构的提取:最小二乘法与切比雪夫拟合以及差异距离矩阵

Extraction of geometrically similar substructures: least-squares and Chebyshev fitting and the difference distance matrix.

作者信息

Lesk A M

机构信息

Department of Haematology, University of Cambridge Clinical School, United Kingdom.

出版信息

Proteins. 1998 Nov 15;33(3):320-8.

PMID:9829692
Abstract

In analysis, comparison and classification of conformations of proteins, a common computational task involves extractions of similar substructures. Structural comparisons are usually based on either of two measures of similarity: the root-mean-square (r.m.s.) deviation upon optimal superposition, or the maximal element of the difference distance matrix. The analysis presented here clarifies the relationships between different measures of structural similarity, and can provide a basis for developing algorithms and software to extract all maximal common well-fitting substructures from proteins. Given atomic coordinates of two proteins, many methods have been described for extracting some substantial (if not provably maximal) common substructure with low r.m.s. deviation. This is a relatively easy task compared with the problem addressed here, i.e., that of finding all common substructures with r.m.s. deviation less than a prespecified threshold. The combinatorial problems associated with similar subset extraction are more tractable if expressed in terms of the maximal element of the difference distance matrix than in terms of the r.m.s. deviation. However, it has been difficult to correlate these alternative measures of structural similarity. The purpose of this article is to make this connection. We first introduce a third measure of structural similarity: the maximum distance between corresponding pairs of points after superposition to minimize this value. This corresponds to fitting in the Chebyshev norm. Properties of Chebyshev superposition are derived. We describe relationships between the r.m.s. and minimax (Chebyshev) deviations upon optimal superposition, and between the Chebyshev deviation and the maximal element of the difference distance matrix. Combining these produces a relationship between the r.m.s. deviation upon optimal superposition and the maximal element of the difference distance matrix. Based on these results, we can apply algorithms and software for finding subsets of the difference distance matrix for which all elements are less than a specified bound, either to select only subsets for which the r.m.s.deviation is less than or equal to a specified threshold, or to select subsets that include all subsets for which the r.m.s. deviation is less than or equal to a threshold.

摘要

在对蛋白质构象进行分析、比较和分类时,一项常见的计算任务涉及提取相似的子结构。结构比较通常基于两种相似性度量中的一种:最优叠加后的均方根(r.m.s.)偏差,或差异距离矩阵的最大元素。本文所呈现的分析阐明了不同结构相似性度量之间的关系,并可为开发从蛋白质中提取所有最大公共适配良好子结构的算法和软件提供基础。给定两种蛋白质的原子坐标,已经描述了许多方法来提取具有低均方根偏差的一些实质(即使不是可证明最大)公共子结构。与这里所解决的问题相比,这是一项相对容易的任务,即找到所有均方根偏差小于预先指定阈值的公共子结构。如果用差异距离矩阵的最大元素来表示,与相似子集提取相关的组合问题比用均方根偏差来表示更易于处理。然而,一直难以关联这些结构相似性的替代度量。本文的目的就是建立这种联系。我们首先引入第三种结构相似性度量:叠加后对应点对之间的最大距离,以使该值最小化。这对应于切比雪夫范数下的拟合。推导了切比雪夫叠加的性质。我们描述了最优叠加时均方根偏差与极小极大(切比雪夫)偏差之间的关系,以及切比雪夫偏差与差异距离矩阵的最大元素之间的关系。将这些结合起来就得到了最优叠加时均方根偏差与差异距离矩阵的最大元素之间的关系。基于这些结果,我们可以应用用于找到差异距离矩阵中所有元素都小于指定界限的子集的算法和软件,要么仅选择均方根偏差小于或等于指定阈值的子集,要么选择包含所有均方根偏差小于或等于阈值的子集的子集。

相似文献

1
Extraction of geometrically similar substructures: least-squares and Chebyshev fitting and the difference distance matrix.几何相似子结构的提取:最小二乘法与切比雪夫拟合以及差异距离矩阵
Proteins. 1998 Nov 15;33(3):320-8.
2
Extraction of well-fitting substructures: root-mean-square deviation and the difference distance matrix.拟合良好的子结构的提取:均方根偏差和差异距离矩阵。
Fold Des. 1997;2(3):S12-4. doi: 10.1016/s1359-0278(97)00057-6.
3
An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance.一种用于蛋白质序列和结构分析与建模的综合方法。I. 蛋白质结构比对及蛋白质结构距离的定量度量。
J Mol Biol. 2000 Aug 18;301(3):665-78. doi: 10.1006/jmbi.2000.3973.
4
Algorithms for optimal protein structure alignment.最优蛋白质结构比对算法。
Bioinformatics. 2009 Nov 1;25(21):2751-6. doi: 10.1093/bioinformatics/btp530. Epub 2009 Sep 4.
5
Size-independent comparison of protein three-dimensional structures.蛋白质三维结构的非大小依赖性比较。
Proteins. 1995 Jul;22(3):273-83. doi: 10.1002/prot.340220308.
6
Comparing short protein substructures by a method based on backbone torsion angles.通过基于主链扭转角的方法比较短蛋白质亚结构。
Proteins. 1989;6(2):155-67. doi: 10.1002/prot.340060206.
7
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
8
Statistical validation of the root-mean-square-distance, a measure of protein structural proximity.均方根距离的统计验证,一种蛋白质结构接近度的度量方法。
Protein Eng Des Sel. 2007 Jan;20(1):33-7. doi: 10.1093/protein/gzl051. Epub 2007 Jan 11.
9
An alternative view of protein fold space.蛋白质折叠空间的另一种观点。
Proteins. 2000 Feb 15;38(3):247-60.
10
Surfactant solutions and porous substrates: spreading and imbibition.表面活性剂溶液与多孔基质:铺展与吸液
Adv Colloid Interface Sci. 2004 Nov 29;111(1-2):3-27. doi: 10.1016/j.cis.2004.07.007.

引用本文的文献

1
Unearthing Insights into Metabolic Syndrome by Linking Drugs, Targets, and Gene Expressions Using Similarity Measures and Graph Theory.利用相似性度量和图论挖掘药物、靶点和基因表达之间的代谢综合征关联信息。
Curr Comput Aided Drug Des. 2024;20(6):773-783. doi: 10.2174/1573409920666230817101913.
2
Homology Models and Molecular Dynamics Simulations of Main Proteinase from Coronavirus Associated with Severe Acute Respiratory Syndrome (SARS).严重急性呼吸综合征(SARS)冠状病毒主要蛋白酶的同源模型与分子动力学模拟
J Chin Chem Soc. 2004 Oct;51(5A):889-900. doi: 10.1002/jccs.200400134. Epub 2013 Sep 25.
3
Robust probabilistic superposition and comparison of protein structures.
蛋白质结构的稳健概率叠加和比较。
BMC Bioinformatics. 2010 Jul 1;11:363. doi: 10.1186/1471-2105-11-363.
4
Prediction of plasma protein binding of drugs using Kier-Hall valence connectivity indices and 4D-fingerprint molecular similarity analyses.
J Comput Aided Mol Des. 2005 Aug;19(8):567-83. doi: 10.1007/s10822-005-9012-4. Epub 2005 Nov 3.
5
Automatic classification of protein structure by using Gauss integrals.利用高斯积分对蛋白质结构进行自动分类。
Proc Natl Acad Sci U S A. 2003 Jan 7;100(1):119-24. doi: 10.1073/pnas.2636460100. Epub 2002 Dec 27.