• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

量化蛋白质结构叠加的空间位阻和拓扑阻碍。

Quantifying steric hindrance and topological obstruction to protein structure superposition.

作者信息

Røgen Peter

机构信息

Department of Applied Mathematics and Computer Science, Technical University of Denmark, Asmussens Allé, Building 322, Kongens Lyngby, Denmark.

出版信息

Algorithms Mol Biol. 2021 Feb 27;16(1):1. doi: 10.1186/s13015-020-00180-3.

DOI:10.1186/s13015-020-00180-3
PMID:33639968
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7913338/
Abstract

BACKGROUND

In computational structural biology, structure comparison is fundamental for our understanding of proteins. Structure comparison is, e.g., algorithmically the starting point for computational studies of structural evolution and it guides our efforts to predict protein structures from their amino acid sequences. Most methods for structural alignment of protein structures optimize the distances between aligned and superimposed residue pairs, i.e., the distances traveled by the aligned and superimposed residues during linear interpolation. Considering such a linear interpolation, these methods do not differentiate if there is room for the interpolation, if it causes steric clashes, or more severely, if it changes the topology of the compared protein backbone curves.

RESULTS

To distinguish such cases, we analyze the linear interpolation between two aligned and superimposed backbones. We quantify the amount of steric clashes and find all self-intersections in a linear backbone interpolation. To determine if the self-intersections alter the protein's backbone curve significantly or not, we present a path-finding algorithm that checks if there exists a self-avoiding path in a neighborhood of the linear interpolation. A new path is constructed by altering the linear interpolation using a novel interpretation of Reidemeister moves from knot theory working on three-dimensional curves rather than on knot diagrams. Either the algorithm finds a self-avoiding path or it returns a smallest set of essential self-intersections. Each of these indicates a significant difference between the folds of the aligned protein structures. As expected, we find at least one essential self-intersection separating most unknotted structures from a knotted structure, and we find even larger motions in proteins connected by obstruction free linear interpolations. We also find examples of homologous proteins that are differently threaded, and we find many distinct folds connected by longer but simple deformations. TM-align is one of the most restrictive alignment programs. With standard parameters, it only aligns residues superimposed within 5 Ångström distance. We find 42165 topological obstructions between aligned parts in 142068 TM-alignments. Thus, this restrictive alignment procedure still allows topological dissimilarity of the aligned parts.

CONCLUSIONS

Based on the data we conclude that our program ProteinAlignmentObstruction provides significant additional information to alignment scores based solely on distances between aligned and superimposed residue pairs.

摘要

背景

在计算结构生物学中,结构比较是我们理解蛋白质的基础。例如,从算法角度来看,结构比较是结构进化计算研究的起点,并且它指导我们从氨基酸序列预测蛋白质结构的工作。大多数蛋白质结构的结构比对方法会优化比对和叠加残基对之间的距离,即线性插值过程中比对和叠加残基移动的距离。考虑到这种线性插值,这些方法不会区分是否有插值空间、是否会导致空间冲突,或者更严重的是,是否会改变所比较蛋白质主链曲线的拓扑结构。

结果

为了区分这些情况,我们分析了两条比对和叠加后的主链之间的线性插值。我们量化了空间冲突的数量,并找出线性主链插值中的所有自相交点。为了确定自相交点是否显著改变了蛋白质的主链曲线,我们提出了一种路径查找算法,该算法检查在线性插值的邻域内是否存在一条避免自相交的路径。通过使用纽结理论中对三维曲线而非纽结图的雷德迈斯特移动的一种新颖解释来改变线性插值,从而构建一条新路径。该算法要么找到一条避免自相交的路径,要么返回一组最小的基本自相交点。其中每一个都表明比对的蛋白质结构折叠之间存在显著差异。正如预期的那样,我们发现至少有一个基本自相交点将大多数无纽结结构与一个纽结结构区分开来,并且我们发现在通过无阻碍线性插值连接的蛋白质中存在更大的构象变化。我们还发现了同源蛋白质以不同方式穿线的例子,并且我们发现许多不同的折叠通过更长但简单的变形连接在一起。TM-align是最具限制性的比对程序之一。使用标准参数时,它仅比对叠加距离在5埃以内的残基。我们在142068次TM-align比对中的比对部分之间发现了42165个拓扑阻碍。因此,这种限制性的比对过程仍然允许比对部分存在拓扑差异。

结论

基于这些数据,我们得出结论,我们的程序ProteinAlignmentObstruction为仅基于比对和叠加残基对之间距离的比对分数提供了重要的额外信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/d940cb27bacd/13015_2020_180_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/ca44aefd91c1/13015_2020_180_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/90dc1bd36a7f/13015_2020_180_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/9e0adb4d97f6/13015_2020_180_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/201fb20a7cd0/13015_2020_180_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/a1f911c947d3/13015_2020_180_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/04f2fda19014/13015_2020_180_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/3ed94176023a/13015_2020_180_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/dade437162f9/13015_2020_180_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/d940cb27bacd/13015_2020_180_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/ca44aefd91c1/13015_2020_180_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/90dc1bd36a7f/13015_2020_180_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/9e0adb4d97f6/13015_2020_180_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/201fb20a7cd0/13015_2020_180_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/a1f911c947d3/13015_2020_180_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/04f2fda19014/13015_2020_180_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/3ed94176023a/13015_2020_180_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/dade437162f9/13015_2020_180_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3b8/7913338/d940cb27bacd/13015_2020_180_Fig9_HTML.jpg

相似文献

1
Quantifying steric hindrance and topological obstruction to protein structure superposition.量化蛋白质结构叠加的空间位阻和拓扑阻碍。
Algorithms Mol Biol. 2021 Feb 27;16(1):1. doi: 10.1186/s13015-020-00180-3.
2
Sequence-Similar Protein Domain Pairs With Structural or Topological Dissimilarity.具有结构或拓扑差异的序列相似蛋白质结构域对。
Proteins. 2025 Mar;93(3):588-597. doi: 10.1002/prot.26753. Epub 2024 Oct 11.
3
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
4
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.CAB比对:一种基于残基-残基接触面积的灵活蛋白质结构比对方法。
PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.
5
MM-align: a quick algorithm for aligning multiple-chain protein complex structures using iterative dynamic programming.MM比对:一种使用迭代动态规划对多链蛋白质复合体结构进行比对的快速算法。
Nucleic Acids Res. 2009 Jun;37(11):e83. doi: 10.1093/nar/gkp318. Epub 2009 May 14.
6
Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。
BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.
7
Protein structure comparison by alignment of distance matrices.通过距离矩阵比对进行蛋白质结构比较。
J Mol Biol. 1993 Sep 5;233(1):123-38. doi: 10.1006/jmbi.1993.1489.
8
CAALIGN: a program for pairwise and multiple protein-structure alignment.CAALIGN:一个用于蛋白质结构两两比对和多序列比对的程序。
Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16.
9
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
10
Optimal pairwise alignment of fixed protein structures in subquadratic time.在亚二次时间内对固定蛋白质结构进行最优成对比对。
J Bioinform Comput Biol. 2011 Jun;9(3):367-82. doi: 10.1142/s0219720011005562.

引用本文的文献

1
Metabolic engineering for biosynthesis of hydroxycinnamoyl glycerols.用于羟基肉桂酰甘油生物合成的代谢工程
Synth Syst Biotechnol. 2025 Jul 17;10(4):1257-1266. doi: 10.1016/j.synbio.2025.07.005. eCollection 2025 Dec.
2
Sequence-Similar Protein Domain Pairs With Structural or Topological Dissimilarity.具有结构或拓扑差异的序列相似蛋白质结构域对。
Proteins. 2025 Mar;93(3):588-597. doi: 10.1002/prot.26753. Epub 2024 Oct 11.

本文引用的文献

1
Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths.在对不同长度的重复使用的蛋白质片段的分析中揭示了复杂的进化足迹。
Proc Natl Acad Sci U S A. 2017 Oct 31;114(44):11703-11708. doi: 10.1073/pnas.1707642114. Epub 2017 Oct 19.
2
Topological knots and links in proteins.蛋白质中的拓扑纽结和链接。
Proc Natl Acad Sci U S A. 2017 Mar 28;114(13):3415-3420. doi: 10.1073/pnas.1615862114. Epub 2017 Mar 9.
3
Complex lasso: new entangled motifs in proteins.复杂套索:蛋白质中的新纠缠模体。
Sci Rep. 2016 Nov 22;6:36895. doi: 10.1038/srep36895.
4
Knotted proteins: A tangled tale of Structural Biology.纽结蛋白:结构生物学的纠结故事
Comput Struct Biotechnol J. 2015 Aug 19;13:459-68. doi: 10.1016/j.csbj.2015.08.003. eCollection 2015.
5
FlexE: Using elastic network models to compare models of protein structure.FlexE:使用弹性网络模型比较蛋白质结构模型。
J Chem Theory Comput. 2012 Oct 9;8(10):3985-3991. doi: 10.1021/ct300148f.
6
CATH: comprehensive structural and functional annotations for genome sequences.CATH:基因组序列的全面结构和功能注释。
Nucleic Acids Res. 2015 Jan;43(Database issue):D376-81. doi: 10.1093/nar/gku947. Epub 2014 Oct 27.
7
Global view of the protein universe.蛋白质宇宙的全球视角。
Proc Natl Acad Sci U S A. 2014 Aug 12;111(32):11691-6. doi: 10.1073/pnas.1403395111. Epub 2014 Jul 28.
8
lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests.lDDT:一种局部无叠加评分方法,用于通过距离差异测试比较蛋白质结构和模型。
Bioinformatics. 2013 Nov 1;29(21):2722-8. doi: 10.1093/bioinformatics/btt473. Epub 2013 Aug 27.
9
pKNOT v.2: the protein KNOT web server.pKNOT v.2:蛋白质 KNOT 网络服务器。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W228-31. doi: 10.1093/nar/gks592. Epub 2012 Jun 12.
10
PyKnot: a PyMOL tool for the discovery and analysis of knots in proteins.PyKnot:用于发现和分析蛋白质中纽结的 PyMOL 工具。
Bioinformatics. 2012 Aug 1;28(15):2069-71. doi: 10.1093/bioinformatics/bts299. Epub 2012 May 18.