MM-align: a quick algorithm for aligning multiple-chain protein complex structures using iterative dynamic programming.

Authors

Mukherjee Srayanta, Zhang Yang

Affiliation

Center for Bioinformatics and Department of Molecular Bioscience, University of Kansas, 2030 Becker Dr, Lawrence, KS 66047, USA.

Publication information

Nucleic Acids Res. 2009 Jun;37(11):e83. doi: 10.1093/nar/gkp318. Epub 2009 May 14.

DOI: 10.1093/nar/gkp318

PMID: 19443443

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2699532/

Abstract

Structural comparison of multiple-chain protein complexes is essential in many studies of protein-protein interactions. We develop a new algorithm, MM-align, for sequence-independent alignment of protein complex structures. The algorithm is built on a heuristic iteration of a modified Needleman-Wunsch dynamic programming (DP) algorithm, with the alignment score specified by the inter-complex residue distances. The multiple chains in each complex are first joined, in every possible order, and then simultaneously aligned with cross-chain alignments prevented. The alignments of interface residues are enhanced by an interface-specific weighting factor. MM-align is tested on a large-scale benchmark set of 205 × 3897 non-homologous multiple-chain complex pairs. Compared with a naïve extension of the monomer alignment program of TM-align, the alignment accuracy of MM-align is significantly higher as judged by the average TM-score of the physically-aligned residues. MM-align is about two times faster than TM-align because of omitting the cross-alignment zone of the DP matrix. It also shows that the enhanced alignment of the interfaces helps in identifying biologically relevant protein complex pairs.
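
The join-then-align step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes the two complexes are already superposed, so it leaves out the heuristic iteration between superposition and DP, and it also omits the interface-specific weighting factor. The function names (`pair_score`, `masked_dp`, `best_chain_order`), the distance scale `d0`, and the linear gap penalty are hypothetical choices made for illustration only. The sketch joins the chains of the second complex in every possible order and runs a Needleman-Wunsch DP in which any residue pair from non-corresponding chains is forbidden.

```python
from itertools import permutations
import math

NEG_INF = float("-inf")


def pair_score(p, q, d0):
    """TM-score-like similarity of two already-superposed 3D points."""
    d = math.dist(p, q)
    return 1.0 / (1.0 + (d / d0) ** 2)


def masked_dp(chains_a, chains_b, d0=5.0, gap=-0.6):
    """Needleman-Wunsch over joined chains with cross-chain pairs forbidden.

    chains_a[k] may only align with chains_b[k]. d0 and gap are
    hypothetical illustration values, not parameters from the paper.
    Returns (score, pairs), where pairs index the joined residue lists.
    """
    # Join the chains, remembering each residue's chain index.
    a = [(k, p) for k, chain in enumerate(chains_a) for p in chain]
    b = [(k, p) for k, chain in enumerate(chains_b) for p in chain]
    n, m = len(a), len(b)

    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * gap
    for j in range(1, m + 1):
        dp[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            (ka, pa), (kb, pb) = a[i - 1], b[j - 1]
            # Cross-chain zone of the DP matrix: a match is not allowed.
            match = dp[i - 1][j - 1] + pair_score(pa, pb, d0) if ka == kb else NEG_INF
            dp[i][j] = max(match, dp[i - 1][j] + gap, dp[i][j - 1] + gap)

    # Trace back to recover the aligned residue pairs.
    pairs = []
    i, j = n, m
    while i > 0 and j > 0:
        (ka, pa), (kb, pb) = a[i - 1], b[j - 1]
        if ka == kb and dp[i][j] == dp[i - 1][j - 1] + pair_score(pa, pb, d0):
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif dp[i][j] == dp[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return dp[n][m], pairs


def best_chain_order(chains_a, chains_b, d0=5.0, gap=-0.6):
    """Try every joining order of complex B's chains and keep the best score."""
    best = (NEG_INF, None, None)
    for order in permutations(range(len(chains_b))):
        joined_b = [chains_b[k] for k in order]
        score, pairs = masked_dp(chains_a, joined_b, d0, gap)
        if score > best[0]:
            best = (score, order, pairs)
    return best
```

For example, if complex B holds the same two chains as complex A but stored in swapped order, `best_chain_order` should select the order that restores the correspondence. The sketch fills the whole DP matrix and only assigns a forbidden score to the cross-chain cells; skipping those cells entirely, as the abstract describes, is what would save computation.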

Similar articles

MM-align: a quick algorithm for aligning multiple-chain protein complex structures using iterative dynamic programming.

Nucleic Acids Res. 2009 Jun;37(11):e83. doi: 10.1093/nar/gkp318. Epub 2009 May 14.

TM-align: a protein structure alignment algorithm based on the TM-score.

Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.

Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.

BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.

Optimal contact map alignment of protein-protein interfaces.

Bioinformatics. 2008 Oct 15;24(20):2324-8. doi: 10.1093/bioinformatics/btn432. Epub 2008 Aug 18.

Iterative refinement of structure-based sequence alignments by Seed Extension.

BMC Bioinformatics. 2009 Jul 9;10:210. doi: 10.1186/1471-2105-10-210.

Finding optimal interaction interface alignments between biological complexes.

Bioinformatics. 2015 Jun 15;31(12):i133-41. doi: 10.1093/bioinformatics/btv242.

CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.

PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.

SE: an algorithm for deriving sequence alignment from a pair of superimposed structures.

BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-10-S1-S4.

A multiple sequence alignment algorithm for homologous proteins using secondary structure information and optionally keying alignments to functionally important sites.

Comput Appl Biosci. 1989 Apr;5(2):141-50. doi: 10.1093/bioinformatics/5.2.141.

US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes.

Nat Methods. 2022 Sep;19(9):1109-1115. doi: 10.1038/s41592-022-01585-1. Epub 2022 Aug 29.

Cited by

Kiwa is a membrane-embedded defense supercomplex activated at phage attachment sites.

Cell. 2025 Jul 23. doi: 10.1016/j.cell.2025.07.002.

Integrating metagenomics and cultivation unveils oral phage diversity and potential impact on hosts.

NPJ Biofilms Microbiomes. 2025 Jul 26;11(1):145. doi: 10.1038/s41522-025-00773-z.

Toward Modeling Protein Multimers by Combining AlphaFold 3 Predictions with Secondary Structures from Medium-Resolution Cryo-EM Maps.

Comput Struct Bioinform (2024). 2025;2396:71-83. doi: 10.1007/978-3-031-85435-4_6. Epub 2025 Mar 26.

AlphaFold models of host-pathogen interactions elucidate the prevalence and structural modes of molecular mimicry.

bioRxiv. 2025 Jun 6:2025.06.04.657796. doi: 10.1101/2025.06.04.657796.

Sal is a proteobacterial bile acid aldolase that repurposes key thiolase catalytic residues for retroaldol cleavage of C steroid side chains.

J Biol Chem. 2025 Jul 1;301(8):110439. doi: 10.1016/j.jbc.2025.110439.

A graphic and command line protocol for quick and accurate comparisons of protein and nucleic acid structures with US-align.

Nat Protoc. 2025 Jul 2. doi: 10.1038/s41596-025-01189-x.

AI-first structural identification of pathogenic protein target interfaces.

PLoS Comput Biol. 2025 Jun 26;21(6):e1013168. doi: 10.1371/journal.pcbi.1013168. eCollection 2025 Jun.

Impact of HOMER2 frameshift extension variant on auditory function and development.

J Mol Med (Berl). 2025 Aug;103(8):975-987. doi: 10.1007/s00109-025-02556-7. Epub 2025 Jun 14.

Modeling CAPRI Targets of Round 55 by Combining AlphaFold and Docking.

Proteins. 2025 Jun 6. doi: 10.1002/prot.26853.

DeepUMQA-X: Comprehensive and insightful estimation of model accuracy for protein single-chain and complex.

Nucleic Acids Res. 2025 May 5. doi: 10.1093/nar/gkaf380.
