• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质结构比对的比较分析

Comparative analysis of protein structure alignments.

作者信息

Mayr Gabriele, Domingues Francisco S, Lackner Peter

机构信息

Department of Molecular Biology, University of Salzburg, Salzburg, Austria.

出版信息

BMC Struct Biol. 2007 Jul 26;7:50. doi: 10.1186/1472-6807-7-50.

DOI:10.1186/1472-6807-7-50
PMID:17672887
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1959231/
Abstract

BACKGROUND

Several methods are currently available for the comparison of protein structures. These methods have been analysed regarding the performance in the identification of structurally/evolutionary related proteins, but so far there has been less focus on the objective comparison between the alignments produced by different methods.

RESULTS

We analysed and compared the structural alignments obtained by different methods using three sets of pairs of structurally related proteins. The first set corresponds to 355 pairs of remote homologous proteins according to the SCOP database (ASTRAL40 set). The second set was derived from the SISYPHUS database and includes 69 protein pairs (SISY set). The third set consists of 40 pairs that are challenging to align (RIPC set). The alignment of pairs of this set requires indels of considerable number and size and some of the proteins are related by circular permutations, show extensive conformational variability or include repetitions. Two standard methods (CE and DALI) were applied to align the proteins in the ASTRAL40 set. The extent of structural similarity identified by both methods is highly correlated and the alignments from the two methods agree on average in more than half of the aligned positions. CE, DALI, as well as four additional methods (FATCAT, MATRAS, Calpha-match and SHEBA) were then compared using the SISY and RIPC sets. The accuracy of the alignments was assessed by comparison to reference alignments. The alignments generated by the different methods on average match more than half of the reference alignments in the SISY set. The alignments obtained in the more challenging RIPC set tend to differ considerably and match reference alignments less successfully than the SISY set alignments.

CONCLUSION

The alignments produced by different methods tend to agree to a considerable extent, but the agreement is lower for the more challenging pairs. The results for the comparison to reference alignments are encouraging, but also indicate that there is still room for improvement.

摘要

背景

目前有多种方法可用于比较蛋白质结构。这些方法已针对在识别结构/进化相关蛋白质方面的性能进行了分析,但迄今为止,对于不同方法产生的比对结果之间的客观比较关注较少。

结果

我们使用三组结构相关蛋白质对分析并比较了不同方法获得的结构比对。第一组对应于根据SCOP数据库(ASTRAL40集)的355对远源同源蛋白质。第二组来自SISYPHUS数据库,包括69对蛋白质(SISY集)。第三组由40对难以比对的蛋白质组成(RIPC集)。该组蛋白质对的比对需要相当数量和大小的插入缺失,并且一些蛋白质通过环状排列相关,表现出广泛的构象变异性或包含重复序列。应用两种标准方法(CE和DALI)比对ASTRAL40集中的蛋白质。两种方法识别的结构相似程度高度相关,并且两种方法的比对结果在超过一半的比对位置上平均是一致的。然后使用SISY和RIPC集比较了CE、DALI以及另外四种方法(FATCAT、MATRAS、Cα匹配和SHEBA)。通过与参考比对进行比较来评估比对的准确性。不同方法生成的比对结果在SISY集中平均与超过一半的参考比对匹配。在更具挑战性的RIPC集中获得的比对结果往往差异很大,并且与SISY集比对相比,与参考比对的匹配成功率较低。

结论

不同方法产生的比对结果在很大程度上趋于一致,但对于更具挑战性的蛋白质对,一致性较低。与参考比对的比较结果令人鼓舞,但也表明仍有改进的空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/e64e77532a6a/1472-6807-7-50-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/76ce1c511e1e/1472-6807-7-50-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/e07565f6594f/1472-6807-7-50-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/61cdb7bd93b2/1472-6807-7-50-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/015b4e8f5af7/1472-6807-7-50-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/b0fada7cfe23/1472-6807-7-50-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/27801c604ddd/1472-6807-7-50-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/d0dcfd72f998/1472-6807-7-50-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/fde48504ad59/1472-6807-7-50-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/d6e22383a98a/1472-6807-7-50-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/502558972ae7/1472-6807-7-50-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/e64e77532a6a/1472-6807-7-50-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/76ce1c511e1e/1472-6807-7-50-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/e07565f6594f/1472-6807-7-50-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/61cdb7bd93b2/1472-6807-7-50-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/015b4e8f5af7/1472-6807-7-50-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/b0fada7cfe23/1472-6807-7-50-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/27801c604ddd/1472-6807-7-50-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/d0dcfd72f998/1472-6807-7-50-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/fde48504ad59/1472-6807-7-50-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/d6e22383a98a/1472-6807-7-50-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/502558972ae7/1472-6807-7-50-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc58/1959231/e64e77532a6a/1472-6807-7-50-11.jpg

相似文献

1
Comparative analysis of protein structure alignments.蛋白质结构比对的比较分析
BMC Struct Biol. 2007 Jul 26;7:50. doi: 10.1186/1472-6807-7-50.
2
Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。
BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.
3
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
4
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.CAB比对:一种基于残基-残基接触面积的灵活蛋白质结构比对方法。
PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.
5
Comparison of sequence and structure alignments for protein domains.蛋白质结构域的序列比对与结构比对比较
Proteins. 2002 Aug 15;48(3):439-46. doi: 10.1002/prot.10163.
6
Evolutionary inaccuracy of pairwise structural alignments.成对结构比对的进化不准确性。
Bioinformatics. 2012 May 1;28(9):1209-15. doi: 10.1093/bioinformatics/bts103. Epub 2012 Mar 6.
7
MUSTANG: a multiple structural alignment algorithm.MUSTANG:一种多重结构比对算法。
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
8
A Shannon entropy-based filter detects high- quality profile-profile alignments in searches for remote homologues.一种基于香农熵的过滤器在搜索远源同源物时可检测到高质量的序列轮廓比对。
Proteins. 2004 Feb 1;54(2):351-60. doi: 10.1002/prot.10564.
9
Structure-based evaluation of sequence comparison and fold recognition alignment accuracy.基于结构的序列比对和折叠识别比对准确性评估。
J Mol Biol. 2000 Apr 7;297(4):1003-13. doi: 10.1006/jmbi.2000.3615.
10
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align:一种基于片段比对和TM分数的新型蛋白质结构比对方法。
BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.

引用本文的文献

1
Detection of circular permutations by Protein Language Models.通过蛋白质语言模型检测环形排列
Comput Struct Biotechnol J. 2024 Dec 30;27:214-220. doi: 10.1016/j.csbj.2024.12.029. eCollection 2025.
2
ICARUS: flexible protein structural alignment based on Protein Units.ICARUS:基于蛋白质单元的灵活蛋白质结构比对。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad459.
3
A unified approach to sequential and non-sequential structure alignment of proteins, RNAs, and DNAs.一种用于蛋白质、RNA和DNA的序列和非序列结构比对的统一方法。

本文引用的文献

1
SISYPHUS--structural alignments for proteins with non-trivial relationships.西西弗斯——用于具有复杂关系蛋白质的结构比对
Nucleic Acids Res. 2007 Jan;35(Database issue):D253-9. doi: 10.1093/nar/gkl746. Epub 2006 Oct 26.
2
MUSTANG: a multiple structural alignment algorithm.MUSTANG:一种多重结构比对算法。
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
3
Development and validation of a consistency based multiple structure alignment algorithm.基于一致性的多重结构比对算法的开发与验证
iScience. 2022 Sep 28;25(10):105218. doi: 10.1016/j.isci.2022.105218. eCollection 2022 Oct 21.
4
Development of a TSR-Based Method for Protein 3-D Structural Comparison With Its Applications to Protein Classification and Motif Discovery.基于张量空间表示法的蛋白质三维结构比较方法的开发及其在蛋白质分类和基序发现中的应用
Front Chem. 2021 Jan 13;8:602291. doi: 10.3389/fchem.2020.602291. eCollection 2020.
5
DALI and the persistence of protein shape.DALI 与蛋白质构象的稳定性。
Protein Sci. 2020 Jan;29(1):128-140. doi: 10.1002/pro.3749. Epub 2019 Nov 5.
6
Non-sequential protein structure alignment by conformational space annealing and local refinement.通过构象空间退火和局部精修进行非序列蛋白质结构比对。
PLoS One. 2019 Jan 30;14(1):e0210177. doi: 10.1371/journal.pone.0210177. eCollection 2019.
7
Toxoplasma gondii immune mapped protein 1 is anchored to the inner leaflet of the plasma membrane and adopts a novel protein fold.刚地弓形虫免疫定位蛋白 1 锚定在质膜的内小叶,并采用一种新颖的蛋白质折叠方式。
Biochim Biophys Acta Proteins Proteom. 2017 Feb;1865(2):208-219. doi: 10.1016/j.bbapap.2016.11.010. Epub 2016 Nov 22.
8
Enhanced fold recognition using efficient short fragment clustering.利用高效短片段聚类增强折叠识别。
J Mol Biochem. 2012;1(2):76-85. Epub 2012 Jun 16.
9
An efficient algorithm for protein structure comparison using elastic shape analysis.一种使用弹性形状分析进行蛋白质结构比较的高效算法。
Algorithms Mol Biol. 2016 Sep 29;11:27. doi: 10.1186/s13015-016-0089-1. eCollection 2016.
10
PhyreStorm: A Web Server for Fast Structural Searches Against the PDB.PhyreStorm:一个用于对蛋白质数据库进行快速结构搜索的网络服务器。
J Mol Biol. 2016 Feb 22;428(4):702-708. doi: 10.1016/j.jmb.2015.10.017. Epub 2015 Oct 27.
Bioinformatics. 2006 May 1;22(9):1080-7. doi: 10.1093/bioinformatics/btl046. Epub 2006 Feb 10.
4
A novel approach to structural alignment using realistic structural and environmental information.一种利用逼真的结构和环境信息进行结构比对的新方法。
Protein Sci. 2005 Dec;14(12):2935-46. doi: 10.1110/ps.051428205. Epub 2005 Oct 31.
5
TM-align: a protein structure alignment algorithm based on the TM-score.TM-align:一种基于TM分数的蛋白质结构比对算法。
Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.
6
Using a library of structural templates to recognise catalytic sites and explore their evolution in homologous families.使用结构模板库来识别催化位点并探索其在同源家族中的进化。
J Mol Biol. 2005 Apr 1;347(3):565-81. doi: 10.1016/j.jmb.2005.01.044.
7
Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures.蛋白质结构比对方法的综合评估:基于几何度量的评分
J Mol Biol. 2005 Mar 4;346(4):1173-88. doi: 10.1016/j.jmb.2004.12.032. Epub 2005 Jan 16.
8
PDBsum more: new summaries and analyses of the known 3D structures of proteins and nucleic acids.PDBsum 更多内容:蛋白质和核酸已知三维结构的新总结与分析。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D266-8. doi: 10.1093/nar/gki001.
9
Déjà vu all over again: finding and analyzing protein structure similarities.似曾相识之感再度袭来:寻找并分析蛋白质结构相似性
Structure. 2004 Dec;12(12):2103-11. doi: 10.1016/j.str.2004.09.016.
10
Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.二级结构匹配(SSM),一种用于三维蛋白质结构快速比对的新工具。
Acta Crystallogr D Biol Crystallogr. 2004 Dec;60(Pt 12 Pt 1):2256-68. doi: 10.1107/S0907444904026460. Epub 2004 Nov 26.