• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有结构或拓扑差异的序列相似蛋白质结构域对。

Sequence-Similar Protein Domain Pairs With Structural or Topological Dissimilarity.

作者信息

Røgen Peter

机构信息

Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kongens Lyngby, Denmark.

出版信息

Proteins. 2025 Mar;93(3):588-597. doi: 10.1002/prot.26753. Epub 2024 Oct 11.

DOI:10.1002/prot.26753
PMID:39392124
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11809131/
Abstract

For a variety of applications, protein structures are clustered by sequence similarity, and sequence-redundant structures are disregarded. Sequence-similar chains are likely to have similar structures, but significant structural variation, as measured with RMSD, has been documented for sequence-similar chains and found usually to have a functional explanation. Moving two neighboring stretches of backbone through each other may change the chain topology and alter possible folding paths. The size of this motion is compatible to a variation in a flexible loop. We search and find domains with alternate chain topology in CATH4.2 sequence families relatively independent of sequence identity and of structural similarity as measured by RMSD. Structural, topological, and functional representative sets should therefore keep sequence-similar domains not just with structural variation but also with topological variation. We present BCAlign that finds Alignment and superposition of protein Backbone Curves by optimizing a user chosen convex combination of structural derivation and derivation between the structure-based sequence alignment and an input sequence alignment. Steric and topological obstructions from deforming a curve into an aligned curve are then found by a previously developed algorithm. For highly sequence-similar domains, sequence-based structural alignment better represents the chains motion and generally reveals larger structural and topological variation than structure-based does. Fold-switching protein pairs have been reported to be most frequent between X-ray and NMR structures and estimated to be underrepresented in the PDB as the alternate configuration is harder to resolve. Here we similarly find chain topology most frequently altered between X-ray and NMR structures.

摘要

对于各种应用,蛋白质结构按序列相似性进行聚类,序列冗余的结构则被忽略。序列相似的链可能具有相似的结构,但已记录到,用均方根偏差(RMSD)衡量,序列相似的链存在显著的结构变异,且通常能找到其功能上的解释。使两段相邻的主链相互穿过可能会改变链的拓扑结构,并改变可能的折叠路径。这种运动的大小与柔性环的变异程度相当。我们在CATH4.2序列家族中搜索并发现具有交替链拓扑结构的结构域,这些结构域相对独立于序列同一性以及用RMSD衡量的结构相似性。因此,结构、拓扑和功能代表性集不仅应保留具有结构变异的序列相似结构域,还应保留具有拓扑变异的结构域。我们提出了BCAlign,它通过优化用户选择的结构推导与基于结构的序列比对和输入序列比对之间的推导的凸组合,来找到蛋白质主链曲线的比对和叠加。然后,通过先前开发的算法来发现将曲线变形为比对曲线时的空间位阻和拓扑障碍。对于高度序列相似的结构域,基于序列的结构比对能更好地体现链的运动,并且通常比基于结构的比对揭示出更大的结构和拓扑变异。据报道,折叠转换蛋白对在X射线和核磁共振结构之间最为常见,并且据估计在蛋白质数据库(PDB)中代表性不足,因为其交替构型更难解析。在这里,我们同样发现链拓扑结构在X射线和核磁共振结构之间变化最为频繁。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/8b6a35bb5e02/PROT-93-588-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/5a3bf541f65b/PROT-93-588-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/be2ce852b899/PROT-93-588-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/c82cd27c0cd5/PROT-93-588-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/925c7e015120/PROT-93-588-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/3006f6ed6113/PROT-93-588-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/7a363d39a843/PROT-93-588-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/77177cbfed63/PROT-93-588-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/26730ec1228e/PROT-93-588-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/8b6a35bb5e02/PROT-93-588-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/5a3bf541f65b/PROT-93-588-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/be2ce852b899/PROT-93-588-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/c82cd27c0cd5/PROT-93-588-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/925c7e015120/PROT-93-588-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/3006f6ed6113/PROT-93-588-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/7a363d39a843/PROT-93-588-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/77177cbfed63/PROT-93-588-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/26730ec1228e/PROT-93-588-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e63a/11809131/8b6a35bb5e02/PROT-93-588-g009.jpg

相似文献

1
Sequence-Similar Protein Domain Pairs With Structural or Topological Dissimilarity.具有结构或拓扑差异的序列相似蛋白质结构域对。
Proteins. 2025 Mar;93(3):588-597. doi: 10.1002/prot.26753. Epub 2024 Oct 11.
2
Detailed protein sequence alignment based on Spectral Similarity Score (SSS).基于光谱相似性评分(SSS)的详细蛋白质序列比对。
BMC Bioinformatics. 2005 Apr 23;6:105. doi: 10.1186/1471-2105-6-105.
3
Biological insights from topology independent comparison of protein 3D structures.从蛋白质 3D 结构的拓扑独立比较中获得的生物学见解。
Nucleic Acids Res. 2011 Aug;39(14):e94. doi: 10.1093/nar/gkr348. Epub 2011 May 19.
4
Quantifying steric hindrance and topological obstruction to protein structure superposition.量化蛋白质结构叠加的空间位阻和拓扑阻碍。
Algorithms Mol Biol. 2021 Feb 27;16(1):1. doi: 10.1186/s13015-020-00180-3.
5
Quantification of the variation in percentage identity for protein sequence alignments.蛋白质序列比对中百分比一致性变化的量化。
BMC Bioinformatics. 2006 Sep 19;7:415. doi: 10.1186/1471-2105-7-415.
6
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align:一种基于片段比对和TM分数的新型蛋白质结构比对方法。
BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.
7
Topology independent protein structural alignment.拓扑结构无关的蛋白质结构比对
BMC Bioinformatics. 2007 Oct 15;8:388. doi: 10.1186/1471-2105-8-388.
8
A database of protein structure families with common folding motifs.一个具有共同折叠基序的蛋白质结构家族数据库。
Protein Sci. 1992 Dec;1(12):1691-8. doi: 10.1002/pro.5560011217.
9
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.
10
Dynamics alignment: comparison of protein dynamics in the SCOP database.动力学比对:SCOP 数据库中蛋白质动力学的比较。
Proteins. 2012 Apr;80(4):1167-76. doi: 10.1002/prot.24017. Epub 2012 Feb 10.

本文引用的文献

1
Knot or not? Identifying unknotted proteins in knotted families with sequence-based Machine Learning model.打结还是不打结?基于序列的机器学习模型识别打结家族中的无打结蛋白。
Protein Sci. 2024 Jul;33(7):e4998. doi: 10.1002/pro.4998.
2
Topological links in predicted protein complex structures reveal limitations of AlphaFold.预测蛋白质复合物结构中的拓扑连接揭示了 AlphaFold 的局限性。
Commun Biol. 2023 Oct 28;6(1):1098. doi: 10.1038/s42003-023-05489-4.
3
Sequence-structure-function relationships in the microbial protein universe.微生物蛋白质宇宙中的序列-结构-功能关系。
Nat Commun. 2023 Apr 26;14(1):2351. doi: 10.1038/s41467-023-37896-w.
4
AlphaFold2 fails to predict protein fold switching.AlphaFold2 无法预测蛋白质构象转变。
Protein Sci. 2022 Jun;31(6):e4353. doi: 10.1002/pro.4353.
5
Impact of protein conformational diversity on AlphaFold predictions.蛋白质构象多样性对 AlphaFold 预测的影响。
Bioinformatics. 2022 May 13;38(10):2742-2748. doi: 10.1093/bioinformatics/btac202.
6
Quantifying steric hindrance and topological obstruction to protein structure superposition.量化蛋白质结构叠加的空间位阻和拓扑阻碍。
Algorithms Mol Biol. 2021 Feb 27;16(1):1. doi: 10.1186/s13015-020-00180-3.
7
GISA: using Gauss Integrals to identify rare conformations in protein structures.GISA:利用高斯积分识别蛋白质结构中的罕见构象。
PeerJ. 2020 Jun 11;8:e9159. doi: 10.7717/peerj.9159. eCollection 2020.
8
On folding of entangled proteins: knots, lassos, links and θ-curves.纠缠蛋白的折叠:纽结、套索、链接和 θ 曲线。
Curr Opin Struct Biol. 2020 Feb;60:131-141. doi: 10.1016/j.sbi.2020.01.007. Epub 2020 Feb 12.
9
Automatic Inference of Sequence from Low-Resolution Crystallographic Data.自动从低分辨率晶体学数据推断序列。
Structure. 2018 Nov 6;26(11):1546-1554.e2. doi: 10.1016/j.str.2018.08.011. Epub 2018 Oct 4.
10
Extant fold-switching proteins are widespread.现存的构象转换蛋白广泛存在。
Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):5968-5973. doi: 10.1073/pnas.1800168115. Epub 2018 May 21.