• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

二级结构空间构象足迹:一种快速蛋白质结构比较与分类的新方法。

Secondary structure spatial conformation footprint: a novel method for fast protein structure comparison and classification.

作者信息

Zotenko Elena, O'Leary Dianne P, Przytycka Teresa M

机构信息

Department of Computer Science, University of Maryland, College Park, MD 20742, USA.

出版信息

BMC Struct Biol. 2006 Jun 8;6:12. doi: 10.1186/1472-6807-6-12.

DOI:10.1186/1472-6807-6-12
PMID:16762072
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1526735/
Abstract

BACKGROUND

Recently a new class of methods for fast protein structure comparison has emerged. We call the methods in this class projection methods as they rely on a mapping of protein structure into a high-dimensional vector space. Once the mapping is done, the structure comparison is reduced to distance computation between corresponding vectors. As structural similarity is approximated by distance between projections, the success of any projection method depends on how well its mapping function is able to capture the salient features of protein structure. There is no agreement on what constitutes a good projection technique and the three currently known projection methods utilize very different approaches to the mapping construction, both in terms of what structural elements are included and how this information is integrated to produce a vector representation.

RESULTS

In this paper we propose a novel projection method that uses secondary structure information to produce the mapping. First, a diverse set of spatial arrangements of triplets of secondary structure elements, a set of structural models, is automatically selected. Then, each protein structure is mapped into a high-dimensional vector of "counts" or footprint, where each count corresponds to the number of times a given structural model is observed in the structure, weighted by the precision with which the model is reproduced. We perform the first comprehensive evaluation of our method together with all other currently known projection methods.

CONCLUSION

The results of our evaluation suggest that the type of structural information used by a projection method affects the ability of the method to detect structural similarity. In particular, our method that uses the spatial conformations of triplets of secondary structure elements outperforms other methods in most of the tests.

摘要

背景

最近出现了一类用于快速蛋白质结构比较的新方法。我们将这类方法称为投影方法,因为它们依赖于将蛋白质结构映射到高维向量空间。一旦完成映射,结构比较就简化为相应向量之间的距离计算。由于结构相似性通过投影之间的距离来近似,任何投影方法的成功都取决于其映射函数能够多好地捕捉蛋白质结构的显著特征。对于什么构成一种好的投影技术尚无共识,目前已知的三种投影方法在映射构建方面采用了非常不同的方法,无论是在包含哪些结构元素以及如何整合这些信息以生成向量表示方面。

结果

在本文中,我们提出了一种使用二级结构信息来进行映射的新型投影方法。首先,自动选择一组多样化的二级结构元素三联体的空间排列,即一组结构模型。然后,将每个蛋白质结构映射到一个由“计数”或足迹组成的高维向量中,其中每个计数对应于在该结构中观察到给定结构模型的次数,并由模型再现的精度加权。我们对我们的方法以及所有其他目前已知的投影方法进行了首次全面评估。

结论

我们的评估结果表明,投影方法所使用的结构信息类型会影响该方法检测结构相似性的能力。特别是,我们使用二级结构元素三联体空间构象的方法在大多数测试中优于其他方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/a89ab33c9abc/1472-6807-6-12-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/d8920d928b5d/1472-6807-6-12-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/0856322feff3/1472-6807-6-12-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/f7a38dc88eef/1472-6807-6-12-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/a89ab33c9abc/1472-6807-6-12-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/d8920d928b5d/1472-6807-6-12-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/0856322feff3/1472-6807-6-12-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/f7a38dc88eef/1472-6807-6-12-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/1526735/a89ab33c9abc/1472-6807-6-12-4.jpg

相似文献

1
Secondary structure spatial conformation footprint: a novel method for fast protein structure comparison and classification.二级结构空间构象足迹:一种快速蛋白质结构比较与分类的新方法。
BMC Struct Biol. 2006 Jun 8;6:12. doi: 10.1186/1472-6807-6-12.
2
Structural footprinting in protein structure comparison: the impact of structural fragments.蛋白质结构比较中的结构足迹分析:结构片段的影响
BMC Struct Biol. 2007 Aug 9;7:53. doi: 10.1186/1472-6807-7-53.
3
Novel structure-driven features for accurate prediction of protein structural class.用于准确预测蛋白质结构类别的新型结构驱动特征。
Genomics. 2014 Apr;103(4):292-7. doi: 10.1016/j.ygeno.2014.04.002. Epub 2014 Apr 18.
4
High-accuracy prediction of protein structural class for low-similarity sequences based on predicted secondary structure.基于预测的二级结构对低相似度序列进行蛋白质结构类别高精度预测。
Biochimie. 2011 Apr;93(4):710-4. doi: 10.1016/j.biochi.2011.01.001. Epub 2011 Jan 13.
5
A simple topological representation of protein structure: implications for new, fast, and robust structural classification.蛋白质结构的一种简单拓扑表示:对新型、快速且稳健的结构分类的启示。
Proteins. 2004 Aug 15;56(3):487-501. doi: 10.1002/prot.20146.
6
A novel predictor for protein structural class based on integrated information of the secondary structure sequence.一种基于二级结构序列综合信息的蛋白质结构类新型预测器。
Biochimie. 2014 Aug;103:131-6. doi: 10.1016/j.biochi.2014.05.008. Epub 2014 May 22.
7
A protein structural class prediction method based on novel features.基于新型特征的蛋白质结构类别预测方法。
Biochimie. 2013 Sep;95(9):1741-4. doi: 10.1016/j.biochi.2013.05.017. Epub 2013 Jun 14.
8
A high-accuracy protein structural class prediction algorithm using predicted secondary structural information.利用预测的二级结构信息进行高精度蛋白质结构类预测算法。
J Theor Biol. 2010 Dec 7;267(3):272-5. doi: 10.1016/j.jtbi.2010.09.007. Epub 2010 Sep 8.
9
Angle-distance image matching techniques for protein structure comparison.用于蛋白质结构比较的角度-距离图像匹配技术
J Mol Recognit. 2008 Nov-Dec;21(6):442-52. doi: 10.1002/jmr.914.
10
Secondary structure-based assignment of the protein structural classes.基于二级结构的蛋白质结构类别的划分
Amino Acids. 2008 Oct;35(3):551-64. doi: 10.1007/s00726-008-0080-3. Epub 2008 Apr 22.

引用本文的文献

1
Faithful Interpretation of Protein Structures through Weighted Persistent Homology Improves Evolutionary Distance Estimation.通过加权持久同调对蛋白质结构进行忠实解释可改进进化距离估计。
Mol Biol Evol. 2025 Feb 3;42(2). doi: 10.1093/molbev/msae271.
2
Zero-shot prediction of mutation effects with multimodal deep representation learning guides protein engineering.基于多模态深度表示学习的零-shot 突变效应预测指导蛋白质工程。
Cell Res. 2024 Sep;34(9):630-647. doi: 10.1038/s41422-024-00989-2. Epub 2024 Jul 5.
3
Persistent homology reveals strong phylogenetic signal in 3D protein structures.

本文引用的文献

1
Fold classification based on secondary structure--how much is gained by including loop topology?基于二级结构的折叠分类——纳入环拓扑结构能带来多少收获?
BMC Struct Biol. 2006 Mar 8;6:3. doi: 10.1186/1472-6807-6-3.
2
Efficient recognition of folds in protein 3D structures by the improved PRIDE algorithm.通过改进的PRIDE算法高效识别蛋白质三维结构中的折叠。
Bioinformatics. 2005 Aug 1;21(15):3322-3. doi: 10.1093/bioinformatics/bti513. Epub 2005 May 24.
3
PROuST: a comparison method of three-dimensional structures of proteins using indexing techniques.
持久同调揭示了三维蛋白质结构中强大的系统发育信号。
PNAS Nexus. 2024 Apr 17;3(4):pgae158. doi: 10.1093/pnasnexus/pgae158. eCollection 2024 Apr.
4
Fast protein structure comparison through effective representation learning with contrastive graph neural networks.通过对比图神经网络的有效表示学习进行快速蛋白质结构比较。
PLoS Comput Biol. 2022 Mar 24;18(3):e1009986. doi: 10.1371/journal.pcbi.1009986. eCollection 2022 Mar.
5
Learning structural motif representations for efficient protein structure search.学习结构基元表示以实现高效的蛋白质结构搜索。
Bioinformatics. 2018 Sep 1;34(17):i773-i780. doi: 10.1093/bioinformatics/bty585.
6
Multi-view methods for protein structure comparison using latent dirichlet allocation.基于潜在狄利克雷分配的蛋白质结构对比的多视图方法。
Bioinformatics. 2011 Jul 1;27(13):i61-8. doi: 10.1093/bioinformatics/btr249.
7
An Algebro-topological description of protein domain structure.蛋白质结构域的代数拓扑描述。
PLoS One. 2011;6(5):e19670. doi: 10.1371/journal.pone.0019670. Epub 2011 May 24.
8
GOSSIP: a method for fast and accurate global alignment of protein structures.闲话:一种快速准确的蛋白质结构全局比对方法。
Bioinformatics. 2011 Apr 1;27(7):925-32. doi: 10.1093/bioinformatics/btr044. Epub 2011 Feb 3.
9
Alignment-free local structural search by writhe decomposition.无比对的局部结构搜索通过纽结分解。
Bioinformatics. 2010 May 1;26(9):1176-84. doi: 10.1093/bioinformatics/btq127. Epub 2010 Apr 5.
10
FragBag, an accurate representation of protein structure, retrieves structural neighbors from the entire PDB quickly and accurately.FragBag 是一种准确表示蛋白质结构的方法,它可以快速准确地从整个 PDB 中检索结构邻居。
Proc Natl Acad Sci U S A. 2010 Feb 23;107(8):3481-6. doi: 10.1073/pnas.0914097107. Epub 2010 Feb 3.
PROuST:一种使用索引技术比较蛋白质三维结构的方法。
J Comput Biol. 2004;11(6):1061-72. doi: 10.1089/cmb.2004.11.1061.
4
Survey of current protein family databases and their application in comparative, structural and functional genomics.当前蛋白质家族数据库及其在比较、结构和功能基因组学中的应用综述。
J Chromatogr B Analyt Technol Biomed Life Sci. 2005 Feb 5;815(1-2):97-107. doi: 10.1016/j.jchromb.2004.11.010.
5
Superfamilies of evolved and designed networks.进化与设计网络的超家族。
Science. 2004 Mar 5;303(5663):1538-42. doi: 10.1126/science.1089167.
6
Local feature frequency profile: a method to measure structural similarity in proteins.局部特征频率分布图:一种测量蛋白质结构相似性的方法。
Proc Natl Acad Sci U S A. 2004 Mar 16;101(11):3797-802. doi: 10.1073/pnas.0308656100. Epub 2004 Feb 25.
7
The ASTRAL Compendium in 2004.2004年的《星盘汇编》。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D189-92. doi: 10.1093/nar/gkh034.
8
PSI: indexing protein structures for fast similarity search.PSI:为快速相似性搜索对蛋白质结构进行索引。
Bioinformatics. 2003;19 Suppl 1:i81-3. doi: 10.1093/bioinformatics/btg1009.
9
Automatic classification of protein structure by using Gauss integrals.利用高斯积分对蛋白质结构进行自动分类。
Proc Natl Acad Sci U S A. 2003 Jan 7;100(1):119-24. doi: 10.1073/pnas.2636460100. Epub 2002 Dec 27.
10
Protein fold similarity estimated by a probabilistic approach based on C(alpha)-C(alpha) distance comparison.基于Cα-Cα距离比较的概率方法估计的蛋白质折叠相似性。
J Mol Biol. 2002 Jan 25;315(4):887-98. doi: 10.1006/jmbi.2001.5250.