Reduced representation of protein structure: implications on efficiency and scope of detection of structural similarity.

Affiliation

Bioinformatics Institute, A*STAR, 30 Biopolis Street, #07-01 Matrix, Singapore 138671.

Publication information

BMC Bioinformatics. 2010 Mar 26;11:155. doi: 10.1186/1471-2105-11-155.

DOI:10.1186/1471-2105-11-155

PMID:20338066

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3098053/

Abstract

BACKGROUND

Computational comparison of two protein structures is the starting point of many methods that build on existing knowledge, such as structure modeling (including modeling of protein complexes and conformational changes), molecular replacement, or annotation by structural similarity. In a commonly used strategy, significant effort is invested in matching two sets of atoms. In a complementary approach, a global descriptor is assigned to the overall structure, thus losing track of the substructures within.

RESULTS

Using a small set of geometric features, we define a reduced representation of protein structure, together with an optimizing function for matching two representations, to provide a pre-filtering stage in a database search. We show that, in a straightforward implementation, the representation performs well in terms of resolution in the space of protein structures, and its ability to make new predictions.
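As a rough illustration of what such a reduced representation might look like, the sketch below collapses each secondary structural element (SSE) into a single unit direction vector. It is a minimal sketch under stated assumptions, not the authors' implementation: the names `centroid`, `sse_direction`, and `reduce_structure` are hypothetical, the input is assumed to be C-alpha coordinates already grouped per SSE, and the half-centroid axis estimate is a simple stand-in for whatever fitting the paper actually uses.

```python
import math


def centroid(points):
    """Arithmetic mean of a list of 3D points."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(3))


def sse_direction(ca_coords):
    """Approximate unit direction of one SSE from its C-alpha coordinates.

    Assumption for illustration: the axis runs from the centroid of the
    N-terminal half of the segment to the centroid of the C-terminal half.
    """
    if len(ca_coords) < 2:
        raise ValueError("need at least two C-alpha positions")
    half = len(ca_coords) // 2
    start = centroid(ca_coords[:half])
    end = centroid(ca_coords[-half:])
    vec = tuple(e - s for s, e in zip(start, end))
    norm = math.sqrt(sum(c * c for c in vec))
    if norm == 0.0:
        raise ValueError("degenerate segment: both halves share a centroid")
    return tuple(c / norm for c in vec)


def reduce_structure(sse_segments):
    """One direction vector per SSE, kept in sequence order."""
    return [sse_direction(segment) for segment in sse_segments]
```

A whole chain then shrinks from thousands of atomic coordinates to a short ordered list of vectors, which is what makes a cheap first-pass comparison plausible.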

CONCLUSIONS

Perhaps unexpectedly, a substantial discriminating power already exists at the level of main features of protein structure, such as directions of secondary structural elements, possibly constrained by their sequential order. This can be used toward efficient comparison of protein (sub)structures, allowing for various degrees of conformational flexibility within the compared pair, which in turn can be used for modeling by homology of protein structure and dynamics.
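One way to use SSE directions for pre-filtering can be sketched as follows. The score is an assumption made for illustration, not the optimizing function defined in the paper: it compares the angles between all pairs of elements, taken in sequential order, so it does not depend on how either structure is oriented. The sketch also assumes both structures have the same number of elements, and it cannot tell a structure from its mirror image. The names `direction_score` and `prefilter` and the 0.8 threshold are hypothetical.

```python
def dot(u, v):
    """Dot product of two 3D vectors."""
    return sum(a * b for a, b in zip(u, v))


def direction_score(dirs_a, dirs_b):
    """Rotation-invariant similarity of two ordered lists of unit vectors.

    For every pair (i, j), the cosine of the angle between elements i and j
    in structure A is compared with the same pair in structure B.
    Returns a value in [0, 1]; 1 means all inter-element angles agree.
    """
    if len(dirs_a) != len(dirs_b):
        raise ValueError("this sketch assumes equal numbers of elements")
    n = len(dirs_a)
    if n < 2:
        return 1.0  # no pairs to disagree on
    total = 0.0
    pairs = 0
    for i in range(n):
        for j in range(i + 1, n):
            diff = dot(dirs_a[i], dirs_a[j]) - dot(dirs_b[i], dirs_b[j])
            total += abs(diff) / 2.0  # two cosines differ by at most 2
            pairs += 1
    return 1.0 - total / pairs


def prefilter(query, database, threshold=0.8):
    """Names of database entries that pass the cheap direction test.

    `database` maps a name to an ordered list of unit direction vectors.
    Entries with a different element count are skipped in this sketch.
    """
    return [
        name
        for name, dirs in database.items()
        if len(dirs) == len(query) and direction_score(query, dirs) >= threshold
    ]
```

Only the entries returned by `prefilter` would be passed on to a slower atom-level alignment. Handling structures with different element counts or partial substructure matches would need an explicit mapping step that this sketch leaves out.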

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1098/3098053/049f9082b06c/1471-2105-11-155-1.jpg

Similar articles

Efficient and automated large-scale detection of structural relationships in proteins with a flexible aligner.

BMC Bioinformatics. 2016 Jan 5;17:20. doi: 10.1186/s12859-015-0866-8.

Navigating Among Known Structures in Protein Space.

Methods Mol Biol. 2019;1851:233-249. doi: 10.1007/978-1-4939-8736-8_12.

J Chem Inf Comput Sci. 2003 Mar-Apr;43(2):391-405. doi: 10.1021/ci025569t.

Can molecular dynamics simulations help in discriminating correct from erroneous protein 3D models?

BMC Bioinformatics. 2008 Jan 7;9:6. doi: 10.1186/1471-2105-9-6.

Structural alphabets for protein structure classification: a comparison study.

J Mol Biol. 2009 Mar 27;387(2):431-50. doi: 10.1016/j.jmb.2008.12.044. Epub 2008 Dec 25.

Scoring predictive models using a reduced representation of proteins: model and energy definition.

BMC Struct Biol. 2007 Mar 23;7:15. doi: 10.1186/1472-6807-7-15.

Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures.

Nucleic Acids Res. 2003 Jul 1;31(13):3341-4. doi: 10.1093/nar/gkg506.

Protein ligand-binding site comparison by a reduced vector representation derived from multidimensional scaling of generalized description of binding sites.

Methods. 2016 Jan 15;93:35-40. doi: 10.1016/j.ymeth.2015.08.007. Epub 2015 Aug 11.

A 3D sequence-independent representation of the protein data bank.

Protein Eng. 1995 Oct;8(10):981-97. doi: 10.1093/protein/8.10.981.

Cited by

Spectrum of neurodevelopmental disease associated with the GNAO1 guanosine triphosphate-binding region.

Epilepsia. 2019 Mar;60(3):406-418. doi: 10.1111/epi.14653. Epub 2019 Jan 25.

Towards an efficient compression of 3D coordinates of macromolecular structures.

PLoS One. 2017 Mar 31;12(3):e0174846. doi: 10.1371/journal.pone.0174846. eCollection 2017.

Exploring representations of protein structure for automated remote homology detection and mapping of protein structure space.

BMC Bioinformatics. 2014;15 Suppl 8(Suppl 8):S4. doi: 10.1186/1471-2105-15-S8-S4. Epub 2014 Jul 14.

deconSTRUCT: general purpose protein database search on the substructure level.

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W590-4. doi: 10.1093/nar/gkq489. Epub 2010 Jun 3.
