Suppr超能文献

基于系统发育树和简单半胱氨酸突变模型的二硫键连接性预测的进化观点

An Evolutionary View on Disulfide Bond Connectivities Prediction Using Phylogenetic Trees and a Simple Cysteine Mutation Model.

作者信息

Raimondi Daniele, Orlando Gabriele, Vranken Wim F

机构信息

Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium; Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium; Department of Structural Biology, VIB, Brussels, Belgium; Machine Learning Group, ULB, Brussels, Belgium.

Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium; Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium; Department of Structural Biology, VIB, Brussels, Belgium.

出版信息

PLoS One. 2015 Jul 10;10(7):e0131792. doi: 10.1371/journal.pone.0131792. eCollection 2015.

Abstract

Disulfide bonds are crucial for many structural and functional aspects of proteins. They have a stabilizing role during folding, can regulate enzymatic activity and can trigger allosteric changes in the protein structure. Moreover, knowledge of the topology of the disulfide connectivity can be relevant in genomic annotation tasks and can provide long range constraints for ab-initio protein structure predictors. In this paper we describe PhyloCys, a novel unsupervised predictor of disulfide bond connectivity from known cysteine oxidation states. For each query protein, PhyloCys retrieves and aligns homologs with HHblits and builds a phylogenetic tree using ClustalW. A simplified model of cysteine co-evolution is then applied to the tree in order to hypothesize the presence of oxidized cysteines in the inner nodes of the tree, which represent ancestral protein sequences. The tree is then traversed from the leaves to the root and the putative disulfide connectivity is inferred by observing repeated patterns of tandem mutations between a sequence and its ancestors. A final correction is applied using the Edmonds-Gabow maximum weight perfect matching algorithm. The evolutionary approach applied in PhyloCys results in disulfide bond predictions equivalent to Sephiroth, another approach that takes whole sequence information into account, and is 26-29% better than state of the art methods based on cysteine covariance patterns in multiple sequence alignments, while requiring one order of magnitude fewer homologous sequences (10(3) instead of 10(4)), thus extending its range of applicability. The software described in this article and the datasets used are available at http://ibsquare.be/phylocys.

摘要

二硫键对于蛋白质的许多结构和功能方面都至关重要。它们在蛋白质折叠过程中具有稳定作用,能够调节酶活性,并可引发蛋白质结构的变构变化。此外,二硫键连接拓扑结构的知识在基因组注释任务中可能具有相关性,并且可以为从头开始的蛋白质结构预测器提供长程约束。在本文中,我们描述了PhyloCys,一种基于已知半胱氨酸氧化态的新型二硫键连接性无监督预测器。对于每个查询蛋白质,PhyloCys使用HHblits检索并比对同源物,并使用ClustalW构建系统发育树。然后将半胱氨酸协同进化的简化模型应用于该树,以推测树内部节点(代表祖先蛋白质序列)中氧化半胱氨酸的存在。接着从叶节点到根节点遍历该树,并通过观察序列与其祖先之间串联突变的重复模式来推断推定的二硫键连接性。最后使用Edmonds-Gabow最大权重完美匹配算法进行校正。PhyloCys中应用的进化方法产生的二硫键预测结果与Sephiroth相当,Sephiroth是另一种考虑全序列信息的方法,并且比基于多序列比对中半胱氨酸协方差模式的现有方法好26 - 29%,同时所需的同源序列数量少一个数量级(10³而不是10⁴),从而扩展了其适用范围。本文所述软件及使用的数据集可在http://ibsquare.be/phylocys获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/548c/4498770/64f4ef906b49/pone.0131792.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验