Detection of discriminative sequence patterns in the neighborhood of proline cis peptide bonds and their functional annotation.

Author information

Exarchos Konstantinos P, Exarchos Themis P, Papaloukas Costas, Troganis Anastassios N, Fotiadis Dimitrios I

Affiliation

Unit of Medical Technology and Intelligent Information Systems, Department of Computer Science, University of Ioannina, Ioannina, Greece.

Publication information

BMC Bioinformatics. 2009 Apr 20;10:113. doi: 10.1186/1471-2105-10-113.

DOI:10.1186/1471-2105-10-113

PMID:19379512

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2678097/

Abstract

BACKGROUND

Polypeptides are composed of amino acids covalently linked by peptide bonds. Most peptide bonds in proteins occur in the trans conformation. Although they are infrequent, cis peptide bonds play a key role in protein structure and function, as well as in many significant biological processes.

RESULTS

We perform a systematic analysis of regions in protein sequences that contain a proline cis peptide bond, in order to discover non-random associations between the primary sequence and the nature of proline cis/trans isomerization. For this purpose, we employ an efficient pattern discovery algorithm that finds regular expression-type patterns that are overrepresented (i.e. frequently repeated) in a set of sequences. Four types of pattern discovery are performed: i) exact pattern discovery, ii) pattern discovery using a chemical equivalency set, iii) pattern discovery using a structural equivalency set and iv) pattern discovery using certain physicochemical properties of amino acids. The extracted patterns are validated using a specially implemented scoring function and a significance measure (a log-probability estimate) indicative of their specificity. The score threshold is 0.90 for the first three types of pattern discovery and 0.80 for the last type. As for the significance measure, all patterns yielded values in the range [-9, -31], which indicates that the derived patterns are highly unlikely to have emerged by chance. Most of the highest-scoring patterns are consistent with previous investigations of the neighborhood of cis proline peptide bonds, and many new ones are identified. Finally, the extracted patterns are systematically compared against the PROSITE database, in order to gain insight into the functional implications of cis prolyl bonds.
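As a rough illustration of pattern discovery type ii), the sketch below widens each residue of a candidate pattern to an equivalency class, turns it into a regular expression, and scores it by how strongly it favors cis windows over trans windows. The class definitions, the example windows, and the score (the fraction of matching windows that are cis) are assumptions made for illustration only; the abstract does not spell out the paper's own equivalency sets or scoring function.

```python
import re

# Hypothetical chemical equivalency classes (illustrative assumption,
# not the sets used in the paper).
EQUIV_CLASSES = ["AG", "DE", "KR", "NQ", "ST", "ILMV", "FY"]


def expand(residue):
    """Widen a single residue to its equivalency class, if it has one."""
    for cls in EQUIV_CLASSES:
        if residue in cls:
            return "[" + cls + "]"
    return residue


def to_regex(pattern):
    """Convert a pattern such as 'S.P' ('.' = any residue) to a compiled regex."""
    parts = ["." if ch == "." else expand(ch) for ch in pattern]
    return re.compile("".join(parts))


def count_matches(regex, windows):
    """Count the sequence windows that contain at least one match."""
    return sum(1 for w in windows if regex.search(w))


def score(pattern, cis_windows, trans_windows):
    """Fraction of matching windows that come from the cis set (0.0 if none match)."""
    regex = to_regex(pattern)
    cis = count_matches(regex, cis_windows)
    trans = count_matches(regex, trans_windows)
    total = cis + trans
    return cis / total if total else 0.0


# Made-up sequence windows around a proline, for demonstration only.
CIS_WINDOWS = ["GSYPA", "ATFPG", "LSYPD"]
TRANS_WINDOWS = ["GSAPA", "KLEPT", "ATYPQ"]

if __name__ == "__main__":
    print(to_regex("SYP").pattern)
    print(score("SYP", CIS_WINDOWS, TRANS_WINDOWS))
```

Under this toy score, a pattern would be retained only if its value reached a chosen threshold, analogous to the 0.90 and 0.80 cut-offs quoted above.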

CONCLUSION

Cis patterns with matches in the PROSITE database fell mostly into two main functional clusters: family signatures and protein signatures. However, considerable propensity was also observed for targeting signals, active sites and phosphorylation sites, as well as domain signatures.

Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d2f/2678097/743bdc10c9e1/1471-2105-10-113-1.jpg
