Suppr超能文献

预测拟南芥蛋白质的结晶倾向。

Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana.

作者信息

Yan Shaomin, Wu Guang

机构信息

State Key Laboratory of Non-food Biomass Enzyme Technology, National Engineering Research Center for Non-food Biorefinery, Guangxi Key Laboratory of Biorefinery, Guangxi Academy of Sciences, 98 Daling Road, Nanning, Guangxi 530007 China.

出版信息

Biol Proced Online. 2015 Nov 23;17:16. doi: 10.1186/s12575-015-0029-3. eCollection 2015.

Abstract

BACKGROUND

Many studies have correlated characteristics of amino acids with crystallization propensity, as part of the effort to determine the factors that affect the propensity of protein crystallization. However, these characteristics are constant; that is, the encoded amino acid sequences have the same value for each type of amino acid. To overcome this inflexibility, three dynamic characteristics of amino acids and protein were introduced to analyze the crystallization propensity of proteins. Both logistic regression and neural network models were used to correlate each of two dynamic characteristics with the crystallization propensity of 301 proteins from Arabidopsis thaliana, and their results were compared with those obtained from each of 531 constant amino acid characteristics, which served as the benchmark.

RESULTS

The neural network model was more powerful for predicting the crystallization propensity of proteins than the logistic regression model. Compared with the benchmark, the dynamic characteristics of amino acids provided good prediction results for the crystallization propensity, and the distribution probability gave the highest sensitivity. Using 90 % accuracy as a cutoff point, the predictable portion of A. thaliana portions was ranked, and the statistical analysis showed that the larger the predictable portion, the better the prediction.

CONCLUSIONS

These results demonstrate that dynamic characteristics have a certain relationship with the crystallization propensity, and they could be helpful for the prediction of protein crystallization, which may provide a theoretical concept for certain proteins before conducting experimental crystallization.

摘要

背景

作为确定影响蛋白质结晶倾向因素工作的一部分,许多研究已将氨基酸的特性与结晶倾向相关联。然而,这些特性是固定不变的;也就是说,编码的氨基酸序列对于每种类型的氨基酸都具有相同的值。为了克服这种僵化性,引入了氨基酸和蛋白质的三个动态特性来分析蛋白质的结晶倾向。使用逻辑回归和神经网络模型将两个动态特性中的每一个与来自拟南芥的301种蛋白质的结晶倾向相关联,并将它们的结果与作为基准的531种固定氨基酸特性中的每一个所获得的结果进行比较。

结果

神经网络模型在预测蛋白质结晶倾向方面比逻辑回归模型更强大。与基准相比,氨基酸的动态特性为结晶倾向提供了良好的预测结果,并且分布概率给出了最高的灵敏度。以90%的准确率作为截止点,对拟南芥部分的可预测部分进行了排名,统计分析表明可预测部分越大,预测效果越好。

结论

这些结果表明动态特性与结晶倾向有一定关系,它们可能有助于蛋白质结晶的预测,这可能为在进行实验结晶之前对某些蛋白质提供一个理论概念。

相似文献

1
Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana.
Biol Proced Online. 2015 Nov 23;17:16. doi: 10.1186/s12575-015-0029-3. eCollection 2015.
2
SVMCRYS: an SVM approach for the prediction of protein crystallization propensity from protein sequence.
Protein Pept Lett. 2010 Apr;17(4):423-30. doi: 10.2174/092986610790963726.
3
Meta prediction of protein crystallization propensity.
Biochem Biophys Res Commun. 2009 Dec 4;390(1):10-5. doi: 10.1016/j.bbrc.2009.09.036. Epub 2009 Sep 13.
4
RFCRYS: sequence-based protein crystallization propensity prediction by means of random forest.
J Theor Biol. 2012 Aug 7;306:115-9. doi: 10.1016/j.jtbi.2012.04.028. Epub 2012 May 2.
5
Randomness in crystallization of proteins from Staphylococcus aureus.
Protein Pept Lett. 2012 Jul;19(7):784-9. doi: 10.2174/092986612800793190.
6
Predicting RNA-binding sites in proteins using the interaction propensity of amino acid triplets.
Protein Pept Lett. 2010 Sep;17(9):1102-10. doi: 10.2174/092986610791760388.
8
fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization.
BMC Bioinformatics. 2018 Jan 3;18(1):580. doi: 10.1186/s12859-017-1995-z.
9
Crystallization of Arabidopsis thaliana acetohydroxyacid synthase in complex with the sulfonylurea herbicide chlorimuron ethyl.
Acta Crystallogr D Biol Crystallogr. 2004 Jan;60(Pt 1):153-5. doi: 10.1107/s0907444903025423. Epub 2003 Dec 18.
10
Predicting the Crystallization Propensity of Drug-Like Molecules.
J Pharm Sci. 2017 Jan;106(1):28-30. doi: 10.1016/j.xphs.2016.07.031. Epub 2016 Sep 22.

本文引用的文献

2
Possible roles for polycomb repressive complex 2 in cereal endosperm.
Front Plant Sci. 2015 Mar 12;6:144. doi: 10.3389/fpls.2015.00144. eCollection 2015.
3
Dual protein trafficking to secretory and non-secretory cell compartments: clear or double vision?
Plant Sci. 2015 May;234:174-9. doi: 10.1016/j.plantsci.2015.02.013. Epub 2015 Feb 26.
4
Circadian Clock Genes Universally Control Key Agricultural Traits.
Mol Plant. 2015 Aug;8(8):1135-52. doi: 10.1016/j.molp.2015.03.003. Epub 2015 Mar 13.
6
B-GATA transcription factors - insights into their structure, regulation, and role in plant development.
Front Plant Sci. 2015 Feb 23;6:90. doi: 10.3389/fpls.2015.00090. eCollection 2015.
8
Improving the chances of successful protein structure determination with a random forest classifier.
Acta Crystallogr D Biol Crystallogr. 2014 Mar;70(Pt 3):627-35. doi: 10.1107/S1399004713032070. Epub 2014 Feb 15.
10
Protein crystallization prediction with AdaBoost.
Int J Data Min Bioinform. 2013;7(2):214-27. doi: 10.1504/ijdmb.2013.053197.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验