Suppr超能文献

从蛋白质序列预测蛋白质结晶倾向。

Predicting protein crystallization propensity from protein sequence.

作者信息

Babnigg György, Joachimiak Andrzej

机构信息

Midwest Center for Structural Genomics, Biosciences Division, Argonne National Laboratory, 9700 S Cass Ave., Argonne, IL 60439, USA.

出版信息

J Struct Funct Genomics. 2010 Mar;11(1):71-80. doi: 10.1007/s10969-010-9080-0. Epub 2010 Feb 23.

Abstract

The high-throughput structure determination pipelines developed by structural genomics programs offer a unique opportunity for data mining. One important question is how protein properties derived from a primary sequence correlate with the protein's propensity to yield X-ray quality crystals (crystallizability) and 3D X-ray structures. A set of protein properties were computed for over 1,300 proteins that expressed well but were insoluble, and for approximately 720 unique proteins that resulted in X-ray structures. The correlation of the protein's iso-electric point and grand average hydropathy (GRAVY) with crystallizability was analyzed for full length and domain constructs of protein targets. In a second step, several additional properties that can be calculated from the protein sequence were added and evaluated. Using statistical analyses we have identified a set of the attributes correlating with a protein's propensity to crystallize and implemented a Support Vector Machine (SVM) classifier based on these. We have created applications to analyze and provide optimal boundary information for query sequences and to visualize the data. These tools are available via the web site http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor .

摘要

结构基因组学项目开发的高通量结构测定流程为数据挖掘提供了独特的机会。一个重要问题是,从一级序列推导的蛋白质特性如何与蛋白质产生X射线质量晶体的倾向(结晶性)及三维X射线结构相关。针对1300多种表达良好但不溶的蛋白质以及约720种产生了X射线结构的独特蛋白质,计算了一组蛋白质特性。针对蛋白质靶点的全长和结构域构建体,分析了蛋白质的等电点和总平均亲水性(GRAVY)与结晶性的相关性。在第二步中,添加并评估了可从蛋白质序列计算得出的其他几个特性。通过统计分析,我们确定了一组与蛋白质结晶倾向相关的属性,并基于这些属性实现了支持向量机(SVM)分类器。我们创建了应用程序,用于分析查询序列并提供最佳边界信息,以及可视化数据。这些工具可通过网站http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor获取。

相似文献

1
Predicting protein crystallization propensity from protein sequence.从蛋白质序列预测蛋白质结晶倾向。
J Struct Funct Genomics. 2010 Mar;11(1):71-80. doi: 10.1007/s10969-010-9080-0. Epub 2010 Feb 23.
6
Protein crystallizability.蛋白质结晶性
Methods Mol Biol. 2010;609:385-400. doi: 10.1007/978-1-60327-241-4_22.
8
Prediction of protein crystallization using collocation of amino acid pairs.利用氨基酸对的搭配预测蛋白质结晶
Biochem Biophys Res Commun. 2007 Apr 13;355(3):764-9. doi: 10.1016/j.bbrc.2007.02.040. Epub 2007 Feb 15.

引用本文的文献

2
Data collection from crystals grown in microfluidic droplets.从微流控液滴中生长的晶体中收集数据。
Acta Crystallogr D Struct Biol. 2022 Aug 1;78(Pt 8):997-1009. doi: 10.1107/S2059798322004661. Epub 2022 Jul 21.
4
Crystal Nucleation Using Surface-Energy-Modified Glass Substrates.使用表面能改性玻璃基板的晶体成核
Cryst Growth Des. 2017 Aug 2;17(8):4049-4055. doi: 10.1021/acs.cgd.7b00574. Epub 2017 Jul 21.
10
Computational crystallization.计算结晶学
Arch Biochem Biophys. 2016 Jul 15;602:12-20. doi: 10.1016/j.abb.2016.01.004. Epub 2016 Jan 11.

本文引用的文献

4
Target selection for structural genomics: an overview.结构基因组学的靶点选择:综述
Methods Mol Biol. 2008;426:3-25. doi: 10.1007/978-1-60327-058-8_1.
6
An overview of statistical learning theory.统计学习理论概述。
IEEE Trans Neural Netw. 1999;10(5):988-99. doi: 10.1109/72.788640.
7
AAindex: amino acid index database, progress report 2008.AAindex:氨基酸索引数据库,2008年进展报告。
Nucleic Acids Res. 2008 Jan;36(Database issue):D202-5. doi: 10.1093/nar/gkm998. Epub 2007 Nov 12.
10
XtalPred: a web server for prediction of protein crystallizability.XtalPred:一个用于预测蛋白质结晶性的网络服务器。
Bioinformatics. 2007 Dec 15;23(24):3403-5. doi: 10.1093/bioinformatics/btm477. Epub 2007 Oct 5.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验