• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从蛋白质序列预测蛋白质结晶倾向。

Predicting protein crystallization propensity from protein sequence.

作者信息

Babnigg György, Joachimiak Andrzej

机构信息

Midwest Center for Structural Genomics, Biosciences Division, Argonne National Laboratory, 9700 S Cass Ave., Argonne, IL 60439, USA.

出版信息

J Struct Funct Genomics. 2010 Mar;11(1):71-80. doi: 10.1007/s10969-010-9080-0. Epub 2010 Feb 23.

DOI:10.1007/s10969-010-9080-0
PMID:20177794
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3366497/
Abstract

The high-throughput structure determination pipelines developed by structural genomics programs offer a unique opportunity for data mining. One important question is how protein properties derived from a primary sequence correlate with the protein's propensity to yield X-ray quality crystals (crystallizability) and 3D X-ray structures. A set of protein properties were computed for over 1,300 proteins that expressed well but were insoluble, and for approximately 720 unique proteins that resulted in X-ray structures. The correlation of the protein's iso-electric point and grand average hydropathy (GRAVY) with crystallizability was analyzed for full length and domain constructs of protein targets. In a second step, several additional properties that can be calculated from the protein sequence were added and evaluated. Using statistical analyses we have identified a set of the attributes correlating with a protein's propensity to crystallize and implemented a Support Vector Machine (SVM) classifier based on these. We have created applications to analyze and provide optimal boundary information for query sequences and to visualize the data. These tools are available via the web site http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor .

摘要

结构基因组学项目开发的高通量结构测定流程为数据挖掘提供了独特的机会。一个重要问题是,从一级序列推导的蛋白质特性如何与蛋白质产生X射线质量晶体的倾向(结晶性)及三维X射线结构相关。针对1300多种表达良好但不溶的蛋白质以及约720种产生了X射线结构的独特蛋白质,计算了一组蛋白质特性。针对蛋白质靶点的全长和结构域构建体,分析了蛋白质的等电点和总平均亲水性(GRAVY)与结晶性的相关性。在第二步中,添加并评估了可从蛋白质序列计算得出的其他几个特性。通过统计分析,我们确定了一组与蛋白质结晶倾向相关的属性,并基于这些属性实现了支持向量机(SVM)分类器。我们创建了应用程序,用于分析查询序列并提供最佳边界信息,以及可视化数据。这些工具可通过网站http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/419c49580ca4/nihms194740f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/f531fdc2e765/nihms194740f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/a558e6b0c710/nihms194740f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/12e843ed12bf/nihms194740f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/8920a3e12de9/nihms194740f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/419c49580ca4/nihms194740f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/f531fdc2e765/nihms194740f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/a558e6b0c710/nihms194740f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/12e843ed12bf/nihms194740f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/8920a3e12de9/nihms194740f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0082/3366497/419c49580ca4/nihms194740f5.jpg

相似文献

1
Predicting protein crystallization propensity from protein sequence.从蛋白质序列预测蛋白质结晶倾向。
J Struct Funct Genomics. 2010 Mar;11(1):71-80. doi: 10.1007/s10969-010-9080-0. Epub 2010 Feb 23.
2
PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection.PredPPCrys:利用多步异构特征融合与选择从蛋白质序列准确预测序列克隆、蛋白质生产、纯化及结晶倾向。
PLoS One. 2014 Aug 22;9(8):e105902. doi: 10.1371/journal.pone.0105902. eCollection 2014.
3
SPINE: an integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics.SPINE:一种用于在高通量结构蛋白质组学中识别可行靶点的集成跟踪数据库和数据挖掘方法。
Nucleic Acids Res. 2001 Jul 1;29(13):2884-98. doi: 10.1093/nar/29.13.2884.
4
Crysalis: an integrated server for computational analysis and design of protein crystallization.Crysalis:用于蛋白质结晶计算分析与设计的集成服务器。
Sci Rep. 2016 Feb 24;6:21383. doi: 10.1038/srep21383.
5
SCMCRYS: predicting protein crystallization using an ensemble scoring card method with estimating propensity scores of P-collocated amino acid pairs.SCMCRYS:使用基于 P 位氨基酸对倾向得分估计的集成评分卡方法预测蛋白质结晶。
PLoS One. 2013 Sep 3;8(9):e72368. doi: 10.1371/journal.pone.0072368. eCollection 2013.
6
Protein crystallizability.蛋白质结晶性
Methods Mol Biol. 2010;609:385-400. doi: 10.1007/978-1-60327-241-4_22.
7
RFCRYS: sequence-based protein crystallization propensity prediction by means of random forest.RFCRYS:基于序列的蛋白质结晶倾向预测的随机森林方法。
J Theor Biol. 2012 Aug 7;306:115-9. doi: 10.1016/j.jtbi.2012.04.028. Epub 2012 May 2.
8
Prediction of protein crystallization using collocation of amino acid pairs.利用氨基酸对的搭配预测蛋白质结晶
Biochem Biophys Res Commun. 2007 Apr 13;355(3):764-9. doi: 10.1016/j.bbrc.2007.02.040. Epub 2007 Feb 15.
9
Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity.蛋白质结晶倾向预测的生物信息学工具的批判性评估。
Brief Bioinform. 2018 Sep 28;19(5):838-852. doi: 10.1093/bib/bbx018.
10
Will my protein crystallize? A sequence-based predictor.我的蛋白质会结晶吗?一种基于序列的预测器。
Proteins. 2006 Feb 1;62(2):343-55. doi: 10.1002/prot.20789.

引用本文的文献

1
Computational modeling of cyclotides as antimicrobial agents against PorB porin protein: integration of docking, immune, and molecular dynamics simulations.作为抗PorB孔蛋白的抗菌剂的环肽的计算建模:对接、免疫和分子动力学模拟的整合
Front Chem. 2024 Nov 25;12:1493165. doi: 10.3389/fchem.2024.1493165. eCollection 2024.
2
Data collection from crystals grown in microfluidic droplets.从微流控液滴中生长的晶体中收集数据。
Acta Crystallogr D Struct Biol. 2022 Aug 1;78(Pt 8):997-1009. doi: 10.1107/S2059798322004661. Epub 2022 Jul 21.
3
fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization.

本文引用的文献

1
Extracytoplasmic PAS-like domains are common in signal transduction proteins.细胞外 PAS 样结构域常见于信号转导蛋白中。
J Bacteriol. 2010 Feb;192(4):1156-9. doi: 10.1128/JB.01508-09. Epub 2009 Dec 11.
2
Understanding the physical properties that control protein crystallization by analysis of large-scale experimental data.通过分析大规模实验数据来理解控制蛋白质结晶的物理性质。
Nat Biotechnol. 2009 Jan;27(1):51-7. doi: 10.1038/nbt.1514.
3
Large-scale evaluation of protein reductive methylation for improving protein crystallization.
fDETECT 网页服务器:快速预测蛋白质生产、纯化和结晶的倾向。
BMC Bioinformatics. 2018 Jan 3;18(1):580. doi: 10.1186/s12859-017-1995-z.
4
Crystal Nucleation Using Surface-Energy-Modified Glass Substrates.使用表面能改性玻璃基板的晶体成核
Cryst Growth Des. 2017 Aug 2;17(8):4049-4055. doi: 10.1021/acs.cgd.7b00574. Epub 2017 Jul 21.
5
Survey of Predictors of Propensity for Protein Production and Crystallization with Application to Predict Resolution of Crystal Structures.蛋白质产生和结晶倾向预测因子的调查及其在预测晶体结构分辨率中的应用
Curr Protein Pept Sci. 2018;19(2):200-210. doi: 10.2174/1389203718666170921114437.
6
Databases, Repositories, and Other Data Resources in Structural Biology.结构生物学中的数据库、储存库及其他数据资源。
Methods Mol Biol. 2017;1607:643-665. doi: 10.1007/978-1-4939-7000-1_27.
7
The "Sticky Patch" Model of Crystallization and Modification of Proteins for Enhanced Crystallizability.用于增强结晶性的蛋白质结晶与修饰的“粘性补丁”模型
Methods Mol Biol. 2017;1607:77-115. doi: 10.1007/978-1-4939-7000-1_4.
8
Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity.蛋白质结晶倾向预测的生物信息学工具的批判性评估。
Brief Bioinform. 2018 Sep 28;19(5):838-852. doi: 10.1093/bib/bbx018.
9
Crysalis: an integrated server for computational analysis and design of protein crystallization.Crysalis:用于蛋白质结晶计算分析与设计的集成服务器。
Sci Rep. 2016 Feb 24;6:21383. doi: 10.1038/srep21383.
10
Computational crystallization.计算结晶学
Arch Biochem Biophys. 2016 Jul 15;602:12-20. doi: 10.1016/j.abb.2016.01.004. Epub 2016 Jan 11.
用于改善蛋白质结晶的蛋白质还原甲基化的大规模评估
Nat Methods. 2008 Oct;5(10):853-4. doi: 10.1038/nmeth1008-853.
4
Target selection for structural genomics: an overview.结构基因组学的靶点选择:综述
Methods Mol Biol. 2008;426:3-25. doi: 10.1007/978-1-60327-058-8_1.
5
ParCrys: a Parzen window density estimation approach to protein crystallization propensity prediction.ParCrys:一种用于蛋白质结晶倾向预测的Parzen窗密度估计方法。
Bioinformatics. 2008 Apr 1;24(7):901-7. doi: 10.1093/bioinformatics/btn055. Epub 2008 Feb 19.
6
An overview of statistical learning theory.统计学习理论概述。
IEEE Trans Neural Netw. 1999;10(5):988-99. doi: 10.1109/72.788640.
7
AAindex: amino acid index database, progress report 2008.AAindex:氨基酸索引数据库,2008年进展报告。
Nucleic Acids Res. 2008 Jan;36(Database issue):D202-5. doi: 10.1093/nar/gkm998. Epub 2007 Nov 12.
8
In situ proteolysis for protein crystallization and structure determination.用于蛋白质结晶和结构测定的原位蛋白酶解
Nat Methods. 2007 Dec;4(12):1019-21. doi: 10.1038/nmeth1118. Epub 2007 Nov 4.
9
The challenge of protein structure determination--lessons from structural genomics.蛋白质结构测定的挑战——来自结构基因组学的经验教训。
Protein Sci. 2007 Nov;16(11):2472-82. doi: 10.1110/ps.073037907.
10
XtalPred: a web server for prediction of protein crystallizability.XtalPred:一个用于预测蛋白质结晶性的网络服务器。
Bioinformatics. 2007 Dec 15;23(24):3403-5. doi: 10.1093/bioinformatics/btm477. Epub 2007 Oct 5.