• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于种子传播采样和留一法检验的蛋白质折叠类型预测分析

An analysis of protein folding type prediction by seed-propagated sampling and jackknife test.

作者信息

Zhang C T, Chou K C

机构信息

Department of Physics, Tianjin University, China.

出版信息

J Protein Chem. 1995 Oct;14(7):583-93. doi: 10.1007/BF01886884.

DOI:10.1007/BF01886884
PMID:8561854
Abstract

In the development of methodology for statistical prediction of protein folding types, how to test the predicted results is a crucial problem. In addition to the resubstitution test in which the folding type of each protein from a training set is predicted based on the rules derived from the same set, cross-validation tests are needed. Among them, the single-test-set method seems to be least reliable due to the arbitrariness in selecting the test set. Although the leaving-one-out (or jackknife) test is more objective and hence more reliable, it may cause a severe information loss by leaving a protein in turn out of the training set when its size is not large enough. In order to overcome the above drawback, a seed-propagated sampling approach is proposed that can be used to generate any number of simulated proteins with a desired type based on a given training set database. There is no need to make any predetermined assumption about the statistical distribution function of the amino acid frequencies. Combined with the existing cross-validation methods, the new technique may provide a more objective estimation for various protein-folding-type prediction methods.

摘要

在蛋白质折叠类型统计预测方法的发展过程中,如何检验预测结果是一个关键问题。除了基于从训练集中得出的规则对训练集中每个蛋白质的折叠类型进行预测的重代入检验外,还需要交叉验证检验。其中,单测试集方法由于在选择测试集时存在任意性,似乎是最不可靠的。虽然留一法(或刀切法)检验更客观,因此更可靠,但当训练集规模不够大时,每次将一个蛋白质排除在训练集之外可能会导致严重的信息损失。为了克服上述缺点,提出了一种种子传播抽样方法,该方法可用于基于给定的训练集数据库生成任意数量具有所需类型的模拟蛋白质。无需对氨基酸频率的统计分布函数做出任何预先假设。结合现有的交叉验证方法,新技术可能为各种蛋白质折叠类型预测方法提供更客观的估计。

相似文献

1
An analysis of protein folding type prediction by seed-propagated sampling and jackknife test.基于种子传播采样和留一法检验的蛋白质折叠类型预测分析
J Protein Chem. 1995 Oct;14(7):583-93. doi: 10.1007/BF01886884.
2
Monte Carlo simulation studies on the prediction of protein folding types from amino acid composition.基于氨基酸组成预测蛋白质折叠类型的蒙特卡罗模拟研究。
Biophys J. 1992 Dec;63(6):1523-9. doi: 10.1016/S0006-3495(92)81728-9.
3
Monte Carlo simulation studies on the prediction of protein folding types from amino acid composition. II. Correlative effect.
J Protein Chem. 1995 May;14(4):251-8. doi: 10.1007/BF01886766.
4
Estimation of protein folding probability from equilibrium simulations.通过平衡模拟估算蛋白质折叠概率。
J Chem Phys. 2005 May 8;122(18):184901. doi: 10.1063/1.1893753.
5
A new approach to predicting protein folding types.一种预测蛋白质折叠类型的新方法。
J Protein Chem. 1993 Apr;12(2):169-78. doi: 10.1007/BF01026038.
6
Does the folding type of a protein depend on its amino acid composition?
FEBS Lett. 1995 Apr 17;363(1-2):127-31. doi: 10.1016/0014-5793(95)00245-5.
7
Predicting protein folding types by distance functions that make allowances for amino acid interactions.
J Biol Chem. 1994 Sep 2;269(35):22014-20.
8
Folding of small proteins using a single continuous potential.利用单一连续势对小蛋白质进行折叠。
J Chem Phys. 2004 May 1;120(17):8271-6. doi: 10.1063/1.1689643.
9
Prediction of secondary structural content of proteins from their amino acid composition alone. I. New analytic vector decomposition methods.仅根据氨基酸组成预测蛋白质的二级结构含量。I. 新的分析向量分解方法。
Proteins. 1996 Jun;25(2):157-68. doi: 10.1002/(SICI)1097-0134(199606)25:2<157::AID-PROT2>3.0.CO;2-F.
10
Atomic hydration potentials using a Monte Carlo Reference State (MCRS) for protein solvation modeling.使用蒙特卡罗参考态(MCRS)进行蛋白质溶剂化建模的原子水合势
BMC Struct Biol. 2007 Mar 30;7:19. doi: 10.1186/1472-6807-7-19.

引用本文的文献

1
iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets.iPPBS-Opt:一种基于序列的集成分类器,通过优化不平衡训练数据集来识别蛋白质-蛋白质结合位点。
Molecules. 2016 Jan 19;21(1):E95. doi: 10.3390/molecules21010095.
2
Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana.预测拟南芥蛋白质的结晶倾向。
Biol Proced Online. 2015 Nov 23;17:16. doi: 10.1186/s12575-015-0029-3. eCollection 2015.
3
Characterization of protein secondary structure from NMR chemical shifts.

本文引用的文献

1
Prediction of protein folding class from amino acid composition.基于氨基酸组成预测蛋白质折叠类别。
Proteins. 1993 May;16(1):79-91. doi: 10.1002/prot.340160109.
2
Cross-validation of protein structural class prediction using statistical clustering and neural networks.使用统计聚类和神经网络对蛋白质结构类别预测进行交叉验证。
Protein Sci. 1993 Jul;2(7):1171-82. doi: 10.1002/pro.5560020712.
3
Protein folding classes: a geometric interpretation of the amino acid composition of globular proteins.蛋白质折叠类别:球状蛋白质氨基酸组成的几何解释
通过核磁共振化学位移表征蛋白质二级结构
Prog Nucl Magn Reson Spectrosc. 2009 Apr 5;54(3-4):141-165. doi: 10.1016/j.pnmrs.2008.06.002.
4
Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.基于蛋白质的氨基酸和二肽组成对基因表达水平进行相关性分析与预测。
BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.
5
Analysis and prediction of affinity of TAP binding peptides using cascade SVM.使用级联支持向量机分析和预测TAP结合肽的亲和力
Protein Sci. 2004 Mar;13(3):596-607. doi: 10.1110/ps.03373104.
Protein Eng. 1994 Mar;7(3):319-30. doi: 10.1093/protein/7.3.319.
4
Predicting protein folding types by distance functions that make allowances for amino acid interactions.
J Biol Chem. 1994 Sep 2;269(35):22014-20.
5
A novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space.
Proteins. 1995 Apr;21(4):319-44. doi: 10.1002/prot.340210406.
6
The folding type of a protein is relevant to the amino acid composition.蛋白质的折叠类型与氨基酸组成有关。
J Biochem. 1986 Jan;99(1):153-62. doi: 10.1093/oxfordjournals.jbchem.a135454.
7
Prediction of protein structural class by discriminant analysis.通过判别分析预测蛋白质结构类别。
Biochim Biophys Acta. 1986 Nov 21;874(2):205-15. doi: 10.1016/0167-4838(86)90119-6.
8
Prediction of protein structural class from the amino acid sequence.从氨基酸序列预测蛋白质结构类别。
Biopolymers. 1986 Sep;25(9):1659-72. doi: 10.1002/bip.360250909.
9
A correlation-coefficient method to predicting protein-structural classes from amino acid compositions.
Eur J Biochem. 1992 Jul 15;207(2):429-3. doi: 10.1111/j.1432-1033.1992.tb17067.x.
10
Predicting protein secondary structure content. A tandem neural network approach.
J Mol Biol. 1992 Jun 5;225(3):713-27. doi: 10.1016/0022-2836(92)90396-2.