• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用统计聚类和神经网络对蛋白质结构类别预测进行交叉验证。

Cross-validation of protein structural class prediction using statistical clustering and neural networks.

作者信息

Metfessel B A, Saurugger P N, Connelly D P, Rich S S

机构信息

Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis 55455.

出版信息

Protein Sci. 1993 Jul;2(7):1171-82. doi: 10.1002/pro.5560020712.

DOI:10.1002/pro.5560020712
PMID:8358300
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2142422/
Abstract

We present an approach to predicting protein structural class that uses amino acid composition and hydrophobic pattern frequency information as input to two types of neural networks: (1) a three-layer back-propagation network and (2) a learning vector quantization network. The results of these methods are compared to those obtained from a modified Euclidean statistical clustering algorithm. The protein sequence data used to drive these algorithms consist of the normalized frequency of up to 20 amino acid types and six hydrophobic amino acid patterns. From these frequency values the structural class predictions for each protein (all-alpha, all-beta, or alpha-beta classes) are derived. Examples consisting of 64 previously classified proteins were randomly divided into multiple training (56 proteins) and test (8 proteins) sets. The best performing algorithm on the test sets was the learning vector quantization network using 17 inputs, obtaining a prediction accuracy of 80.2%. The Matthews correlation coefficients are statistically significant for all algorithms and all structural classes. The differences between algorithms are in general not statistically significant. These results show that information exists in protein primary sequences that is easily obtainable and useful for the prediction of protein structural class by neural networks as well as by standard statistical clustering algorithms.

摘要

我们提出了一种预测蛋白质结构类别的方法,该方法将氨基酸组成和疏水模式频率信息作为输入,应用于两种类型的神经网络:(1)一个三层反向传播网络和(2)一个学习矢量量化网络。将这些方法的结果与通过改进的欧几里得统计聚类算法获得的结果进行比较。用于驱动这些算法的蛋白质序列数据由多达20种氨基酸类型的归一化频率和六种疏水氨基酸模式组成。根据这些频率值得出每种蛋白质(全α、全β或α-β类)的结构类预测。由64个先前分类的蛋白质组成的示例被随机分为多个训练集(56个蛋白质)和测试集(8个蛋白质)。在测试集上表现最佳的算法是使用17个输入的学习矢量量化网络,预测准确率为80.2%。所有算法和所有结构类别的马修斯相关系数均具有统计学意义。算法之间的差异一般无统计学意义。这些结果表明,蛋白质一级序列中存在易于获取的信息,这些信息对于通过神经网络以及标准统计聚类算法预测蛋白质结构类别很有用。

相似文献

1
Cross-validation of protein structural class prediction using statistical clustering and neural networks.使用统计聚类和神经网络对蛋白质结构类别预测进行交叉验证。
Protein Sci. 1993 Jul;2(7):1171-82. doi: 10.1002/pro.5560020712.
2
Prediction of beta-turns in proteins from multiple alignment using neural network.利用神经网络从多序列比对预测蛋白质中的β-转角。
Protein Sci. 2003 Mar;12(3):627-34. doi: 10.1110/ps.0228903.
3
Neural networks for secondary structure and structural class predictions.用于二级结构和结构类别预测的神经网络。
Protein Sci. 1995 Feb;4(2):275-85. doi: 10.1002/pro.5560040214.
4
A neural network method for prediction of beta-turn types in proteins using evolutionary information.一种利用进化信息预测蛋白质中β-转角类型的神经网络方法。
Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.
5
Prediction of protein folding class using global description of amino acid sequence.利用氨基酸序列的全局描述预测蛋白质折叠类别。
Proc Natl Acad Sci U S A. 1995 Sep 12;92(19):8700-4. doi: 10.1073/pnas.92.19.8700.
6
Prediction of the location and type of beta-turns in proteins using neural networks.使用神经网络预测蛋白质中β-转角的位置和类型。
Protein Sci. 1999 May;8(5):1045-55. doi: 10.1110/ps.8.5.1045.
7
The importance of larger data sets for protein secondary structure prediction with neural networks.更大数据集对神经网络进行蛋白质二级结构预测的重要性。
Protein Sci. 1996 Apr;5(4):768-74. doi: 10.1002/pro.5560050422.
8
Protein secondary structure prediction with SPARROW.利用 SPARROW 进行蛋白质二级结构预测。
J Chem Inf Model. 2012 Feb 27;52(2):545-56. doi: 10.1021/ci200321u. Epub 2012 Jan 23.
9
Prediction of beta-turns and beta-turn types by a novel bidirectional Elman-type recurrent neural network with multiple output layers (MOLEBRNN).利用具有多个输出层的新型双向埃尔曼型递归神经网络(MOLEBRNN)预测β-转角和β-转角类型。
Gene. 2008 Oct 1;422(1-2):22-9. doi: 10.1016/j.gene.2008.06.008. Epub 2008 Jun 10.
10
Prediction of protein structure classes with flexible neural tree.使用灵活神经树预测蛋白质结构类别。
Biomed Mater Eng. 2014;24(6):3797-806. doi: 10.3233/BME-141209.

引用本文的文献

1
Identification of Cancerlectins Using Support Vector Machines With Fusion of G-Gap Dipeptide.使用融合G-间隙二肽的支持向量机鉴定癌凝集素。
Front Genet. 2020 Apr 3;11:275. doi: 10.3389/fgene.2020.00275. eCollection 2020.
2
Some remarks on protein attribute prediction and pseudo amino acid composition.关于蛋白质属性预测和伪氨基酸组成的一些说明。
J Theor Biol. 2011 Mar 21;273(1):236-47. doi: 10.1016/j.jtbi.2010.12.024. Epub 2010 Dec 17.
3
Characterization of protein secondary structure from NMR chemical shifts.通过核磁共振化学位移表征蛋白质二级结构
Prog Nucl Magn Reson Spectrosc. 2009 Apr 5;54(3-4):141-165. doi: 10.1016/j.pnmrs.2008.06.002.
4
A time-series-based feature extraction approach for prediction of protein structural class.一种基于时间序列的蛋白质结构类别预测特征提取方法。
EURASIP J Bioinform Syst Biol. 2008;2008(1):235451. doi: 10.1155/2008/235451.
5
Prediction of protein structural class with Rough Sets.基于粗糙集的蛋白质结构类预测
BMC Bioinformatics. 2006 Jan 14;7:20. doi: 10.1186/1471-2105-7-20.
6
Support vector machines for predicting protein structural class.用于预测蛋白质结构类别的支持向量机
BMC Bioinformatics. 2001;2:3. doi: 10.1186/1471-2105-2-3. Epub 2001 Jun 29.
7
Using pair-coupled amino acid composition to predict protein secondary structure content.利用配对耦合氨基酸组成预测蛋白质二级结构含量。
J Protein Chem. 1999 May;18(4):473-80. doi: 10.1023/a:1020696810938.
8
An intriguing controversy over protein structural class prediction.关于蛋白质结构类别预测的一场有趣争论。
J Protein Chem. 1998 Nov;17(8):729-38. doi: 10.1023/a:1020713915365.
9
An eigenvalue-eigenvector approach to predicting protein folding types.一种预测蛋白质折叠类型的特征值-特征向量方法。
J Protein Chem. 1995 Jul;14(5):309-26. doi: 10.1007/BF01886788.
10
An analysis of protein folding type prediction by seed-propagated sampling and jackknife test.基于种子传播采样和留一法检验的蛋白质折叠类型预测分析
J Protein Chem. 1995 Oct;14(7):583-93. doi: 10.1007/BF01886884.

本文引用的文献

1
Use of helical wheels to represent the structures of proteins and to identify segments with helical potential.使用螺旋轮来表示蛋白质的结构并识别具有螺旋潜力的片段。
Biophys J. 1967 Mar;7(2):121-35. doi: 10.1016/S0006-3495(67)86579-2.
2
Algorithms for prediction of alpha-helical and beta-structural regions in globular proteins.预测球状蛋白质中α螺旋和β结构区域的算法。
J Mol Biol. 1974 Oct 5;88(4):873-94. doi: 10.1016/0022-2836(74)90405-7.
3
Amino acid composition and hydrophobicity patterns of protein domains correlate with their structures.蛋白质结构域的氨基酸组成和疏水性模式与其结构相关。
Biopolymers. 1985 Oct;24(10):1995-2023. doi: 10.1002/bip.360241011.
4
Hydrophobicity of amino acid residues in globular proteins.球状蛋白质中氨基酸残基的疏水性
Science. 1985 Aug 30;229(4716):834-8. doi: 10.1126/science.4023714.
5
Prediction of protein structural class by discriminant analysis.通过判别分析预测蛋白质结构类别。
Biochim Biophys Acta. 1986 Nov 21;874(2):205-15. doi: 10.1016/0167-4838(86)90119-6.
6
Prediction of protein structural class from the amino acid sequence.从氨基酸序列预测蛋白质结构类别。
Biopolymers. 1986 Sep;25(9):1659-72. doi: 10.1002/bip.360250909.
7
An algorithm for protein secondary structure prediction based on class prediction.一种基于类别预测的蛋白质二级结构预测算法。
Protein Eng. 1987 Aug-Sep;1(4):289-94. doi: 10.1093/protein/1.4.289.
8
Predicting the secondary structure of globular proteins using neural network models.使用神经网络模型预测球状蛋白质的二级结构。
J Mol Biol. 1988 Aug 20;202(4):865-84. doi: 10.1016/0022-2836(88)90564-5.
9
Improvements in protein secondary structure prediction by an enhanced neural network.通过增强神经网络改进蛋白质二级结构预测
J Mol Biol. 1990 Jul 5;214(1):171-82. doi: 10.1016/0022-2836(90)90154-E.
10
Net result: folded protein. A neural network deciphers the structure of protein.最终结果:折叠蛋白。神经网络解析蛋白质的结构。
Sci Am. 1990 Apr;262(4):24-6. doi: 10.1038/scientificamerican0490-24b.