
Predicting protein structural classes from amino acid composition: application of fuzzy clustering.

Authors

Zhang C T, Chou K C, Maggiora G M

Affiliation

Department of Physics, Tianjin University, China.

Publication

Protein Eng. 1995 May;8(5):425-35. doi: 10.1093/protein/8.5.425.

DOI:10.1093/protein/8.5.425
PMID:8532663
Abstract

Most globular proteins can be classified into one of four structural classes--all-alpha, all-beta, alpha + beta and alpha/beta--depending upon the type, amount and arrangement of secondary structures present. In this work a new method, based upon fuzzy clustering, is proposed for predicting the structural class of a protein from its amino acid composition. Here, each of the structural classes is described by a fuzzy cluster and each protein is characterized by its membership degree, a number between zero and one in each of the four clusters, with the constraint that the sum of the membership degrees equals unity. A given protein is then classified as belonging to that structural class corresponding to the fuzzy cluster with maximum membership degree. Calculation of membership degrees is carried out using the fuzzy c-means algorithm on a training set of 64 proteins. Results obtained for the training set show that the fuzzy clustering approach produces results comparable with or better than those obtained by other methods. A test set of 27 proteins also produced comparable results to those obtained with the training set. The success of the present preliminary work on protein structure class prediction suggests that further refinements of method may lead to improved predictions and this is currently being investigated.
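
The membership calculation described in the abstract can be sketched as follows. This is a minimal, textbook fuzzy c-means in plain Python, not the authors' implementation: the abstract does not give the fuzzifier, the distance measure, or how clusters are tied to class names, so the fuzzifier `m = 2`, Euclidean distance, random initialization, and all function names here are assumptions made for illustration.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def composition(sequence):
    """Return the 20-component amino acid composition (fractions summing to 1)."""
    counts = [sequence.count(a) for a in AMINO_ACIDS]
    total = sum(counts)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [c / total for c in counts]


def memberships(point, centers, m=2.0):
    """Membership degree of one point in each cluster; the degrees sum to 1."""
    dists = [math.dist(point, c) for c in centers]
    # A point lying exactly on one or more centers is shared among them.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    exponent = 2.0 / (m - 1.0)
    inverse = [d ** -exponent for d in dists]
    total = sum(inverse)
    return [v / total for v in inverse]


def fuzzy_c_means(points, n_clusters, m=2.0, max_iter=100, tol=1e-6, seed=0):
    """Alternate membership and center updates until the centers stop moving."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, n_clusters)]
    dim = len(points[0])
    for _ in range(max_iter):
        u = [memberships(p, centers, m) for p in points]
        new_centers = []
        for k in range(n_clusters):
            weights = [row[k] ** m for row in u]
            total = sum(weights)
            new_centers.append(
                [sum(w * p[d] for w, p in zip(weights, points)) / total
                 for d in range(dim)]
            )
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers


def classify(point, centers, m=2.0):
    """Return (index of the cluster with maximum membership, all memberships)."""
    u = memberships(point, centers, m)
    best = max(range(len(u)), key=u.__getitem__)
    return best, u


if __name__ == "__main__":
    # Toy 2-D points stand in for 20-D composition vectors (illustration only).
    toy = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
           [1.0, 1.0], [0.9, 1.0], [1.0, 0.9]]
    centers = fuzzy_c_means(toy, n_clusters=2)
    print(classify([0.05, 0.05], centers))
```

For the setting in the abstract, `points` would be the composition vectors of the 64 training proteins and `n_clusters` would be 4. Plain fuzzy c-means is unsupervised, so relating each cluster to a structural class (for instance by the majority class among its training proteins) is an additional step that this sketch leaves out.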

Similar articles

1. Predicting protein structural classes from amino acid composition: application of fuzzy clustering.
   Protein Eng. 1995 May;8(5):425-35. doi: 10.1093/protein/8.5.425.
2. Using supervised fuzzy clustering to predict protein structural classes.
   Biochem Biophys Res Commun. 2005 Aug 26;334(2):577-81. doi: 10.1016/j.bbrc.2005.06.128.
3. Accurate prediction of protein secondary structural class with fuzzy structural vectors.
   Protein Eng. 1995 Jun;8(6):505-12. doi: 10.1093/protein/8.6.505.
4. A weighting method for predicting protein structural class from amino acid composition.
   Eur J Biochem. 1992 Dec 15;210(3):747-9. doi: 10.1111/j.1432-1033.1992.tb17476.x.
5. Prediction of protein structural classes.
   Crit Rev Biochem Mol Biol. 1995;30(4):275-349. doi: 10.3109/10409239509083488.
6. Fuzzy KNN for predicting membrane protein types from pseudo-amino acid composition.
   J Theor Biol. 2006 May 7;240(1):9-13. doi: 10.1016/j.jtbi.2005.08.016. Epub 2005 Sep 28.
7. A novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space.
   Proteins. 1995 Apr;21(4):319-44. doi: 10.1002/prot.340210406.
8. Fuzzy cluster analysis of simple physicochemical properties of amino acids for recognizing secondary structure in proteins.
   Protein Sci. 1995 Jun;4(6):1178-87. doi: 10.1002/pro.5560040616.
9. Using pseudo amino acid composition to predict protein structural class: approached by incorporating 400 dipeptide components.
   J Comput Chem. 2007 Jul 15;28(9):1463-1466. doi: 10.1002/jcc.20554.
10. Fuzzy cluster analysis of molecular dynamics trajectories.
    Proteins. 1992 Oct;14(2):249-64. doi: 10.1002/prot.340140211.

Cited by

1. Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion.
   BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):701. doi: 10.1186/s12859-019-3276-5.
2. iGPCR-drug: a web server for predicting interaction between GPCRs and drugs in cellular networking.
   PLoS One. 2013 Aug 27;8(8):e72234. doi: 10.1371/journal.pone.0072234. eCollection 2013.
3. Protein sequences classification by means of feature extraction with substitution matrices.
   BMC Bioinformatics. 2010 Apr 8;11:175. doi: 10.1186/1471-2105-11-175.
4. Characterization of protein secondary structure from NMR chemical shifts.
   Prog Nucl Magn Reson Spectrosc. 2009 Apr 5;54(3-4):141-165. doi: 10.1016/j.pnmrs.2008.06.002.
5. Semi-supervised protein subcellular localization.
   BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S47. doi: 10.1186/1471-2105-10-S1-S47.