• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

氨基酸出现情况在区分球状蛋白质不同折叠类型中的应用。

Application of amino acid occurrence for discriminating different folding types of globular proteins.

作者信息

Taguchi Y-h, Gromiha M Michael

机构信息

Department of Physics, Faculty of Science and Technology, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan.

出版信息

BMC Bioinformatics. 2007 Oct 22;8:404. doi: 10.1186/1471-2105-8-404.

DOI:10.1186/1471-2105-8-404
PMID:17953741
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2174517/
Abstract

BACKGROUND

Predicting the three-dimensional structure of a protein from its amino acid sequence is a long-standing goal in computational/molecular biology. The discrimination of different structural classes and folding types are intermediate steps in protein structure prediction.

RESULTS

In this work, we have proposed a method based on linear discriminant analysis (LDA) for discriminating 30 different folding types of globular proteins using amino acid occurrence. Our method was tested with a non-redundant set of 1612 proteins and it discriminated them with the accuracy of 38%, which is comparable to or better than other methods in the literature. A web server has been developed for discriminating the folding type of a query protein from its amino acid sequence and it is available at http://granular.com/PROLDA/.

CONCLUSION

Amino acid occurrence has been successfully used to discriminate different folding types of globular proteins. The discrimination accuracy obtained with amino acid occurrence is better than that obtained with amino acid composition and/or amino acid properties. In addition, the method is very fast to obtain the results.

摘要

背景

从氨基酸序列预测蛋白质的三维结构是计算生物学/分子生物学中长期以来的目标。区分不同的结构类别和折叠类型是蛋白质结构预测中的中间步骤。

结果

在这项工作中,我们提出了一种基于线性判别分析(LDA)的方法,利用氨基酸出现频率来区分30种不同的球状蛋白质折叠类型。我们的方法用一组1612个非冗余蛋白质进行了测试,其区分这些蛋白质的准确率为38%,与文献中的其他方法相当或更好。已开发出一个网络服务器,用于根据查询蛋白质的氨基酸序列来区分其折叠类型,可在http://granular.com/PROLDA/获取。

结论

氨基酸出现频率已成功用于区分不同的球状蛋白质折叠类型。使用氨基酸出现频率获得的区分准确率优于使用氨基酸组成和/或氨基酸性质获得的准确率。此外,该方法获取结果的速度非常快。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/5ea1c6549099/1471-2105-8-404-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/9e073b26408e/1471-2105-8-404-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/6c96ddbb4d64/1471-2105-8-404-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/5ea1c6549099/1471-2105-8-404-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/9e073b26408e/1471-2105-8-404-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/6c96ddbb4d64/1471-2105-8-404-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d64/2174517/5ea1c6549099/1471-2105-8-404-3.jpg

相似文献

1
Application of amino acid occurrence for discriminating different folding types of globular proteins.氨基酸出现情况在区分球状蛋白质不同折叠类型中的应用。
BMC Bioinformatics. 2007 Oct 22;8:404. doi: 10.1186/1471-2105-8-404.
2
Influence of amino acid properties for discriminating outer membrane proteins at better accuracy.氨基酸特性对更准确鉴别外膜蛋白的影响。
Biochim Biophys Acta. 2006 Sep;1764(9):1493-7. doi: 10.1016/j.bbapap.2006.07.005. Epub 2006 Jul 31.
3
Analysis and prediction of protein folding rates using quadratic response surface models.使用二次响应面模型对蛋白质折叠速率进行分析和预测。
J Comput Chem. 2008 Jul 30;29(10):1675-83. doi: 10.1002/jcc.20925.
4
Discrimination of outer membrane proteins using machine learning algorithms.使用机器学习算法鉴别外膜蛋白。
Proteins. 2006 Jun 1;63(4):1031-7. doi: 10.1002/prot.20929.
5
Application of residue distribution along the sequence for discriminating outer membrane proteins.沿序列的残基分布在鉴别外膜蛋白中的应用
Comput Biol Chem. 2005 Apr;29(2):135-42. doi: 10.1016/j.compbiolchem.2005.02.006.
6
Influence of medium- and long-range interactions in different folding types of globular proteins.中远程相互作用对不同折叠类型球状蛋白的影响。
Biophys Chem. 2002 Oct 16;99(2):189-98. doi: 10.1016/s0301-4622(02)00183-7.
7
Discrimination of outer membrane proteins using support vector machines.使用支持向量机鉴别外膜蛋白。
Bioinformatics. 2005 Dec 1;21(23):4223-9. doi: 10.1093/bioinformatics/bti697. Epub 2005 Oct 4.
8
FOLD-RATE: prediction of protein folding rates from amino acid sequence.折叠速率:从氨基酸序列预测蛋白质折叠速率。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W70-4. doi: 10.1093/nar/gkl043.
9
NIAS-Server: Neighbors Influence of Amino acids and Secondary Structures in Proteins.NIAS服务器:蛋白质中氨基酸和二级结构的邻域影响
J Comput Biol. 2017 Mar;24(3):255-265. doi: 10.1089/cmb.2016.0074. Epub 2016 Aug 5.
10
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.

引用本文的文献

1
Computational Analysis Predicts Correlations among Amino Acids in SARS-CoV-2 Proteomes.计算分析预测了新冠病毒蛋白质组中氨基酸之间的相关性。
Biomedicines. 2023 Feb 10;11(2):512. doi: 10.3390/biomedicines11020512.
2
DeepFrag-k: a fragment-based deep learning approach for protein fold recognition.DeepFrag-k:一种用于蛋白质折叠识别的基于片段的深度学习方法。
BMC Bioinformatics. 2020 Nov 18;21(Suppl 6):203. doi: 10.1186/s12859-020-3504-z.
3
HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features.HMMBinder:基于 HMM -profile 特征的 DNA 结合蛋白预测。

本文引用的文献

1
A statistical method for predicting protein unfolding rates from amino acid sequence.一种从氨基酸序列预测蛋白质解折叠速率的统计方法。
J Chem Inf Model. 2006 May-Jun;46(3):1503-8. doi: 10.1021/ci050417u.
2
Ensemble classifier for protein fold pattern recognition.用于蛋白质折叠模式识别的集成分类器。
Bioinformatics. 2006 Jul 15;22(14):1717-22. doi: 10.1093/bioinformatics/btl170. Epub 2006 May 3.
3
Amino Acid Principal Component Analysis (AAPCA) and its applications in protein structural class prediction.氨基酸主成分分析(AAPCA)及其在蛋白质结构类别预测中的应用。
Biomed Res Int. 2017;2017:4590609. doi: 10.1155/2017/4590609. Epub 2017 Nov 14.
4
GRAFENE: Graphlet-based alignment-free network approach integrates 3D structural and sequence (residue order) data to improve protein structural comparison.基于图元的无比对网络方法整合了 3D 结构和序列(残基顺序)数据,以改进蛋白质结构比对。
Sci Rep. 2017 Nov 2;7(1):14890. doi: 10.1038/s41598-017-14411-y.
5
Highly Accurate Prediction of Protein-Protein Interactions via Incorporating Evolutionary Information and Physicochemical Characteristics.通过整合进化信息和物理化学特征实现蛋白质-蛋白质相互作用的高精度预测
Int J Mol Sci. 2016 Aug 25;17(9):1396. doi: 10.3390/ijms17091396.
6
Improving protein fold recognition using the amalgamation of evolutionary-based and structural based information.利用基于进化和基于结构的信息融合来改进蛋白质折叠识别。
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S12. doi: 10.1186/1471-2105-15-S16-S12. Epub 2014 Dec 8.
7
Evaluation of sequence features from intrinsically disordered regions for the estimation of protein function.评估来自内在无序区域的序列特征以估计蛋白质功能。
PLoS One. 2014 Feb 24;9(2):e89890. doi: 10.1371/journal.pone.0089890. eCollection 2014.
8
A strategy to select suitable physicochemical attributes of amino acids for protein fold recognition.氨基酸理化属性选择用于蛋白质折叠识别的策略。
BMC Bioinformatics. 2013 Jul 24;14:233. doi: 10.1186/1471-2105-14-233.
9
Prediction of functionally important residues in globular proteins from unusual central distances of amino acids.从氨基酸的异常中心距离预测球状蛋白质中功能重要的残基。
BMC Struct Biol. 2011 Sep 18;11:34. doi: 10.1186/1472-6807-11-34.
10
Fold homology detection using sequence fragment composition profiles of proteins.使用蛋白质序列片段组成特征来检测折叠同源性。
Proteins. 2010 Oct;78(13):2745-56. doi: 10.1002/prot.22788.
J Biomol Struct Dyn. 2006 Jun;23(6):635-40. doi: 10.1080/07391102.2006.10507088.
4
Proteins of the same fold and unrelated sequences have similar amino acid composition.具有相同折叠结构但序列不相关的蛋白质具有相似的氨基酸组成。
Proteins. 2006 Jul 1;64(1):275-9. doi: 10.1002/prot.20964.
5
A machine learning information retrieval approach to protein fold recognition.一种用于蛋白质折叠识别的机器学习信息检索方法。
Bioinformatics. 2006 Jun 15;22(12):1456-63. doi: 10.1093/bioinformatics/btl102. Epub 2006 Mar 17.
6
Using supervised fuzzy clustering to predict protein structural classes.使用监督模糊聚类预测蛋白质结构类别。
Biochem Biophys Res Commun. 2005 Aug 26;334(2):577-81. doi: 10.1016/j.bbrc.2005.06.128.
7
A simple statistical method for discriminating outer membrane proteins with better accuracy.一种用于更准确地鉴别外膜蛋白的简单统计方法。
Bioinformatics. 2005 Apr 1;21(7):961-8. doi: 10.1093/bioinformatics/bti126. Epub 2004 Nov 5.
8
Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.通过结合从进化中获得的序列概况和片段的深度依赖结构比对来进行折叠识别。
Proteins. 2005 Feb 1;58(2):321-8. doi: 10.1002/prot.20308.
9
Predicting subcellular localization of proteins in a hybridization space.预测杂交空间中蛋白质的亚细胞定位。
Bioinformatics. 2004 May 1;20(7):1151-6. doi: 10.1093/bioinformatics/bth054. Epub 2004 Feb 5.
10
Importance of native-state topology for determining the folding rate of two-state proteins.天然态拓扑结构对确定两态蛋白质折叠速率的重要性。
J Chem Inf Comput Sci. 2003 Sep-Oct;43(5):1481-5. doi: 10.1021/ci0340308.