• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iHSP-PseRAAAC:利用伪简约氨基酸字母组成鉴定热休克蛋白家族。

iHSP-PseRAAAC: Identifying the heat shock protein families using pseudo reduced amino acid alphabet composition.

作者信息

Feng Peng-Mian, Chen Wei, Lin Hao, Chou Kuo-Chen

机构信息

School of Public Health, Hebei United University, Tangshan 063000, China.

出版信息

Anal Biochem. 2013 Nov 1;442(1):118-25. doi: 10.1016/j.ab.2013.05.024. Epub 2013 Jun 10.

DOI:10.1016/j.ab.2013.05.024
PMID:23756733
Abstract

Heat shock proteins (HSPs) are a type of functionally related proteins present in all living organisms, both prokaryotes and eukaryotes. They play essential roles in protein-protein interactions such as folding and assisting in the establishment of proper protein conformation and prevention of unwanted protein aggregation. Their dysfunction may cause various life-threatening disorders, such as Parkinson's, Alzheimer's, and cardiovascular diseases. Based on their functions, HSPs are usually classified into six families: (i) HSP20 or sHSP, (ii) HSP40 or J-class proteins, (iii) HSP60 or GroEL/ES, (iv) HSP70, (v) HSP90, and (vi) HSP100. Although considerable progress has been achieved in discriminating HSPs from other proteins, it is still a big challenge to identify HSPs among their six different functional types according to their sequence information alone. With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop a high-throughput computational tool in this regard. To take up such a challenge, a predictor called iHSP-PseRAAAC has been developed by incorporating the reduced amino acid alphabet information into the general form of pseudo amino acid composition. One of the remarkable advantages of introducing the reduced amino acid alphabet is being able to avoid the notorious dimension disaster or overfitting problem in statistical prediction. It was observed that the overall success rate achieved by iHSP-PseRAAAC in identifying the functional types of HSPs among the aforementioned six types was more than 87%, which was derived by the jackknife test on a stringent benchmark dataset in which none of HSPs included has ≥40% pairwise sequence identity to any other in the same subset. It has not escaped our notice that the reduced amino acid alphabet approach can also be used to investigate other protein classification problems. As a user-friendly web server, iHSP-PseRAAAC is accessible to the public at http://lin.uestc.edu.cn/server/iHSP-PseRAAAC.

摘要

热休克蛋白(HSPs)是一类在原核生物和真核生物等所有生物中都存在的功能相关蛋白。它们在蛋白质 - 蛋白质相互作用中发挥着重要作用,例如折叠、协助建立正确的蛋白质构象以及防止不必要的蛋白质聚集。它们的功能失调可能导致各种危及生命的疾病,如帕金森病、阿尔茨海默病和心血管疾病。基于其功能,热休克蛋白通常分为六个家族:(i)HSP20或小热休克蛋白(sHSP),(ii)HSP40或J类蛋白,(iii)HSP60或GroEL/ES,(iv)HSP70,(v)HSP90,以及(vi)HSP100。尽管在区分热休克蛋白与其他蛋白质方面已经取得了相当大的进展,但仅根据序列信息在其六种不同功能类型中识别热休克蛋白仍然是一个巨大的挑战。随着后基因组时代产生的大量蛋白质序列,非常需要开发一种高通量计算工具来应对这一挑战。为了迎接这一挑战,通过将简化氨基酸字母表信息纳入伪氨基酸组成的一般形式,开发了一种名为iHSP - PseRAAAC的预测器。引入简化氨基酸字母表的一个显著优点是能够避免统计预测中臭名昭著的维度灾难或过拟合问题。据观察,iHSP - PseRAAAC在识别上述六种类型的热休克蛋白功能类型方面的总体成功率超过87%,这是通过对一个严格的基准数据集进行留一法检验得出的,在该数据集中,没有任何热休克蛋白与同一子集中的任何其他热休克蛋白具有≥40%的成对序列同一性。我们也注意到简化氨基酸字母表方法还可用于研究其他蛋白质分类问题。作为一个用户友好的网络服务器,公众可通过http://lin.uestc.edu.cn/server/iHSP-PseRAAAC访问iHSP - PseRAAAC。

相似文献

1
iHSP-PseRAAAC: Identifying the heat shock protein families using pseudo reduced amino acid alphabet composition.iHSP-PseRAAAC:利用伪简约氨基酸字母组成鉴定热休克蛋白家族。
Anal Biochem. 2013 Nov 1;442(1):118-25. doi: 10.1016/j.ab.2013.05.024. Epub 2013 Jun 10.
2
Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC.通过将二肽组成纳入周的通用 PseAAC,鉴定热休克蛋白家族和 J 蛋白类型。
Comput Methods Programs Biomed. 2015 Nov;122(2):165-74. doi: 10.1016/j.cmpb.2015.07.005. Epub 2015 Jul 22.
3
GPCR-2L: predicting G protein-coupled receptors and their types by hybridizing two different modes of pseudo amino acid compositions.GPCR-2L:通过两种不同模式的伪氨基酸组成杂交预测G蛋白偶联受体及其类型。
Mol Biosyst. 2011 Mar;7(3):911-9. doi: 10.1039/c0mb00170h. Epub 2010 Dec 23.
4
MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM.MemType-2L:一个通过伪位置特异性得分矩阵整合进化信息来预测膜蛋白及其类型的网络服务器。
Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45. doi: 10.1016/j.bbrc.2007.06.027. Epub 2007 Jun 15.
5
iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.iDNA-Prot|dis:通过将氨基酸距离对和简化字母表概况纳入通用伪氨基酸组成来鉴定DNA结合蛋白。
PLoS One. 2014 Sep 3;9(9):e106691. doi: 10.1371/journal.pone.0106691. eCollection 2014.
6
iDPF-PseRAAAC: A Web-Server for Identifying the Defensin Peptide Family and Subfamily Using Pseudo Reduced Amino Acid Alphabet Composition.iDPF-PseRAAAC:一个使用伪简化氨基酸字母组成来识别防御素肽家族和亚家族的网络服务器。
PLoS One. 2015 Dec 29;10(12):e0145541. doi: 10.1371/journal.pone.0145541. eCollection 2015.
7
Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features.利用组合特征从不平衡数据中识别热休克蛋白家族。
Comput Math Methods Med. 2020 Sep 23;2020:8894478. doi: 10.1155/2020/8894478. eCollection 2020.
8
iTIS-PseTNC: a sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition.iTIS-PseTNC:一种基于序列的预测工具,利用伪三核苷酸组成来识别人类基因中的翻译起始位点。
Anal Biochem. 2014 Oct 1;462:76-83. doi: 10.1016/j.ab.2014.06.022. Epub 2014 Jul 10.
9
iCDI-PseFpt: identify the channel-drug interaction in cellular networking with PseAAC and molecular fingerprints.iCDI-PseFpt:利用 PseAAC 和分子指纹识别细胞网络中的通道药物相互作用。
J Theor Biol. 2013 Nov 21;337:71-9. doi: 10.1016/j.jtbi.2013.08.013. Epub 2013 Aug 26.
10
GPCR-GIA: a web-server for identifying G-protein coupled receptors and their families with grey incidence analysis.GPCR-GIA:一个利用灰色关联分析识别 G 蛋白偶联受体及其家族的网络服务器。
Protein Eng Des Sel. 2009 Nov;22(11):699-705. doi: 10.1093/protein/gzp057. Epub 2009 Sep 22.

引用本文的文献

1
RSLpred2: An Integrated Web Server for the Annotation of Rice Proteome Subcellular Localization Using Deep Learning.RSLpred2:一个使用深度学习对水稻蛋白质组亚细胞定位进行注释的集成网络服务器。
Rice (N Y). 2025 Jul 4;18(1):58. doi: 10.1186/s12284-025-00767-7.
2
AtSubP-2.0: An integrated web server for the annotation of Arabidopsis proteome subcellular localization using deep learning.AtSubP-2.0:一个使用深度学习对拟南芥蛋白质组亚细胞定位进行注释的集成网络服务器。
Plant Genome. 2025 Mar;18(1):e20536. doi: 10.1002/tpg2.20536.
3
A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions.
植物防御中人工智能辅助组学技术综述:当前趋势与未来方向
Front Plant Sci. 2024 Mar 5;15:1292054. doi: 10.3389/fpls.2024.1292054. eCollection 2024.
4
Bioinformatic Analysis of Codon Usage Bias of HSP20 Genes in Four Cruciferous Species.四种十字花科植物HSP20基因密码子使用偏好性的生物信息学分析
Plants (Basel). 2024 Feb 6;13(4):468. doi: 10.3390/plants13040468.
5
Krein support vector machine classification of antimicrobial peptides.抗菌肽的Krein支持向量机分类
Digit Discov. 2023 Feb 27;2(2):502-511. doi: 10.1039/d3dd00004d. eCollection 2023 Apr 11.
6
Research progress of reduced amino acid alphabets in protein analysis and prediction.蛋白质分析与预测中简化氨基酸字母表的研究进展
Comput Struct Biotechnol J. 2022 Jul 4;20:3503-3510. doi: 10.1016/j.csbj.2022.07.001. eCollection 2022.
7
Identification of Inhibitors and Drug Targets for Human Adenovirus Infections.鉴定人腺病毒感染的抑制剂和药物靶点。
Viruses. 2022 May 4;14(5):959. doi: 10.3390/v14050959.
8
XGB-DrugPred: computational prediction of druggable proteins using eXtreme gradient boosting and optimized features set.XGB-DrugPred:使用极端梯度提升和优化特征集的可药物蛋白计算预测。
Sci Rep. 2022 Apr 1;12(1):5505. doi: 10.1038/s41598-022-09484-3.
9
Identification of D Modification Sites Using a Random Forest Model Based on Nucleotide Chemical Properties.基于核苷酸化学性质的随机森林模型鉴定 D 修饰位点。
Int J Mol Sci. 2022 Mar 11;23(6):3044. doi: 10.3390/ijms23063044.
10
IDDLncLoc: Subcellular Localization of LncRNAs Based on a Framework for Imbalanced Data Distributions.IDDLncLoc:基于不平衡数据分布框架的 lncRNAs 亚细胞定位。
Interdiscip Sci. 2022 Jun;14(2):409-420. doi: 10.1007/s12539-021-00497-6. Epub 2022 Feb 22.