HBPred: a tool to identify growth hormone-binding proteins.

Affiliations

Department of Pathophysiology, Southwest Medical University, Luzhou 646000, China.

Key Laboratory for NeuroInformation of Ministry of Education, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.

Publication information

Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.

DOI:10.7150/ijbs.24174

PMID:29989085

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6036759/

Abstract

Hormone-binding proteins (HBPs) are soluble carrier proteins that interact selectively and non-covalently with hormones. HBPs play an important role in growth, but their function remains unclear. Correctly recognizing HBPs is the first step toward studying their function and understanding the biological processes they take part in. However, identifying HBPs among a rapidly growing number of proteins through traditional biochemical experiments is difficult because such experiments are costly and time-consuming. To overcome these disadvantages, we designed a computational method for accurately identifying HBPs in this study. First, we collected HBP data from UniProt to establish a high-quality benchmark dataset. Based on this dataset, the dipeptide composition was extracted from the HBP residue sequences. To find the optimal features that provide key clues for HBP identification, analysis of variance (ANOVA) was performed for feature ranking, and the optimal features were selected through the incremental feature selection strategy. Subsequently, the features were input into a support vector machine (SVM) to construct the prediction model. Jackknife cross-validation results showed that 88.6% of HBPs and 81.3% of non-HBPs were correctly recognized, suggesting that the proposed model is powerful. This study provides a new strategy for identifying HBPs. Moreover, based on the proposed model, we established a webserver called HBPred, which can be freely accessed at http://lin-group.cn/server/HBPred.
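
The feature-extraction and ranking steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`dipeptide_composition`, `anova_f_score`, `rank_features`) are hypothetical, normalizing by the number of valid residue pairs is an assumption, and the F statistic is the textbook one-way ANOVA ratio of between-class to within-class variance. SVM training, incremental feature selection, and jackknife validation are not shown.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 20 x 20 = 400 ordered residue pairs, in a fixed order.
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(seq):
    """Return the 400 dipeptide frequencies of one protein sequence."""
    seq = seq.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    # Assumption: normalize by the number of valid pairs in the sequence.
    return [counts[d] / total for d in DIPEPTIDES]


def anova_f_score(groups):
    """One-way ANOVA F statistic for one feature.

    `groups` holds one list of feature values per class.
    """
    n_total = sum(len(g) for g in groups)
    k = len(groups)
    means = [sum(g) / len(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / n_total
    between = sum(len(g) * (m - grand_mean) ** 2
                  for g, m in zip(groups, means)) / (k - 1)
    within = sum((x - m) ** 2
                 for g, m in zip(groups, means) for x in g) / (n_total - k)
    if within == 0:
        # A constant feature carries no signal; a perfectly separating one
        # with zero within-class spread is treated as maximally informative.
        return 0.0 if between == 0 else float("inf")
    return between / within


def rank_features(pos_vectors, neg_vectors):
    """Return feature indices sorted by descending F-score."""
    scores = [
        anova_f_score([[v[j] for v in pos_vectors],
                       [v[j] for v in neg_vectors]])
        for j in range(len(DIPEPTIDES))
    ]
    return sorted(range(len(DIPEPTIDES)), key=lambda j: scores[j], reverse=True)
```

Calling `rank_features` on the composition vectors of the positive (HBP) and negative (non-HBP) sequences gives an ordering from which the top-ranked features could be added one at a time and each subset evaluated with a classifier, which is the general idea behind incremental feature selection.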

Similar articles

Identification of hormone binding proteins based on machine learning methods.

Math Biosci Eng. 2019 Mar 22;16(4):2466-2480. doi: 10.3934/mbe.2019123.

SCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptides.

BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S4. doi: 10.1186/1471-2105-15-S16-S4. Epub 2014 Dec 8.

Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes.

Front Genet. 2021 Nov 23;12:797641. doi: 10.3389/fgene.2021.797641. eCollection 2021.

Identifying Antioxidant Proteins by Using Optimal Dipeptide Compositions.

Interdiscip Sci. 2016 Jun;8(2):186-191. doi: 10.1007/s12539-015-0124-9. Epub 2015 Sep 7.

Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis.

Mol Biosyst. 2014 Aug;10(8):2229-35. doi: 10.1039/c4mb00316k.

IonchanPred 2.0: A Tool to Predict Ion Channels and Their Types.

Int J Mol Sci. 2017 Aug 24;18(9):1838. doi: 10.3390/ijms18091838.

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.

iHBPs-VWDC: variable-length window-based dynamic connectivity approach for identifying hormone-binding proteins.

J Biomol Struct Dyn. 2025 Jan;43(1):550-559. doi: 10.1080/07391102.2023.2283150. Epub 2023 Nov 18.

Predicting cancerlectins by the optimal g-gap dipeptides.

Sci Rep. 2015 Dec 9;5:16964. doi: 10.1038/srep16964.

Cited by

Machine learning models based on routine blood and biochemical test data for diagnosis of neurological diseases.

Sci Rep. 2025 Jul 30;15(1):27857. doi: 10.1038/s41598-025-09439-4.

Construction of machine learning diagnostic models for cardiovascular pan-disease based on blood routine and biochemical detection data.

Cardiovasc Diabetol. 2024 Sep 28;23(1):351. doi: 10.1186/s12933-024-02439-0.

ACP-CapsPred: an explainable computational framework for identification and functional prediction of anticancer peptides based on capsule network.

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae460.

Meta-2OM: A multi-classifier meta-model for the accurate prediction of RNA 2'-O-methylation sites in human RNA.

PLoS One. 2024 Jun 26;19(6):e0305406. doi: 10.1371/journal.pone.0305406. eCollection 2024.

Machine Learning in Enhancing Protein Binding Sites Predictions - What Has Changed Since Then?

Comb Chem High Throughput Screen. 2024 Jun 11. doi: 10.2174/0113862073305298240524050145.

Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings.

Front Med (Lausanne). 2024 Jan 17;10:1291352. doi: 10.3389/fmed.2023.1291352. eCollection 2023.

Accurately identifying hemagglutinin using sequence information and machine learning methods.

Front Med (Lausanne). 2023 Oct 31;10:1281880. doi: 10.3389/fmed.2023.1281880. eCollection 2023.

A First Computational Frame for Recognizing Heparin-Binding Protein.

Diagnostics (Basel). 2023 Jul 24;13(14):2465. doi: 10.3390/diagnostics13142465.

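Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.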

m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation.

BMC Biol. 2023 Apr 24;21(1):93. doi: 10.1186/s12915-023-01596-0.

References

PSBinder: A Web Service for Predicting Polystyrene Surface-Binding Peptides.

Biomed Res Int. 2017;2017:5761517. doi: 10.1155/2017/5761517. Epub 2017 Dec 27.

iDNA6mA-PseKNC: Identifying DNA N6-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.

Genomics. 2019 Jan;111(1):96-102. doi: 10.1016/j.ygeno.2018.01.005. Epub 2018 Jan 31.

Predict protein structural class by incorporating two different modes of evolutionary information into Chou's general pseudo amino acid composition.

J Mol Graph Model. 2017 Nov;78:110-117. doi: 10.1016/j.jmgm.2017.10.003. Epub 2017 Oct 7.

ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network.

Molecules. 2017 Oct 17;22(10):1732. doi: 10.3390/molecules22101732.

Adverse Effects of the Metabolic Acidosis of Chronic Kidney Disease.

Adv Chronic Kidney Dis. 2017 Sep;24(5):289-297. doi: 10.1053/j.ackd.2017.06.005.

DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.

Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.

Computational identification of protein S-sulfenylation sites by incorporating the multiple sequence features information.

Mol Biosyst. 2017 Nov 21;13(12):2545-2550. doi: 10.1039/c7mb00491e.

iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties.

Bioinformatics. 2017 Nov 15;33(22):3518-3523. doi: 10.1093/bioinformatics/btx479.

Bi-PSSM: Position specific scoring matrix based intelligent computational model for identification of mycobacterial membrane proteins.

J Theor Biol. 2017 Dec 21;435:116-124. doi: 10.1016/j.jtbi.2017.09.013. Epub 2017 Sep 18.

POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles.

Bioinformatics. 2017 Sep 1;33(17):2756-2758. doi: 10.1093/bioinformatics/btx302.
