用于激素结合蛋白预测的集成学习：血清甲状腺激素紊乱早期诊断的一种有前景的方法。

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

作者信息

Butt Ahmad Hassan, Alkhalifah Tamim, Alturise Fahad, Khan Yaser Daanial

机构信息

Department of Computer Science, Faculty of Computing & Information Technology, University of the Punjab, Lahore 54000, Pakistan.

Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Ar Rass 51921, Qassim, Saudi Arabia.

出版信息

Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.

DOI:10.3390/diagnostics13111940

PMID:37296792

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10252793/

Abstract

Hormone-binding proteins (HBPs) are specific carrier proteins that bind to a given hormone. A soluble carrier hormone binding protein (HBP), which can interact non-covalently and specifically with growth hormone, modulates or inhibits hormone signaling. HBP is essential for the growth of life, despite still being poorly understood. Several diseases, according to some data, are caused by HBPs that express themselves abnormally. Accurate identification of these molecules is the first step in investigating the roles of HBPs and understanding their biological mechanisms. For a better understanding of cell development and cellular mechanisms, accurate HBP determination from a given protein sequence is essential. Using traditional biochemical experiments, it is difficult to correctly separate HBPs from an increasing number of proteins because of the high experimental costs and lengthy experiment periods. The abundance of protein sequence data that has been gathered in the post-genomic era necessitates a computational method that is automated and enables quick and accurate identification of putative HBPs within a large number of candidate proteins. A brand-new machine-learning-based predictor is suggested as the HBP identification method. To produce the desirable feature set for the method proposed, statistical moment-based features and amino acids were combined, and the random forest was used to train the feature set. During 5-fold cross validation experiments, the suggested method achieved 94.37% accuracy and 0.9438 F1-scores, respectively, demonstrating the importance of the Hahn moment-based features.

摘要

激素结合蛋白（HBPs）是一类与特定激素结合的特异性载体蛋白。一种可溶性载体激素结合蛋白（HBP），它能与生长激素进行非共价且特异性的相互作用，从而调节或抑制激素信号传导。尽管人们对HBP的了解仍然有限，但它对生命的生长至关重要。根据一些数据，几种疾病是由异常表达的HBP引起的。准确识别这些分子是研究HBP的作用及其生物学机制的第一步。为了更好地理解细胞发育和细胞机制，从给定的蛋白质序列中准确测定HBP至关重要。使用传统的生化实验，由于实验成本高和实验周期长，很难从越来越多的蛋白质中正确分离出HBP。在后基因组时代积累的大量蛋白质序列数据需要一种自动化的计算方法，以便能够在大量候选蛋白质中快速准确地识别出假定的HBP。一种全新的基于机器学习的预测器被建议作为HBP的识别方法。为了为所提出的方法生成理想的特征集，将基于统计矩的特征和氨基酸相结合，并使用随机森林对该特征集进行训练。在5折交叉验证实验中，所提出的方法分别达到了94.37%的准确率和0.9438的F1分数，证明了基于哈恩矩的特征的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/ac0f589a1d89/diagnostics-13-01940-g001.jpg

相似文献

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.

Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes.

Front Genet. 2021 Nov 23;12:797641. doi: 10.3389/fgene.2021.797641. eCollection 2021.

HBPred: a tool to identify growth hormone-binding proteins.

Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.

iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree.

Comput Struct Biotechnol J. 2018 Oct 24;16:412-420. doi: 10.1016/j.csbj.2018.10.007. eCollection 2018.

Empirical comparison and recent advances of computational prediction of hormone binding proteins using machine learning methods.

Comput Struct Biotechnol J. 2023 Mar 17;21:2253-2261. doi: 10.1016/j.csbj.2023.03.024. eCollection 2023.

Identification of hormone binding proteins based on machine learning methods.

Math Biosci Eng. 2019 Mar 22;16(4):2466-2480. doi: 10.3934/mbe.2019123.

SCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptides.

BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S4. doi: 10.1186/1471-2105-15-S16-S4. Epub 2014 Dec 8.

iHBPs-VWDC: variable-length window-based dynamic connectivity approach for identifying hormone-binding proteins.

J Biomol Struct Dyn. 2025 Jan;43(1):550-559. doi: 10.1080/07391102.2023.2283150. Epub 2023 Nov 18.

A representation transfer learning approach for enhanced prediction of growth hormone binding proteins.

Comput Biol Chem. 2020 May 5;87:107274. doi: 10.1016/j.compbiolchem.2020.107274.

Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.

BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.

引用本文的文献

Machine learning-based migraine analysis using retinal vessel diameters from optical coherence tomography: an alternative approach.

Neurol Sci. 2025 Sep 17. doi: 10.1007/s10072-025-08462-7.

Diaproteo: A supervised learning framework for early detection of diabetes mellitus based on proteomic profiles.

Digit Health. 2025 Jul 30;11:20552076251362281. doi: 10.1177/20552076251362281. eCollection 2025 Jan-Dec.

TNFR-LSTM: A Deep Intelligent Model for Identification of Tumour Necroses Factor Receptor (TNFR) Activity.

IET Syst Biol. 2025 Jan-Dec;19(1):e70007. doi: 10.1049/syb2.70007.

A novel meta learning based stacked approach for diagnosis of thyroid syndrome.

PLoS One. 2024 Nov 1;19(11):e0312313. doi: 10.1371/journal.pone.0312313. eCollection 2024.

BBB-PEP-prediction: improved computational model for identification of blood-brain barrier peptides using blending position relative composition specific features and ensemble modeling.

J Cheminform. 2023 Nov 18;15(1):110. doi: 10.1186/s13321-023-00773-1.

本文引用的文献

iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.

PeerJ. 2021 Aug 4;9:e11581. doi: 10.7717/peerj.11581. eCollection 2021.

iGluK-Deep: computational identification of lysine glutarylation sites using deep neural networks with general pseudo amino acid compositions.

J Biomol Struct Dyn. 2022;40(22):11691-11704. doi: 10.1080/07391102.2021.1962738. Epub 2021 Aug 16.

Evaluating machine learning methodologies for identification of cancer driver genes.

Sci Rep. 2021 Jun 10;11(1):12281. doi: 10.1038/s41598-021-91656-8.

Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.

Anal Biochem. 2021 Feb 15;615:114069. doi: 10.1016/j.ab.2020.114069. Epub 2020 Dec 16.

iPhosS(Deep)-PseAAC: Identification of Phosphoserine Sites in Proteins Using Deep Learning on General Pseudo Amino Acid Compositions.

IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1703-1714. doi: 10.1109/TCBB.2020.3040747. Epub 2022 Jun 3.

iHyd-LysSite (EPSV): Identifying Hydroxylysine Sites in Protein Using Statistical Formulation by Extracting Enhanced Position and Sequence Variant Feature Technique.

Curr Genomics. 2020 Nov;21(7):536-545. doi: 10.2174/1389202921999200831142629.

Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.

Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.

A representation transfer learning approach for enhanced prediction of growth hormone binding proteins.

Comput Biol Chem. 2020 May 5;87:107274. doi: 10.1016/j.compbiolchem.2020.107274.

A Sequence-Based Predictor of Zika Virus Proteins Developed by Integration of PseAAC and Statistical Moments.

Comb Chem High Throughput Screen. 2020;23(8):797-804. doi: 10.2174/1386207323666200428115449.

iSulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments Chou's 5-steps Rule and Pseudo Components.

Curr Genomics. 2019 May;20(4):306-320. doi: 10.2174/1389202920666190819091609.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于激素结合蛋白预测的集成学习：血清甲状腺激素紊乱早期诊断的一种有前景的方法。

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

作者信息

Butt Ahmad Hassan, Alkhalifah Tamim, Alturise Fahad, Khan Yaser Daanial

机构信息

Department of Computer Science, Faculty of Computing & Information Technology, University of the Punjab, Lahore 54000, Pakistan.

Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Ar Rass 51921, Qassim, Saudi Arabia.

出版信息

Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.

DOI:10.3390/diagnostics13111940

PMID:37296792

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10252793/

Abstract

摘要

用于激素结合蛋白预测的集成学习：血清甲状腺激素紊乱早期诊断的一种有前景的方法。

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于激素结合蛋白预测的集成学习：血清甲状腺激素紊乱早期诊断的一种有前景的方法。

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献