• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于激素结合蛋白预测的集成学习:血清甲状腺激素紊乱早期诊断的一种有前景的方法。

Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.

作者信息

Butt Ahmad Hassan, Alkhalifah Tamim, Alturise Fahad, Khan Yaser Daanial

机构信息

Department of Computer Science, Faculty of Computing & Information Technology, University of the Punjab, Lahore 54000, Pakistan.

Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Ar Rass 51921, Qassim, Saudi Arabia.

出版信息

Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.

DOI:10.3390/diagnostics13111940
PMID:37296792
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10252793/
Abstract

Hormone-binding proteins (HBPs) are specific carrier proteins that bind to a given hormone. A soluble carrier hormone binding protein (HBP), which can interact non-covalently and specifically with growth hormone, modulates or inhibits hormone signaling. HBP is essential for the growth of life, despite still being poorly understood. Several diseases, according to some data, are caused by HBPs that express themselves abnormally. Accurate identification of these molecules is the first step in investigating the roles of HBPs and understanding their biological mechanisms. For a better understanding of cell development and cellular mechanisms, accurate HBP determination from a given protein sequence is essential. Using traditional biochemical experiments, it is difficult to correctly separate HBPs from an increasing number of proteins because of the high experimental costs and lengthy experiment periods. The abundance of protein sequence data that has been gathered in the post-genomic era necessitates a computational method that is automated and enables quick and accurate identification of putative HBPs within a large number of candidate proteins. A brand-new machine-learning-based predictor is suggested as the HBP identification method. To produce the desirable feature set for the method proposed, statistical moment-based features and amino acids were combined, and the random forest was used to train the feature set. During 5-fold cross validation experiments, the suggested method achieved 94.37% accuracy and 0.9438 F1-scores, respectively, demonstrating the importance of the Hahn moment-based features.

摘要

激素结合蛋白(HBPs)是一类与特定激素结合的特异性载体蛋白。一种可溶性载体激素结合蛋白(HBP),它能与生长激素进行非共价且特异性的相互作用,从而调节或抑制激素信号传导。尽管人们对HBP的了解仍然有限,但它对生命的生长至关重要。根据一些数据,几种疾病是由异常表达的HBP引起的。准确识别这些分子是研究HBP的作用及其生物学机制的第一步。为了更好地理解细胞发育和细胞机制,从给定的蛋白质序列中准确测定HBP至关重要。使用传统的生化实验,由于实验成本高和实验周期长,很难从越来越多的蛋白质中正确分离出HBP。在后基因组时代积累的大量蛋白质序列数据需要一种自动化的计算方法,以便能够在大量候选蛋白质中快速准确地识别出假定的HBP。一种全新的基于机器学习的预测器被建议作为HBP的识别方法。为了为所提出的方法生成理想的特征集,将基于统计矩的特征和氨基酸相结合,并使用随机森林对该特征集进行训练。在5折交叉验证实验中,所提出的方法分别达到了94.37%的准确率和0.9438的F1分数,证明了基于哈恩矩的特征的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/d6d7c6af9abb/diagnostics-13-01940-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/ac0f589a1d89/diagnostics-13-01940-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/25a06019f85b/diagnostics-13-01940-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/cd736c1ede70/diagnostics-13-01940-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/9481b223c041/diagnostics-13-01940-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/f2bd9cf3f7d5/diagnostics-13-01940-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/7c11ea96368b/diagnostics-13-01940-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/7432116d3f62/diagnostics-13-01940-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/e6063ef4988e/diagnostics-13-01940-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/e5c08dee259c/diagnostics-13-01940-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/d6d7c6af9abb/diagnostics-13-01940-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/ac0f589a1d89/diagnostics-13-01940-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/25a06019f85b/diagnostics-13-01940-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/cd736c1ede70/diagnostics-13-01940-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/9481b223c041/diagnostics-13-01940-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/f2bd9cf3f7d5/diagnostics-13-01940-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/7c11ea96368b/diagnostics-13-01940-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/7432116d3f62/diagnostics-13-01940-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/e6063ef4988e/diagnostics-13-01940-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/e5c08dee259c/diagnostics-13-01940-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1b3/10252793/d6d7c6af9abb/diagnostics-13-01940-g010.jpg

相似文献

1
Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.用于激素结合蛋白预测的集成学习:血清甲状腺激素紊乱早期诊断的一种有前景的方法。
Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.
2
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes.基于K-mer特征表示和朴素贝叶斯的激素结合蛋白预测
Front Genet. 2021 Nov 23;12:797641. doi: 10.3389/fgene.2021.797641. eCollection 2021.
3
HBPred: a tool to identify growth hormone-binding proteins.HBPred:一种识别生长激素结合蛋白的工具。
Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.
4
iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree.iGHBP:使用极端随机树从序列中对生长激素结合蛋白进行计算识别。
Comput Struct Biotechnol J. 2018 Oct 24;16:412-420. doi: 10.1016/j.csbj.2018.10.007. eCollection 2018.
5
Empirical comparison and recent advances of computational prediction of hormone binding proteins using machine learning methods.使用机器学习方法对激素结合蛋白进行计算预测的实证比较与最新进展
Comput Struct Biotechnol J. 2023 Mar 17;21:2253-2261. doi: 10.1016/j.csbj.2023.03.024. eCollection 2023.
6
Identification of hormone binding proteins based on machine learning methods.基于机器学习方法的激素结合蛋白鉴定
Math Biosci Eng. 2019 Mar 22;16(4):2466-2480. doi: 10.3934/mbe.2019123.
7
SCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptides.SCMHBP:利用二肽倾向得分预测和分析血红素结合蛋白
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S4. doi: 10.1186/1471-2105-15-S16-S4. Epub 2014 Dec 8.
8
iHBPs-VWDC: variable-length window-based dynamic connectivity approach for identifying hormone-binding proteins.iHBPs-VWDC:用于识别激素结合蛋白的基于可变长度窗口的动态连通性方法
J Biomol Struct Dyn. 2025 Jan;43(1):550-559. doi: 10.1080/07391102.2023.2283150. Epub 2023 Nov 18.
9
A representation transfer learning approach for enhanced prediction of growth hormone binding proteins.一种用于增强生长激素结合蛋白预测的表征迁移学习方法。
Comput Biol Chem. 2020 May 5;87:107274. doi: 10.1016/j.compbiolchem.2020.107274.
10
Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.基于概率潜在语义索引的核转位信号预测核蛋白。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.

引用本文的文献

1
Machine learning-based migraine analysis using retinal vessel diameters from optical coherence tomography: an alternative approach.基于机器学习的偏头痛分析:利用光学相干断层扫描技术获取的视网膜血管直径进行分析的一种替代方法。
Neurol Sci. 2025 Sep 17. doi: 10.1007/s10072-025-08462-7.
2
Diaproteo: A supervised learning framework for early detection of diabetes mellitus based on proteomic profiles.Diaproteo:一种基于蛋白质组学图谱的糖尿病早期检测监督学习框架。
Digit Health. 2025 Jul 30;11:20552076251362281. doi: 10.1177/20552076251362281. eCollection 2025 Jan-Dec.
3
TNFR-LSTM: A Deep Intelligent Model for Identification of Tumour Necroses Factor Receptor (TNFR) Activity.

本文引用的文献

1
iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.iSUMOK-PseAAC:利用统计矩和周氏伪氨基酸组成预测赖氨酸的类泛素化位点
PeerJ. 2021 Aug 4;9:e11581. doi: 10.7717/peerj.11581. eCollection 2021.
2
iGluK-Deep: computational identification of lysine glutarylation sites using deep neural networks with general pseudo amino acid compositions.iGluK-Deep:利用具有通用伪氨基酸组成的深度神经网络对赖氨酸戊二酰化位点进行计算识别。
J Biomol Struct Dyn. 2022;40(22):11691-11704. doi: 10.1080/07391102.2021.1962738. Epub 2021 Aug 16.
3
Evaluating machine learning methodologies for identification of cancer driver genes.
TNFR-LSTM:一种用于识别肿瘤坏死因子受体(TNFR)活性的深度智能模型。
IET Syst Biol. 2025 Jan-Dec;19(1):e70007. doi: 10.1049/syb2.70007.
4
A novel meta learning based stacked approach for diagnosis of thyroid syndrome.一种基于元学习的新型堆叠方法用于甲状腺综合征的诊断。
PLoS One. 2024 Nov 1;19(11):e0312313. doi: 10.1371/journal.pone.0312313. eCollection 2024.
5
BBB-PEP-prediction: improved computational model for identification of blood-brain barrier peptides using blending position relative composition specific features and ensemble modeling.血脑屏障肽预测:利用混合位置相对组成特异性特征和集成建模改进的血脑屏障肽识别计算模型。
J Cheminform. 2023 Nov 18;15(1):110. doi: 10.1186/s13321-023-00773-1.
评估用于识别癌症驱动基因的机器学习方法。
Sci Rep. 2021 Jun 10;11(1):12281. doi: 10.1038/s41598-021-91656-8.
4
Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.通过比较人类工程特征和深度表示来优化蛋白质丝氨酸磷酸化预测。
Anal Biochem. 2021 Feb 15;615:114069. doi: 10.1016/j.ab.2020.114069. Epub 2020 Dec 16.
5
iPhosS(Deep)-PseAAC: Identification of Phosphoserine Sites in Proteins Using Deep Learning on General Pseudo Amino Acid Compositions.iPhosS(Deep)-PseAAC:基于广义伪氨基酸组成的深度学习算法鉴定蛋白质磷酸丝氨酸位点
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1703-1714. doi: 10.1109/TCBB.2020.3040747. Epub 2022 Jun 3.
6
iHyd-LysSite (EPSV): Identifying Hydroxylysine Sites in Protein Using Statistical Formulation by Extracting Enhanced Position and Sequence Variant Feature Technique.iHyd-LysSite(EPSV):通过提取增强位置和序列变异特征技术,使用统计公式识别蛋白质中的羟赖氨酸位点。
Curr Genomics. 2020 Nov;21(7):536-545. doi: 10.2174/1389202921999200831142629.
7
Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点
Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.
8
A representation transfer learning approach for enhanced prediction of growth hormone binding proteins.一种用于增强生长激素结合蛋白预测的表征迁移学习方法。
Comput Biol Chem. 2020 May 5;87:107274. doi: 10.1016/j.compbiolchem.2020.107274.
9
A Sequence-Based Predictor of Zika Virus Proteins Developed by Integration of PseAAC and Statistical Moments.基于序列的 Zika 病毒蛋白预测器的开发,通过 PseAAC 与统计矩的整合。
Comb Chem High Throughput Screen. 2020;23(8):797-804. doi: 10.2174/1386207323666200428115449.
10
iSulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments Chou's 5-steps Rule and Pseudo Components.iSulfoTyr-PseAAC:通过结合统计矩、周氏五步法则和伪组分来识别酪氨酸硫酸化位点
Curr Genomics. 2019 May;20(4):306-320. doi: 10.2174/1389202920666190819091609.