Perveen Gulnaz, Alturise Fahad, Alkhalifah Tamim, Daanial Khan Yaser
Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Punjab, Pakistan.
Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Buraidah, Qassim, Saudi Arabia.
Digit Health. 2023 Jul 5;9:20552076231180739. doi: 10.1177/20552076231180739. eCollection 2023 Jan-Dec.
The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information.
Primary sequences were transformed into feature vectors using statistical and position-relative moment-based features. Various machine learning algorithms were employed for classification. The computational models were rigorously evaluated using four different validation methods. The Hemolytic-Pred webserver is available for further analysis at http://ec2-54-160-229-10.compute-1.amazonaws.com/.
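As a rough illustration of how a primary sequence can be turned into a fixed-length feature vector of this kind, the following minimal Python sketch combines a frequency vector, a position-relative vector, and a few raw and central statistical moments. The abstract does not specify the exact moments or encodings used, so the residue-to-integer encoding, the moment orders, and all function names here (`frequency_vector`, `position_sum_vector`, `raw_and_central_moments`, `featurize`) are assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
# Assumption: an arbitrary 1-based integer encoding of residues.
AA_INDEX = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}


def frequency_vector(seq):
    """Frequency-relative information: count of each standard residue."""
    counts = Counter(seq)
    return [counts.get(aa, 0) for aa in AMINO_ACIDS]


def position_sum_vector(seq):
    """Position-relative information: sum of the 1-based positions
    at which each standard residue occurs."""
    sums = dict.fromkeys(AMINO_ACIDS, 0)
    for pos, aa in enumerate(seq, start=1):
        if aa in sums:
            sums[aa] += pos
    return [sums[aa] for aa in AMINO_ACIDS]


def raw_and_central_moments(seq, max_order=3):
    """Raw and central moments (orders 1..max_order) of the encoded sequence."""
    values = [AA_INDEX[aa] for aa in seq if aa in AA_INDEX]
    n = len(values)
    if n == 0:
        return [0.0] * (2 * max_order)
    mean = sum(values) / n
    orders = range(1, max_order + 1)
    raw = [sum(v ** k for v in values) / n for k in orders]
    central = [sum((v - mean) ** k for v in values) / n for k in orders]
    return raw + central


def featurize(seq):
    """Concatenate all feature groups into one fixed-length vector."""
    seq = seq.upper()
    return (frequency_vector(seq)
            + position_sum_vector(seq)
            + raw_and_central_moments(seq))


if __name__ == "__main__":
    features = featurize("ACCA")
    print(len(features))  # 20 + 20 + 6 values
```

Because every sequence maps to a vector of the same length regardless of its own length, the output can be passed directly to a standard classifier.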
XGBoost outperformed the other six classifiers, with accuracy values of 0.99, 0.98, 0.97, and 0.98 for the self-consistency test, 10-fold cross-validation, Jackknife test, and independent set test, respectively. The proposed method with the XGBoost classifier is a workable and robust solution for predicting hemolytic proteins efficiently and accurately.
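The resampling-based schemes named above differ mainly in how samples are split into training and test sets. The sketch below shows one way to generate 10-fold and jackknife (leave-one-out) splits and to pool accuracy over them; a self-consistency test would instead train and test on the same data, and an independent set test would use a separate held-out set. The helper names and the majority-class stand-in model are hypothetical and are used only so the example runs without external libraries; in practice a classifier such as XGBoost would take the place of `fit_majority` and `predict_majority`.

```python
from collections import Counter


def k_fold_indices(n_samples, k=10):
    """Yield (train, test) index lists for k contiguous folds.
    Assumption: samples are already shuffled by the caller."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


def jackknife_indices(n_samples):
    """Jackknife (leave-one-out): each sample is the test set exactly once."""
    return k_fold_indices(n_samples, k=n_samples)


def fit_majority(X_train, y_train):
    """Hypothetical stand-in model: remembers the most common training label."""
    return Counter(y_train).most_common(1)[0][0]


def predict_majority(model, X_test):
    """Predict the remembered label for every test sample."""
    return [model for _ in X_test]


def pooled_accuracy(X, y, splits, fit=fit_majority, predict=predict_majority):
    """Train on each split's training part and pool correct predictions."""
    correct = total = 0
    for train, test in splits:
        model = fit([X[i] for i in train], [y[i] for i in train])
        preds = predict(model, [X[i] for i in test])
        correct += sum(p == y[i] for p, i in zip(preds, test))
        total += len(test)
    return correct / total


if __name__ == "__main__":
    X = [[0], [1], [2], [3]]  # toy feature vectors
    y = [1, 1, 1, 0]          # toy labels
    print(pooled_accuracy(X, y, jackknife_indices(len(y))))
```

The jackknife is simply the limiting case of k-fold cross-validation with k equal to the number of samples, which is why it can reuse the same split generator.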
The proposed method of Hemolytic-Pred with XGBoost classifier is a reliable tool for the timely identification of hemolytic cells and diagnosis of various related severe disorders. The application of Hemolytic-Pred can yield profound benefits in the medical field.