Department of Obstetrics and Gynecology, Women and Children's Hospital of Chongqing Medical University, Chongqing 401147, China; Department of Obstetrics and Gynecology, Chongqing Health Center for Women and Children, Chongqing 401147, China.
Chongqing Key Laboratory of Maternal and Fetal Medicine, Chongqing Medical University, Chongqing 400016, China; Joint International Research Laboratory of Reproduction and Development, Chinese Ministry of Education, Chongqing Medical University, 400016, China; Department of Obstetrics, The First Affiliated Hospital of Chongqing Medical University, Chongqing 400016, China.
Int J Biol Macromol. 2024 Mar;262(Pt 2):130180. doi: 10.1016/j.ijbiomac.2024.130180. Epub 2024 Feb 13.
Ferroptosis represents a novel form of programmed cell death. Pan-cancer bioinformatics analysis indicates that identifying and modulating ferroptosis offer innovative approaches for preventing and treating diverse tumor pathologies. However, the precise detection of ferroptosis-related proteins via conventional wet-laboratory techniques remains a formidable challenge, largely due to the constraints of existing methodologies. These traditional approaches are not only labor-intensive but also financially burdensome. Consequently, there is an imperative need for the development of more sophisticated and efficient computational tools to facilitate the detection of these proteins. In this paper, we presented a XGBoost and multi-view features-based machine learning prediction method for predicting ferroptosis-related proteins, which was referred to as FRP-XGBoost. In this study, we explored four types of protein feature extraction methods and evaluated their effectiveness in predicting ferroptosis-related proteins using six of the most commonly used traditional classifiers. To enhance the representational power of the hybrid features, we employed a two-step feature selection technique to identify the optimal subset of features. Subsequently, we constructed a prediction model using the XGBoost algorithm. The FRP-XGBoost achieved an accuracy of 96.74 % in 10-fold cross-validation and a further accuracy of 91.52 % in an independent test. The implementation source code of FRP-XGBoost is available at https://github.com/linli5417/FRP-XGBoost.
铁死亡代表一种新型的程序性细胞死亡方式。泛癌症生物信息学分析表明,鉴定和调节铁死亡为预防和治疗多种肿瘤病理提供了创新方法。然而,通过传统的湿实验室技术精确检测铁死亡相关蛋白仍然是一个巨大的挑战,这主要是由于现有方法学的限制。这些传统方法不仅劳动强度大,而且经济负担重。因此,迫切需要开发更复杂和高效的计算工具来促进这些蛋白的检测。在本文中,我们提出了一种基于 XGBoost 和多视图特征的机器学习预测方法,用于预测铁死亡相关蛋白,称为 FRP-XGBoost。在这项研究中,我们探索了四种类型的蛋白质特征提取方法,并使用六种最常用的传统分类器评估了它们在预测铁死亡相关蛋白中的有效性。为了增强混合特征的表示能力,我们采用了两步特征选择技术来识别最优的特征子集。随后,我们使用 XGBoost 算法构建了一个预测模型。在 10 折交叉验证中,FRP-XGBoost 的准确率达到 96.74%,在独立测试中进一步达到 91.52%。FRP-XGBoost 的实现源代码可在 https://github.com/linli5417/FRP-XGBoost 上获得。