Yadalam Pradeep Kumar, Arumuganainar Deepavalli, Natarajan Prabhu Manickam, Ardila Carlos M
Department of Periodontics, Saveetha Institute of Medical and Technology sciences, Saveetha Dental College, SIMATS, Saveetha University, Chennai, Tamil Nadu, India.
Department of Periodontics, Saveetha Institute of Medical and Technical Sciences, Saveetha Dental College and Hospital, Saveetha University, Chennai, 600077, Tamil Nadu, India.
Sci Rep. 2025 Feb 3;15(1):4068. doi: 10.1038/s41598-025-88819-2.
Understanding shared pathways and mechanisms involved in the pathogenesis of diseases like oral squamous cell carcinoma (OSCC) and COVID-19 could lead to the development of novel therapeutic strategies and diagnostic biomarkers. This study aims to predict the interactome of OSCC and COVID-19 based on salivary inflammatory proteins. Datasets for OSCC and COVID-19 were obtained from https://www.salivaryproteome.org/differential-expression and selected for differential gene expression analysis. Differential gene expression analysis was performed using log transformation and a fold change of two. Hub proteins were identified using Cytoscape and Cytohubba, and machine learning algorithms including naïve Bayes, neural networks, gradient boosting, and random forest were used to predict hub genes. Top hub genes identified included ALDH1A1, MT-CO2, SERPINC1, FGB, and TF. The random forest model achieved the highest accuracy (93%) and class accuracy (84%). The naive Bayes model had lower accuracy (63%) and class accuracy (66%), while the neural network model showed 55% accuracy and class accuracy, possibly due to data pre-processing issues. The gradient boosting model outperformed all models with an accuracy of 95% and class accuracy of 95%. Salivary proteomic interactome analysis revealed novel hub proteins as potential common biomarkers.
了解口腔鳞状细胞癌(OSCC)和COVID-19等疾病发病机制中涉及的共同途径和机制,可能会促成新型治疗策略和诊断生物标志物的开发。本研究旨在基于唾液炎症蛋白预测OSCC和COVID-19的相互作用组。OSCC和COVID-19的数据集从https://www.salivaryproteome.org/differential-expression获取,并选择用于差异基因表达分析。使用对数转换和两倍的变化倍数进行差异基因表达分析。使用Cytoscape和Cytohubba鉴定枢纽蛋白,并使用包括朴素贝叶斯、神经网络、梯度提升和随机森林在内的机器学习算法预测枢纽基因。确定的顶级枢纽基因包括ALDH1A1、MT-CO2、SERPINC1、FGB和TF。随机森林模型实现了最高的准确率(93%)和类别准确率(84%)。朴素贝叶斯模型的准确率(63%)和类别准确率(66%)较低,而神经网络模型的准确率和类别准确率为55%,这可能是由于数据预处理问题。梯度提升模型的表现优于所有模型,准确率为95%,类别准确率为95%。唾液蛋白质组相互作用组分析揭示了新型枢纽蛋白作为潜在的共同生物标志物。