Valizadeh Laktarashi Hossein, Rahimi Milad, Abrishamifar Kimia, Mahmoudabadi Ali, Nazari Elham
Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
Department of Health Information Technology, School of Allied Medical Sciences, Urmia University of Medical Sciences, Urmia, Iran.
Med J Islam Repub Iran. 2025 Jan 6;39:4. doi: 10.47176/mjiri.39.4. eCollection 2025.
Uterine corpus endometrial cancer (UCEC) is known as the sixth most common cancer in the world. Advances in bioinformatics and deep learning have provided the 2 tools for screening large-scale genomic data and discovering potential biomarkers indicative of disease states. This study aimed to investigate the identification of important genes for diagnosis and prognosis in the uterus using bioinformatics and machine learning algorithms.
RNA expression profiles of UECE patients were analyzed to identify differentially expressed genes (DEGs) using deep learning techniques. Prognostic biomarkers were assessed through survival curve analysis utilizing COMBIO-ROC. Additionally, molecular pathways, protein-protein interaction (PPI) networks, co-expression patterns of DEGs, and their associations with clinical data were thoroughly examined. Ultimately, diagnostic markers were determined through deep learning-based analyses.
According to our findings, MEX3B, CTRP2 (C1QTNF2), and AASS are new biomarkers for UCEC. The evaluation metrics demonstrate the deep learning model's (DNN) efficacy, with a minimal mean squared error (MSE) of 5.1096067E-5 and a root mean squared error (RMSE) of 0.007, indicative of accurate predictions. The R-squared value of 0.99 underscores the model's ability to explain a substantial portion of the variance in the data. Thus, the model achieves a perfect area under the curve (AUC) of 1, signifying exceptional discrimination ability, and an accuracy rate of 97%.
The GDCA database and deep learning algorithms identified 3 significant genes -MEX3B, CTRP2 (C1QTNF2), and AASS-as potential diagnosis biomarkers of UCEC. Thus, identifying new UCEC biomarkers has promise for effective care, improved prognosis, and early diagnosis.
子宫内膜癌(UCEC)是世界上第六大常见癌症。生物信息学和深度学习的进展为筛选大规模基因组数据和发现指示疾病状态的潜在生物标志物提供了两种工具。本研究旨在利用生物信息学和机器学习算法研究子宫中用于诊断和预后的重要基因的识别。
使用深度学习技术分析UCEC患者的RNA表达谱,以识别差异表达基因(DEG)。通过利用COMBIO-ROC的生存曲线分析评估预后生物标志物。此外,还深入研究了分子途径、蛋白质-蛋白质相互作用(PPI)网络、DEG的共表达模式及其与临床数据的关联。最终,通过基于深度学习的分析确定诊断标志物。
根据我们的研究结果,MEX3B、CTRP2(C1QTNF2)和AASS是UCEC的新生物标志物。评估指标证明了深度学习模型(DNN)的有效性,最小均方误差(MSE)为5.1096067E-5,均方根误差(RMSE)为0.007,表明预测准确。R平方值为0.99强调了该模型解释数据中大部分方差的能力。因此,该模型实现了完美的曲线下面积(AUC)为1,表明具有出色的区分能力,准确率为97%。
GDCA数据库和深度学习算法确定了3个重要基因——MEX3B、CTRP2(C1QTNF2)和AASS——作为UCEC的潜在诊断生物标志物。因此,识别新的UCEC生物标志物有望实现有效的护理、改善预后和早期诊断。