Department of Occupational and Environmental Health, College of Public Health, Zhengzhou University, Zhengzhou, China.
The Key Laboratory of Nanomedicine and Health Inspection of Zhengzhou, Zhengzhou University, Zhengzhou, China.
BMC Cancer. 2024 Jan 10;24(1):52. doi: 10.1186/s12885-024-11830-9.
BACKGROUND: As biomarkers, microRNAs (miRNAs) are closely associated with the occurrence, progression, and prognosis of non-small cell lung cancer (NSCLC). However, the prognostic predictive value of miRNAs in NSCLC has rarely been explored. In this study, the value in prognosis prediction of NSCLC was mined based on data mining models using clinical data and plasma miRNAs biomarkers. METHODS: A total of 69 patients were included in this prospective cohort study. After informed consent, they filled out questionnaires and had their peripheral blood collected. The expressions of plasma miRNAs were examined by quantitative polymerase chain reaction (qPCR). The Whitney U test was used to analyze non-normally distributed data. Kaplan-Meier was used to plot the survival curve, the log-rank test was used to compare with the overall survival curve, and the Cox proportional hazards model was used to screen the factors related to the prognosis of lung cancer. Data mining techniques were utilized to predict the prognostic status of patients. RESULTS: We identified that smoking (HR = 2.406, 95% CI = 1.256-4.611), clinical stage III + IV (HR = 5.389, 95% CI = 2.290-12.684), the high expression group of miR-20a (HR = 4.420, 95% CI = 1.760-11.100), the high expression group of miR-197 (HR = 3.828, 95% CI = 1.778-8.245), the low expression group of miR-145 ( HR = 0.286, 95% CI = 0.116-0.709), and the low expression group of miR-30a (HR = 0.307, 95% CI = 0.133-0.706) was associated with worse prognosis. Among the five data mining models, the decision trees (DT) C5.0 model performs the best, with accuracy and Area Under Curve (AUC) of 93.75% and 0.929 (0.685, 0.997), respectively. CONCLUSION: The results showed that the high expression level of miR-20a and miR-197, the low expression level of miR-145 and miR-30a were strongly associated with poorer prognosis in NSCLC patients, and the DT C5.0 model may serve as a novel, accurate, method for predicting prognosis of NSCLC.
背景:微小 RNA(miRNA)作为生物标志物与非小细胞肺癌(NSCLC)的发生、进展和预后密切相关。然而,miRNA 对 NSCLC 的预后预测价值很少被探索。本研究基于临床数据和血浆 miRNA 生物标志物,利用数据挖掘模型挖掘 NSCLC 预后预测的价值。
方法:本前瞻性队列研究共纳入 69 例患者。在知情同意后,他们填写问卷并采集外周血。通过实时定量聚合酶链反应(qPCR)检测血浆 miRNA 的表达。采用非正态分布数据的 Whitney U 检验。Kaplan-Meier 用于绘制生存曲线,对数秩检验用于比较总生存曲线,Cox 比例风险模型用于筛选与肺癌预后相关的因素。数据挖掘技术用于预测患者的预后状态。
结果:我们发现吸烟(HR=2.406,95%CI=1.256-4.611)、临床分期 III+IV(HR=5.389,95%CI=2.290-12.684)、miR-20a 高表达组(HR=4.420,95%CI=1.760-11.100)、miR-197 高表达组(HR=3.828,95%CI=1.778-8.245)、miR-145 低表达组(HR=0.286,95%CI=0.116-0.709)和 miR-30a 低表达组(HR=0.307,95%CI=0.133-0.706)与预后不良相关。在五个数据挖掘模型中,决策树(DT)C5.0 模型表现最好,准确性和曲线下面积(AUC)分别为 93.75%和 0.929(0.685,0.997)。
结论:结果表明,miR-20a 和 miR-197 的高表达水平、miR-145 和 miR-30a 的低表达水平与 NSCLC 患者的预后不良密切相关,DT C5.0 模型可能成为预测 NSCLC 预后的一种新的、准确的方法。
Int J Clin Exp Pathol. 2014-6-15
Tumour Biol. 2016-9
Cancer Biol Med. 2020-8-15
BMC Pulm Med. 2025-1-25
Front Artif Intell. 2023-7-26
Surg Pathol Clin. 2021-9
Front Oncol. 2021-4-20
Thorac Cancer. 2020-3
Thorac Cancer. 2018-11-28
Cell Physiol Biochem. 2018