Li Rang, Han Kedong, Xu Dehua, Chen Xiaolin, Lan Shujin, Liao Yuanjun, Sun Shengnan, Rao Shaoqi
Institute of Medical Systems Biology, School of Public Health, Guangdong Medical University, Dongguan, China.
Department of Cardiology, Maoming People's Hospital, Maoming, China.
Front Genet. 2021 Jan 28;11:625977. doi: 10.3389/fgene.2020.625977. eCollection 2020.
Early and precise prediction is an important way to reduce the poor prognosis of lung adenocarcinoma (LUAD) patients. Nevertheless, the widely used tumor, node, and metastasis (TNM) staging system based on anatomical information only often could not achieve adequate performance on foreseeing the prognosis of LUAD patients. This study thus aimed to examine whether the long non-coding RNAs (lncRNAs), known highly involved in the tumorigenesis of LUAD through the competing endogenous RNAs (ceRNAs) mechanism, could provide additional information to improve prognosis prediction of LUAD patients. To prove the hypothesis, a dataset consisting of both RNA sequencing data and clinical pathological data, obtained from The Cancer Genome Atlas (TCGA) database, was analyzed. Then, differentially expressed RNAs (DElncRNAs, DEmiRNAs, and DEmRNAs) were identified and a lncRNA-miRNA-mRNA ceRNA network was constructed based on those differentially expressed RNAs. Functional enrichment analysis revealed that this ceRNA network was highly enriched in some cancer-associated signaling pathways. Next, lasso-Cox model was run 1,000 times to recognize the potential survival-related combinations of the candidate lncRNAs in the ceRNA network, followed by the "best subset selection" to further optimize these lncRNA-based combinations, and a seven-lncRNA prognostic signature with the best performance was determined. Based on the median risk score, LUAD patients could be well distinguished into high-/low-risk subgroups. The Kaplan-Meier survival curve showed that LUAD patients in the high-risk group had significantly shorter overall survival than those in the low-risk group (log-rank test = 4.52 × 10). The ROC curve indicated that the clinical genomic model including both the TNM staging system and the signature had a superior performance in predicting the patients' overall survival compared to the clinical model with the TNM staging system only. Further stratification analysis suggested that the signature could work well in the different strata of the stage, gender, or age, rendering it to be a wide application. Finally, a ceRNA subnetwork related to the signature was extracted, demonstrating its high involvement in the tumorigenesis mechanism of LUAD. In conclusion, the present study established a lncRNA-based molecular signature, which can significantly improve prognosis prediction for LUAD patients.
早期精确预测是降低肺腺癌(LUAD)患者不良预后的重要途径。然而,广泛使用的仅基于解剖学信息的肿瘤、淋巴结和转移(TNM)分期系统在预测LUAD患者预后方面往往表现不佳。因此,本研究旨在探讨长链非编码RNA(lncRNAs)是否可以提供额外信息以改善LUAD患者的预后预测,已知lncRNAs通过竞争性内源性RNA(ceRNAs)机制高度参与LUAD的肿瘤发生。为了验证这一假设,分析了从癌症基因组图谱(TCGA)数据库获得的包含RNA测序数据和临床病理数据的数据集。然后,鉴定差异表达的RNA(DElncRNAs、DEmiRNAs和DEmRNAs),并基于这些差异表达的RNA构建lncRNA-miRNA-mRNA ceRNA网络。功能富集分析表明,该ceRNA网络在一些癌症相关信号通路中高度富集。接下来,对套索-考克斯模型运行1000次,以识别ceRNA网络中候选lncRNAs的潜在生存相关组合,随后进行“最佳子集选择”以进一步优化这些基于lncRNA的组合,并确定了性能最佳的七lncRNA预后特征。基于中位风险评分,LUAD患者可被很好地分为高/低风险亚组。Kaplan-Meier生存曲线显示,高风险组的LUAD患者总生存期明显短于低风险组(对数秩检验 = 4.52 × 10)。ROC曲线表明,与仅使用TNM分期系统的临床模型相比,包括TNM分期系统和该特征的临床基因组模型在预测患者总生存期方面具有更好的性能。进一步的分层分析表明,该特征在不同的分期、性别或年龄亚组中均表现良好,具有广泛的应用前景。最后,提取了与该特征相关的ceRNA子网,表明其高度参与LUAD的肿瘤发生机制。总之,本研究建立了一种基于lncRNA的分子特征,可显著改善LUAD患者的预后预测。