Cancer Biomark. 2019;25(4):313-324. doi: 10.3233/CBM-190225.
Lung adenocarcinoma (LUAD) accounts for a significant proportion of lung cancer and there have been few diagnostic and therapeutic targets for LUAD due to the lack of specific biomarker. The aim of this study was to identify key long non-coding RNAs (lncRNAs) for LUAD.
The lncRNA and mRNA expression profiles of a large group of patients with LUAD were obtained from The Cancer Genome Atlas (TCGA). The differentially expressed lncRNAs (DElncRNAs) and mRNAs (DEmRNAs) were identified. The optimal diagnostic lncRNA biomarkers for LUAD were identified by using feature selection procedure and classification model. We established classification models including random forests, decision tree and support vector machine to distinguish LUAD and normal tissues. The lncRNAs-mRNAs co-expression networks and module identification were established by weighted gene co-expression network analysis (WGCNA). Functional annotation of pink and green modules was performed. The expression of selected DElncRNAs were validated by qRT-PCR.
A total of 1364 DEmRNAs (468 down-regulated and 896 up-regulated mRNAs) and 260 DElncRNAs (88 down-regulated and 172 up-regulated lncRNAs) between LUAD and normal tissue were obtained. LANCL1-AS1, MIR3945HG, LINC01270, RP5-1061H20.4, BLACAT1, LINC01703, CTD-2227E11.1 and RP1-244F24.1 were identified as optimal diagnostic lncRNA biomarkers for LUAD. The area under curve (AUC) of the random forests model, decision tree model and SVM model were 0.999, 0.937 and 0.999, and the specificity and sensitivity of the three model were 98.3% and 99.8%, 93.2% and 99% and 100% and 98.4%, respectively. Co-expression networks analysis showed that RP11-389C8.2, CTD-2510F5.4 and TMPO-AS1 were co-expressed with 44, 242 and 241 mRNAs, respectively. Cell cycle, DNA replication and p53 signaling pathway were three significantly enriched pathways. The qRT-PCR results were consistent with our integrated analysis, generally. The GSE32863 and GSE104854 validation was consistent with our integrated analysis, generally.
Our study identified eight DElncRNAs as potential diagnostic biomarkers of LUAD. Functional annotation of green module provided new evidences for exploring the precise roles of lncRNA in LUAD.
肺腺癌 (LUAD) 在肺癌中占很大比例,由于缺乏特异性生物标志物,LUAD 的诊断和治疗靶点很少。本研究旨在鉴定 LUAD 的关键长非编码 RNA (lncRNA)。
从癌症基因组图谱 (TCGA) 获得大量 LUAD 患者的 lncRNA 和 mRNA 表达谱。鉴定差异表达的 lncRNA (DElncRNA) 和 mRNA (DEmRNA)。通过特征选择程序和分类模型鉴定 LUAD 的最佳诊断 lncRNA 生物标志物。我们建立了分类模型,包括随机森林、决策树和支持向量机,以区分 LUAD 和正常组织。通过加权基因共表达网络分析 (WGCNA) 建立 lncRNA-mRNA 共表达网络和模块识别。对粉色和绿色模块进行功能注释。通过 qRT-PCR 验证选定的 DElncRNA 的表达。
在 LUAD 与正常组织之间获得了 1364 个 DEmRNA(468 个下调和 896 个上调 mRNA)和 260 个 DElncRNA(88 个下调和 172 个上调 lncRNA)。LANCL1-AS1、MIR3945HG、LINC01270、RP5-1061H20.4、BLACAT1、LINC01703、CTD-2227E11.1 和 RP1-244F24.1 被鉴定为 LUAD 的最佳诊断 lncRNA 生物标志物。随机森林模型、决策树模型和 SVM 模型的曲线下面积 (AUC) 分别为 0.999、0.937 和 0.999,三个模型的特异性和敏感性分别为 98.3%和 99.8%、93.2%和 99%和 100%和 98.4%。共表达网络分析表明,RP11-389C8.2、CTD-2510F5.4 和 TMPO-AS1 分别与 44、242 和 241 个 mRNA 共表达。细胞周期、DNA 复制和 p53 信号通路是三个显著富集的通路。qRT-PCR 结果与我们的综合分析基本一致。GSE32863 和 GSE104854 的验证与我们的综合分析基本一致。
本研究鉴定了 8 个 DElncRNA 作为 LUAD 的潜在诊断生物标志物。绿色模块的功能注释为探索 lncRNA 在 LUAD 中的精确作用提供了新的证据。