Kim Shinuk
Department of Civil Engineering, Sangmyung University, Cheonan, Republic of Korea.
Cancer Inform. 2020 Feb 28;19:1176935120908301. doi: 10.1177/1176935120908301. eCollection 2020.
Microarray data sets have been used for predicting cancer biomarkers. Yet, replication of the prediction has not been fully satisfied. Recently, new data sets called deep sequencing data sets have been generated, with an advantage of less noise in computational analysis. In this study, we analyzed the kidney miRNA and mRNA sequence data sets for predicting cancer markers using 5 different statistical feature selection methods. In the results, we obtained 3 mRNA- and 27 miRNA-based cancer biomarkers to compare with the normal samples. In addition, we clustered the kidney cancer subtypes using a nonnegative matrix factorization method and obtained significant results of survival analysis from the 2 separate groups including miRNA-342 and its target eukaryotic translation initiation factor 5A ().
微阵列数据集已被用于预测癌症生物标志物。然而,预测的重复性尚未得到充分满足。最近,产生了一种名为深度测序数据集的新数据集,其在计算分析中具有噪声较少的优势。在本研究中,我们使用5种不同的统计特征选择方法分析了肾脏miRNA和mRNA序列数据集以预测癌症标志物。结果,我们获得了3种基于mRNA和27种基于miRNA的癌症生物标志物,用于与正常样本进行比较。此外,我们使用非负矩阵分解方法对肾癌亚型进行聚类,并从包括miRNA - 342及其靶标真核生物翻译起始因子5A()的2个独立组中获得了显著的生存分析结果。