Guo Shiyong, Mao Chunyi, Peng Jun, Xie Shaohui, Yang Jun, Xie Wenping, Li Wanran, Yang Huaide, Guo Hao, Zhu Zexuan, Zheng Yun
State Key Laboratory of Primate Biomedical Research; Institute of Primate Translational Medicine, Kunming University of Science and Technology, Kunming, Yunnan 650500, China.
College of Horticulture and Landscape, Yunnan Agricultural University, Kunming, Yunnan, 650201, China.
Heliyon. 2024 Feb 10;10(4):e26081. doi: 10.1016/j.heliyon.2024.e26081. eCollection 2024 Feb 29.
MiRNAs are edited or modified in multiple ways during their biogenesis pathways. It was reported that miRNA editing was deregulated in tumors, suggesting the potential value of miRNA editing in cancer classification. Here we extracted three types of miRNA features from 395 LUAD and control samples, including the abundances of original miRNAs, the abundances of edited miRNAs, and the editing levels of miRNA editing sites. Our results show that eight classification algorithms selected generally had better performances on combined features than on the abundances of miRNAs or editing features of miRNAs alone. One feature selection algorithm, i.e., the DFL algorithm, selected only three features, i.e., the frequencies of hsa-miR-135b-5p, hsa-miR-210-3p and hsa-mir-182_48u (an edited miRNA), from 316 training samples. Seven classification algorithms achieved 100% accuracies on these three features for 79 independent testing samples. These results indicate that the additional information of miRNA editing is useful in improving the classification of LUAD samples.
微小RNA(miRNAs)在其生物合成途径中会以多种方式进行编辑或修饰。据报道,肿瘤中miRNA编辑失调,这表明miRNA编辑在癌症分类中具有潜在价值。在此,我们从395例肺腺癌(LUAD)样本和对照样本中提取了三种类型的miRNA特征,包括原始miRNAs的丰度、编辑后miRNAs的丰度以及miRNA编辑位点的编辑水平。我们的结果表明,所选择的八种分类算法通常在组合特征上比单独基于miRNAs的丰度或miRNAs的编辑特征表现更好。一种特征选择算法,即DFL算法,从316个训练样本中仅选择了三个特征,即hsa-miR-135b-5p、hsa-miR-210-3p和hsa-mir-182_48u(一种编辑后的miRNA)的频率。七种分类算法在这三个特征上对79个独立测试样本的准确率达到了100%。这些结果表明,miRNA编辑的额外信息有助于改善LUAD样本的分类。