Dong Changlong, Rao Nini, Du Wenju, Gao Fenglin, Lv Xiaoqin, Wang Guangbin, Zhang Junpeng
Center for Informational Biology, School of Life Sciences and Technology, University of Electronic Science and Technology of China, Chengdu, China.
School of Life Sciences and Technology, University of Electronic Science and Technology of China, Chengdu, China.
Front Genet. 2021 Jul 27;12:679612. doi: 10.3389/fgene.2021.679612. eCollection 2021.
In this work, an algorithm named mRBioM was developed for the identification of potential mRNA biomarkers (PmBs) from complete transcriptomic RNA profiles of gastric adenocarcinoma (GA).
mRBioM initially extracts differentially expressed (DE) RNAs (mRNAs, miRNAs, and lncRNAs). Next, mRBioM calculates the total information amount of each DE mRNA based on the coexpression network, including three types of RNAs and the protein-protein interaction network encoded by DE mRNAs. Finally, PmBs were identified according to the variation trend of total information amount of all DE mRNAs. Four PmB-based classifiers without learning and with learning were designed to discriminate the sample types to confirm the reliability of PmBs identified by mRBioM. PmB-based survival analysis was performed. Finally, three other cancer datasets were used to confirm the generalization ability of mRBioM.
mRBioM identified 55 PmBs (41 upregulated and 14 downregulated) related to GA. The list included thirteen PmBs that have been verified as biomarkers or potential therapeutic targets of gastric cancer, and some PmBs were newly identified. Most PmBs were primarily enriched in the pathways closely related to the occurrence and development of gastric cancer. Cancer-related factors without learning achieved sensitivity, specificity, and accuracy of 0.90, 1, and 0.90, respectively, in the classification of the GA and control samples. Average accuracy, sensitivity, and specificity of the three classifiers with machine learning ranged within 0.94-0.98, 0.94-0.97, and 0.97-1, respectively. The prognostic risk score model constructed by 4 PmBs was able to correctly and significantly ( < 0.001) classify 269 GA patients into the high-risk ( = 134) and low-risk ( = 135) groups. GA equivalent classification performance was achieved using the complete transcriptomic RNA profiles of colon adenocarcinoma, lung adenocarcinoma, and hepatocellular carcinoma using PmBs identified by mRBioM.
GA-related PmBs have high specificity and sensitivity and strong prognostic risk prediction. MRBioM has also good generalization. These PmBs may have good application prospects for early diagnosis of GA and may help to elucidate the mechanism governing the occurrence and development of GA. Additionally, mRBioM is expected to be applied for the identification of other cancer-related biomarkers.
在本研究中,开发了一种名为mRBioM的算法,用于从胃腺癌(GA)的完整转录组RNA谱中识别潜在的mRNA生物标志物(PmBs)。
mRBioM首先提取差异表达(DE)RNA(mRNA、miRNA和lncRNA)。接下来,mRBioM基于共表达网络计算每个DE mRNA的总信息量,该网络包括三种类型的RNA以及由DE mRNA编码的蛋白质-蛋白质相互作用网络。最后,根据所有DE mRNA总信息量的变化趋势识别PmBs。设计了四个基于PmB的无学习和有学习的分类器来区分样本类型,以确认mRBioM识别的PmBs的可靠性。进行了基于PmB的生存分析。最后,使用其他三个癌症数据集来确认mRBioM的泛化能力。
mRBioM识别出55个与GA相关的PmBs(41个上调和14个下调)。该列表包括13个已被验证为胃癌生物标志物或潜在治疗靶点的PmBs,并且一些PmBs是新识别的。大多数PmBs主要富集在与胃癌发生和发展密切相关的途径中。在GA和对照样本的分类中,无学习的癌症相关因子的敏感性、特异性和准确性分别达到0.90、1和0.90。具有机器学习的三个分类器的平均准确性、敏感性和特异性分别在0.94 - 0.98、0.94 - 0.97和0.97 - 1范围内。由4个PmBs构建的预后风险评分模型能够正确且显著地(<0.001)将269例GA患者分为高风险组(=134)和低风险组(=135)。使用mRBioM识别的PmBs,利用结肠腺癌、肺腺癌和肝细胞癌的完整转录组RNA谱实现了GA等效分类性能。
GA相关的PmBs具有高特异性和敏感性以及强大的预后风险预测能力。MRBioM也具有良好的泛化性。这些PmBs在GA的早期诊断中可能具有良好的应用前景,并可能有助于阐明GA发生和发展的机制。此外,预计mRBioM可用于识别其他癌症相关生物标志物。