College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin·University, Changchun, Jilin, 130012, China.
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin·University, Changchun, Jilin, 130012, China; College of Software, Jilin University, Changchun, Jilin, 130012, China.
Comput Biol Med. 2022 Sep;148:105883. doi: 10.1016/j.compbiomed.2022.105883. Epub 2022 Jul 20.
The transcriptome describes the expression of all genes in a sample. Most studies have investigated the differential patterns or discrimination powers of transcript expression levels. In this study, we hypothesized that the quantitative correlations between the expression levels of transcription factors (TFs) and their regulated target genes (mRNAs) serve as a novel view of healthy status, and a disease sample exhibits a differential landscape (mqTrans) of transcription regulations compared with healthy status. We formulated quantitative transcription regulation relationships of metabolism-related genes as a multi-input multi-output regression model via a gated recurrent unit (GRU) network. The GRU model was trained using healthy blood transcriptomes and the expression levels of mRNAs were predicted by those of the TFs. The mqTrans feature of a gene was defined as the difference between its predicted and actual expression levels. A pan-cancer investigation of the differentially expressed mqTrans features was conducted between the early- and late-stage cancers in 26 cancer types of The Cancer Genome Atlas database. This study focused on the differentially expressed mqTrans features, that did not show differential expression in the actual expression levels. These genes could not be detected by conventional differential analysis. Such dark biomarkers are worthy of further wet-lab investigation. The experimental data also showed that the proposed mqTrans investigation improved the classification between early- and late-stage samples for some cancer types. Thus, the mqTrans features serve as a complementary view to transcriptomes, an OMIC type with mature high-throughput production technologies, and abundant public resources.
转录组描述了样本中所有基因的表达情况。大多数研究都调查了转录表达水平的差异模式或判别能力。在这项研究中,我们假设转录因子(TFs)的表达水平与其调控的靶基因(mRNA)之间的定量相关性可以作为一种健康状态的新视角,并且与健康状态相比,疾病样本显示出转录调控的差异景观(mqTrans)。我们通过门控循环单元(GRU)网络将与代谢相关基因的定量转录调控关系构建为多输入多输出回归模型。GRU 模型使用健康血液转录组进行训练,并通过 TFs 的表达水平来预测 mRNA 的表达水平。将基因的 mqTrans 特征定义为其预测和实际表达水平之间的差异。通过对 26 种癌症类型的癌症基因组图谱数据库中早期和晚期癌症之间的差异表达 mqTrans 特征进行泛癌研究。本研究主要关注差异表达的 mqTrans 特征,这些特征在实际表达水平上没有表现出差异。这些基因无法通过传统的差异分析检测到。这种暗生物标志物值得进一步进行湿实验室研究。实验数据还表明,对于某些癌症类型,所提出的 mqTrans 研究提高了早期和晚期样本之间的分类。因此,mqTrans 特征作为转录组的补充视角,与转录组具有相同的作用,转录组是一种具有成熟高通量生产技术和丰富公共资源的 OMIC 类型。