Department of Public Health, Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, U.S.A.
Department of Statistics, National Cheng Kung University, Tainan 701401, Taiwan.
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad520.
As an important player in transcriptome regulation, microRNAs may effectively diffuse somatic mutation impacts to broad cellular processes and ultimately manifest disease and dictate prognosis. Previous studies that tried to correlate mutation with gene expression dysregulation neglected to adjust for the disparate multitudes of false positives associated with unequal sample sizes and uneven class balancing scenarios.
To properly address this issue, we developed a statistical framework to rigorously assess the extent of mutation impact on microRNAs in relation to a permutation-based null distribution of a matching sample structure. Carrying out the framework in a pan-cancer study, we ascertained 9008 protein-coding genes with statistically significant mutation impacts on miRNAs. Of these, the collective miRNA expression for 83 genes showed significant prognostic power in nine cancer types. For example, in lower-grade glioma, 10 genes' mutations broadly impacted miRNAs, all of which showed prognostic value with the corresponding miRNA expression. Our framework was further validated with functional analysis and augmented with rich features including the ability to analyze miRNA isoforms; aggregative prognostic analysis; advanced annotations such as mutation type, regulator alteration, somatic motif, and disease association; and instructive visualization such as mutation OncoPrint, Ideogram, and interactive mRNA-miRNA network.
The data underlying this article are available in MutMix, at http://innovebioinfo.com/Database/TmiEx/MutMix.php.
作为转录组调控的重要参与者,microRNAs 可能会有效地将体细胞突变的影响扩散到广泛的细胞过程中,并最终表现为疾病并决定预后。以前的研究试图将突变与基因表达失调相关联,但忽略了调整与不等样本量和不平衡类别平衡情况相关的大量虚假阳性。
为了正确解决这个问题,我们开发了一个统计框架,以严格评估突变对 microRNAs 的影响程度,与匹配样本结构的基于排列的零分布相关。在泛癌研究中执行该框架,我们确定了 9008 个具有统计学意义的突变影响 miRNA 的蛋白质编码基因。在这些基因中,83 个基因的集体 miRNA 表达在 9 种癌症类型中具有显著的预后能力。例如,在低级别神经胶质瘤中,10 个基因的突变广泛影响 miRNA,所有这些基因的相应 miRNA 表达均具有预后价值。我们的框架还通过功能分析进行了验证,并增加了丰富的功能,包括分析 miRNA 同种型的能力;聚合预后分析;高级注释,如突变类型、调节剂改变、体细胞基序和疾病关联;以及有用的可视化,如突变 OncoPrint、Ideogram 和交互式 mRNA-miRNA 网络。
本文所依据的数据可在 MutMix 中获得,网址为 http://innovebioinfo.com/Database/TmiEx/MutMix.php。