Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
Human Technopole, Fondazione Human Technopole, Milan, Italy.
Am J Hum Genet. 2024 Aug 8;111(8):1524-1543. doi: 10.1016/j.ajhg.2024.06.017. Epub 2024 Jul 24.
Gene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors. We found that while individual misexpression events occur rarely, in aggregate they were found in almost all samples and a third of inactive protein-coding genes. Using 2,821 paired whole-genome and RNA sequencing samples, we identified that misexpression events are enriched in cis for rare structural variants. We established putative mechanisms through which a subset of SVs lead to gene misexpression, including transcriptional readthrough, transcript fusions, and gene inversion. Overall, we develop misexpression as a type of transcriptomic outlier analysis and extend our understanding of the variety of mechanisms by which genetic variants can influence gene expression.
基因表达异常是指在通常不活跃的情况下基因的异常转录。尽管在某些罕见疾病中已知其具有病理性后果,但我们对其在人类中的更广泛普遍性和机制仍知之甚少。为了解决这个问题,我们分析了 INTERVAL 研究血液供体的 4568 个全血批量 RNA 测序样本中的基因表达异常。我们发现,虽然单个表达异常事件很少发生,但它们在总体上几乎存在于所有样本和三分之一的无活性蛋白编码基因中。使用 2821 对全基因组和 RNA 测序样本,我们发现表达异常事件在顺式中富含罕见的结构变异。我们通过一些 SV 导致基因表达异常的机制,包括转录通读、转录融合和基因倒置。总的来说,我们将表达异常作为一种转录组异常分析类型,并扩展了我们对遗传变异影响基因表达的各种机制的理解。