Suppr超能文献

荟萃分析表明内含子保留会影响从RNA测序数据中对转座元件的定量分析。

Meta-Analysis Suggests That Intron Retention Can Affect Quantification of Transposable Elements from RNA-Seq Data.

作者信息

Gualandi Nicolò, Iperi Cristian, Esposito Mauro, Ansaloni Federico, Gustincich Stefano, Sanges Remo

机构信息

Computational Genomics Laboratory, Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), 34136 Trieste, Italy.

Central RNA Laboratory, Istituto Italiano di Tecnologia, 16132 Genova, Italy.

出版信息

Biology (Basel). 2022 May 27;11(6):826. doi: 10.3390/biology11060826.

Abstract

Transposable elements (TEs), also known as "jumping genes", are repetitive sequences with the capability of changing their location within the genome. They are key players in many different biological processes in health and disease. Therefore, a reliable quantification of their expression as transcriptional units is crucial to distinguish between their independent expression and the transcription of their sequences as part of canonical transcripts. TEs quantification faces difficulties of different types, the most important one being low reads mappability due to their repetitive nature preventing an unambiguous mapping of reads originating from their sequences. A large fraction of TEs fragments localizes within introns, which led to the hypothesis that intron retention (IR) can be an additional source of bias, potentially affecting accurate TEs quantification. IR occurs when introns, normally removed from the mature transcript by the splicing machinery, are maintained in mature transcripts. IR is a widespread mechanism affecting many different genes with cell type-specific patterns. We hypothesized that, in an RNA-seq experiment, reads derived from retained introns can introduce a bias in the detection of overlapping, independent TEs RNA expression. In this study we performed meta-analysis using public RNA-seq data from lymphoblastoid cell lines and show that IR can impact TEs quantification using established tools with default parameters. Reads mapped on intronic TEs were indeed associated to the expression of TEs and influence their correct quantification as independent transcriptional units. We confirmed these results using additional independent datasets, demonstrating that this bias does not appear in samples where IR is not present and that differential TEs expression does not impact on IR quantification. We concluded that IR causes the over-quantification of intronic TEs and differential IR might be confused with differential TEs expression. Our results should be taken into account for a correct quantification of TEs expression from RNA-seq data, especially in samples in which IR is abundant.

摘要

转座元件(TEs),也被称为“跳跃基因”,是具有在基因组内改变其位置能力的重复序列。它们是健康和疾病中许多不同生物学过程的关键参与者。因此,将它们作为转录单位进行可靠的表达定量对于区分它们的独立表达和作为经典转录本一部分的序列转录至关重要。TEs定量面临着不同类型的困难,其中最重要的是由于其重复性质导致低读数映射能力,从而妨碍了源自其序列的读数的明确映射。很大一部分TEs片段定位于内含子内,这导致了一种假设,即内含子保留(IR)可能是偏差的另一个来源,可能会影响准确的TEs定量。当内含子(通常由剪接机制从成熟转录本中去除)保留在成熟转录本中时,就会发生IR。IR是一种广泛存在的机制,以细胞类型特异性模式影响许多不同的基因。我们假设,在RNA测序实验中,源自保留内含子的读数会在重叠的、独立的TEs RNA表达检测中引入偏差。在这项研究中,我们使用来自淋巴母细胞系的公共RNA测序数据进行了荟萃分析,结果表明,使用具有默认参数的既定工具时,IR会影响TEs定量。映射到内含子TEs上的读数确实与TEs的表达相关,并影响它们作为独立转录单位的正确定量。我们使用额外的独立数据集证实了这些结果,表明这种偏差在不存在IR的样本中不会出现,并且TEs的差异表达不会影响IR定量。我们得出结论,IR导致内含子TEs的过度定量,并且差异IR可能会与TEs的差异表达相混淆。从RNA测序数据中正确定量TEs表达时应考虑我们的结果,特别是在IR丰富的样本中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1461/9220773/66bd09786691/biology-11-00826-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验