Suppr超能文献

转录本组装可提高单细胞 RNA-seq 数据中转座元件的表达定量。

Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data.

机构信息

Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA.

Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.

出版信息

Genome Res. 2021 Jan;31(1):88-100. doi: 10.1101/gr.265173.120. Epub 2020 Dec 21.

Abstract

Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-seq) revolutionized cell type-specific gene expression analysis. However, effective scRNA-seq quantification tools tailored for TEs are lacking, limiting our ability to dissect TE expression dynamics at single-cell resolution. To address this issue, we established a TE expression quantification pipeline that is compatible with scRNA-seq data generated across multiple technology platforms. We constructed TE-containing ncRNA references using bulk RNA-seq data and showed that quantifying TE expression at the transcript level effectively reduces noise. As proof of principle, we applied this strategy to mouse embryonic stem cells and successfully captured the expression profile of endogenous retroviruses in single cells. We further expanded our analysis to scRNA-seq data from early stages of mouse embryogenesis. Our results illustrated the dynamic TE expression at preimplantation stages and revealed 146 TE-containing ncRNA transcripts with substantial tissue specificity during gastrulation and early organogenesis.

摘要

转座元件 (TEs) 是宿主转录组的一个组成部分。含有 TE 的非编码 RNA (ncRNA) 表现出相当大的组织特异性,并在发育过程中发挥重要作用,包括干细胞维持和细胞分化。单细胞 RNA 测序 (scRNA-seq) 的最新进展彻底改变了细胞类型特异性基因表达分析。然而,缺乏针对 TEs 的有效 scRNA-seq 定量工具,限制了我们在单细胞分辨率下剖析 TE 表达动态的能力。为了解决这个问题,我们建立了一个与跨多个技术平台生成的 scRNA-seq 数据兼容的 TE 表达定量管道。我们使用批量 RNA-seq 数据构建了含有 TE 的 ncRNA 参考,并表明在转录本水平上定量 TE 表达可有效降低噪声。作为原理验证,我们将此策略应用于小鼠胚胎干细胞,并成功捕获了单个细胞中内源性逆转录病毒的表达谱。我们进一步将分析扩展到来自小鼠胚胎发生早期的 scRNA-seq 数据。我们的结果说明了植入前阶段的动态 TE 表达,并揭示了在原肠胚形成和早期器官发生过程中具有显著组织特异性的 146 个含有 TE 的 ncRNA 转录本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85a5/7849386/ab58411dab47/88f01.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验