Suppr超能文献

通过组装 RNA-seq 和 small RNA-seq 数据对长非编码 RNA 进行深度注释。

Deep annotation of long noncoding RNAs by assembling RNA-seq and small RNA-seq data.

机构信息

Oujiang Laboratory, Zhejiang Provincial Key Laboratory of Medical Genetics, Key Laboratory of Laboratory Medicine, Ministry of Education, School of Laboratory Medicine and Life Sciences, Wenzhou Medical University, Wenzhou, Zhejiang Province, China; Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou Medical University, Wenzhou, Zhejiang Province, China.

Oujiang Laboratory, Zhejiang Provincial Key Laboratory of Medical Genetics, Key Laboratory of Laboratory Medicine, Ministry of Education, School of Laboratory Medicine and Life Sciences, Wenzhou Medical University, Wenzhou, Zhejiang Province, China.

出版信息

J Biol Chem. 2023 Sep;299(9):105130. doi: 10.1016/j.jbc.2023.105130. Epub 2023 Aug 4.

Abstract

Long noncoding RNAs (lncRNAs) are increasingly being recognized as modulators in various biological processes. However, due to their low expression, their systematic characterization is difficult to determine. Here, we performed transcript annotation by a newly developed computational pipeline, termed RNA-seq and small RNA-seq combined strategy (RSCS), in a wide variety of cellular contexts. Thousands of high-confidence potential novel transcripts were identified by the RSCS, and the reliability of the transcriptome was verified by analysis of transcript structure, base composition, and sequence complexity. Evidenced by the length comparison, the frequency of the core promoter and the polyadenylation signal motifs, and the locations of transcription start and end sites, the transcripts appear to be full length. Furthermore, taking advantage of our strategy, we identified a large number of endogenous retrovirus-associated lncRNAs, and a novel endogenous retrovirus-lncRNA that was functionally involved in control of Yap1 expression and essential for early embryogenesis was identified. In summary, the RSCS can generate a more complete and precise transcriptome, and our findings greatly expanded the transcriptome annotation for the mammalian community.

摘要

长链非编码 RNA(lncRNAs)越来越被认为是各种生物过程的调节剂。然而,由于其表达量低,其系统特征的确定具有一定难度。在这里,我们通过一种新开发的计算管道(称为 RNA-seq 和 small RNA-seq 联合策略,RSCS),在广泛的细胞环境中进行了转录本注释。通过 RSCS 鉴定了数千个高可信度的潜在新型转录本,通过分析转录本结构、碱基组成和序列复杂性来验证转录组的可靠性。通过长度比较、核心启动子和聚腺苷酸化信号基序的频率以及转录起始和结束位点的位置,可以看出这些转录本是全长的。此外,利用我们的策略,我们鉴定了大量内源性逆转录病毒相关的 lncRNAs,并鉴定了一种新型内源性逆转录病毒-lncRNA,该 lncRNA 参与控制 Yap1 表达,对早期胚胎发生至关重要。总之,RSCS 可以生成更完整、更精确的转录组,我们的发现极大地扩展了哺乳动物转录组注释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7272/10498003/bf55fd8a726a/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验