Suppr超能文献

通过组装 RNA-seq 和 small RNA-seq 数据对长非编码 RNA 进行深度注释。

Deep annotation of long noncoding RNAs by assembling RNA-seq and small RNA-seq data.

机构信息

Oujiang Laboratory, Zhejiang Provincial Key Laboratory of Medical Genetics, Key Laboratory of Laboratory Medicine, Ministry of Education, School of Laboratory Medicine and Life Sciences, Wenzhou Medical University, Wenzhou, Zhejiang Province, China; Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou Medical University, Wenzhou, Zhejiang Province, China.

Oujiang Laboratory, Zhejiang Provincial Key Laboratory of Medical Genetics, Key Laboratory of Laboratory Medicine, Ministry of Education, School of Laboratory Medicine and Life Sciences, Wenzhou Medical University, Wenzhou, Zhejiang Province, China.

出版信息

J Biol Chem. 2023 Sep;299(9):105130. doi: 10.1016/j.jbc.2023.105130. Epub 2023 Aug 4.

Abstract

Long noncoding RNAs (lncRNAs) are increasingly being recognized as modulators in various biological processes. However, due to their low expression, their systematic characterization is difficult to determine. Here, we performed transcript annotation by a newly developed computational pipeline, termed RNA-seq and small RNA-seq combined strategy (RSCS), in a wide variety of cellular contexts. Thousands of high-confidence potential novel transcripts were identified by the RSCS, and the reliability of the transcriptome was verified by analysis of transcript structure, base composition, and sequence complexity. Evidenced by the length comparison, the frequency of the core promoter and the polyadenylation signal motifs, and the locations of transcription start and end sites, the transcripts appear to be full length. Furthermore, taking advantage of our strategy, we identified a large number of endogenous retrovirus-associated lncRNAs, and a novel endogenous retrovirus-lncRNA that was functionally involved in control of Yap1 expression and essential for early embryogenesis was identified. In summary, the RSCS can generate a more complete and precise transcriptome, and our findings greatly expanded the transcriptome annotation for the mammalian community.

摘要

长链非编码 RNA(lncRNAs)越来越被认为是各种生物过程的调节剂。然而,由于其表达量低,其系统特征的确定具有一定难度。在这里,我们通过一种新开发的计算管道(称为 RNA-seq 和 small RNA-seq 联合策略,RSCS),在广泛的细胞环境中进行了转录本注释。通过 RSCS 鉴定了数千个高可信度的潜在新型转录本,通过分析转录本结构、碱基组成和序列复杂性来验证转录组的可靠性。通过长度比较、核心启动子和聚腺苷酸化信号基序的频率以及转录起始和结束位点的位置,可以看出这些转录本是全长的。此外,利用我们的策略,我们鉴定了大量内源性逆转录病毒相关的 lncRNAs,并鉴定了一种新型内源性逆转录病毒-lncRNA,该 lncRNA 参与控制 Yap1 表达,对早期胚胎发生至关重要。总之,RSCS 可以生成更完整、更精确的转录组,我们的发现极大地扩展了哺乳动物转录组注释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7272/10498003/bf55fd8a726a/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验