Department of Life Science, College of Natural Sciences, Hanyang University, Seoul, 04763, Republic of Korea.
Research Institute for Convergence of Basic Sciences, Hanyang University, Seoul, 04763, Republic of Korea.
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):41. doi: 10.1186/s12859-018-2013-9.
LncRNAs are long regulatory non-coding RNAs, some of which are arguably predicted to have coding potential. Despite coding potential classifiers that utilize ribosome profiling data successfully detected actively translated regions, they are less sensitive to lncRNAs. Furthermore, lncRNA annotation can be susceptible to false positives obtained from 3' untranslated region (UTR) fragments of mRNAs.
To lower these limitations in lncRNA annotation, we present a novel tool TERIUS that provides a two-step filtration process to distinguish between bona fide and false lncRNAs. The first step successfully separates lncRNAs from protein-coding genes showing enhanced sensitivity compared to other methods. To eliminate 3'UTR fragments, the second step takes advantage of the 3'UTR-specific association with regulator of nonsense transcripts 1 (UPF1), leading to refined lncRNA annotation. Importantly, TERIUS enabled the detection of misclassified transcripts in published lncRNA annotations.
TERIUS is a robust method for lncRNA annotation, which provides an additional filtration step for 3'UTR fragments. TERIUS was able to successfully re-classify GENCODE and miTranscriptome lncRNA annotations. We believe that TERIUS can benefit construction of extensive and accurate non-coding transcriptome maps in many genomes.
lncRNAs 是长的调控性非编码 RNA,其中一些据称具有编码潜力。尽管利用核糖体图谱数据的编码潜力分类器成功地检测到了活跃翻译的区域,但它们对 lncRNAs 的敏感性较低。此外,lncRNA 注释可能容易受到来自 mRNA 3'非翻译区(UTR)片段的假阳性的影响。
为了降低 lncRNA 注释中的这些限制,我们提出了一种新的工具 TERIUS,它提供了一个两步过滤过程,以区分真正的和假的 lncRNAs。第一步成功地将 lncRNAs 与编码蛋白的基因区分开来,与其他方法相比,具有更高的敏感性。为了消除 3'UTR 片段,第二步利用 3'UTR 与无义转录物调节因子 1(UPF1)的特异性结合,从而进行更精细的 lncRNA 注释。重要的是,TERIUS 能够检测到已发表的 lncRNA 注释中分类错误的转录本。
TERIUS 是一种强大的 lncRNA 注释方法,它为 3'UTR 片段提供了额外的过滤步骤。TERIUS 能够成功地重新分类 GENCODE 和 miTranscriptome lncRNA 注释。我们相信,TERIUS 可以有益于在许多基因组中构建广泛而准确的非编码转录组图谱。