Department of Pharmacology, University of North Carolina, Chapel Hill, North Carolina 27599, USA.
RNA Discovery Center, University of North Carolina, Chapel Hill, North Carolina 27599, USA.
RNA. 2024 Oct 16;30(11):1408-1421. doi: 10.1261/rna.080188.124.
SEquence Evaluation through -mer Representation (SEEKR) is a method of sequence comparison that uses sequence substrings called -mers to quantify the nonlinear similarity between nucleic acid species. We describe the development of new functions within SEEKR that enable end-users to estimate values that ascribe statistical significance to SEEKR-derived similarities, as well as visualize different aspects of -mer similarity. We apply the new functions to identify chromatin-enriched lncRNAs that contain -like sequence features, and we demonstrate the utility of applying SEEKR on lncRNA fragments to identify potential RNA-protein interaction domains. We also highlight ways in which SEEKR can be applied to augment studies of lncRNA conservation, and we outline the best practice of visualizing RNA-seq read density to evaluate support for lncRNA annotations before their in-depth study in cell types of interest.
通过 -mer 表示进行序列评估 (SEEKR) 是一种序列比较方法,它使用称为 -mers 的序列子字符串来量化核酸物种之间的非线性相似性。我们描述了 SEEKR 中开发的新功能,使最终用户能够估计为 SEEKR 衍生的相似性赋予统计意义的值,以及可视化 -mer 相似性的不同方面。我们应用新功能来识别富含染色质的 lncRNA,这些 lncRNA 包含类似的序列特征,并展示了在 lncRNA 片段上应用 SEEKR 来识别潜在的 RNA-蛋白质相互作用结构域的效用。我们还强调了 SEEKR 可以应用于增强 lncRNA 保守性研究的方法,并概述了可视化 RNA-seq 读取密度的最佳实践,以在深入研究感兴趣的细胞类型之前评估对 lncRNA 注释的支持。