Loher Phillipe, Telonis Aristeidis G, Rigoutsos Isidore
Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA, USA.
Methods Mol Biol. 2018;1680:237-255. doi: 10.1007/978-1-4939-7339-2_16.
There is an increasing interest within the scientific community in identifying tRNA-derived fragments (tRFs) and elucidating the roles they play in the cell. Such endeavors can be greatly facilitated by mining the numerous datasets from many cellular contexts that exist publicly. However, the standard mapping tools cannot be used for the purpose. Several factors complicate this endeavor including: the presence of multiple identical or nearly identical isodecoders at various genomic locations; the presence of identical sequence segments that are shared by isodecoders of the same or even different anticodons; the existence of numerous partial tRNA sequences across the genome; the existence of hundreds of "lookalike" sequences that resemble true tRNAs; and others. This is generating a need for specialized tools that can mine deep sequencing data to identify and quantify tRFs. We discuss the various complicating factors and their ramifications, and how to use and run MINTmap, a tool that addresses these considerations.
科学界对识别tRNA衍生片段(tRFs)并阐明它们在细胞中所起的作用越来越感兴趣。通过挖掘公开存在的来自许多细胞环境的大量数据集,可以极大地促进此类研究。然而,标准的映射工具不能用于此目的。有几个因素使这项工作变得复杂,包括:在不同基因组位置存在多个相同或几乎相同的同功受体;相同或甚至不同反密码子的同功受体共享相同的序列片段;全基因组中存在大量部分tRNA序列;存在数百个类似于真正tRNA的“相似”序列等等。这就产生了对能够挖掘深度测序数据以识别和量化tRFs的专门工具的需求。我们讨论了各种复杂因素及其影响,以及如何使用和运行MINTmap,这是一种解决这些问题的工具。