Veneziano Dario, Di Bella Sebastiano, Nigita Giovanni, Laganà Alessandro, Ferro Afredo, Croce Carlo M
Department of Cancer Biology and Genetics, Comprehensive Cancer Center, The Ohio State University, Columbus, Ohio, 43210.
Nerviano Medical Sciences srl, Nerviano, Milan, Italy.
Hum Mutat. 2016 Dec;37(12):1283-1298. doi: 10.1002/humu.23066. Epub 2016 Sep 5.
One of the most significant biological discoveries of the last decade is represented by the reality that the vast majority of the transcribed genomic output comprises diverse classes of noncoding RNAs (ncRNAs) that may play key roles and/or be affected by many biochemical cellular processes (i.e., RNA editing), with implications in human health and disease. With 90% of the human genome being transcribed and novel classes of ncRNA emerging (tRNA-derived small RNAs and circular RNAs among others), the great majority of the human transcriptome suggests that many important ncRNA functions/processes are yet to be discovered. An approach to filling such vast void of knowledge has been recently provided by the increasing application of next-generation sequencing (NGS), offering the unprecedented opportunity to obtain a more accurate profiling with higher resolution, increased throughput, sequencing depth, and low experimental complexity, concurrently posing an increasing challenge in terms of efficiency, accuracy, and usability of data analysis software. This review provides an overview of ncRNAs, NGS technology, and the most recent/popular computational approaches and the challenges they attempt to solve, which are essential to a more sensitive and comprehensive ncRNA annotation capable of furthering our understanding of this still vastly uncharted genomic territory.
过去十年最重要的生物学发现之一是,绝大多数转录的基因组产物包含各种类型的非编码RNA(ncRNA),它们可能发挥关键作用和/或受许多细胞生化过程(如RNA编辑)影响,对人类健康和疾病具有重要意义。随着90%的人类基因组被转录以及新型ncRNA(如tRNA衍生的小RNA和环状RNA等)不断涌现,人类转录组的绝大部分表明许多重要的ncRNA功能/过程仍有待发现。下一代测序(NGS)应用的不断增加,最近为填补如此巨大的知识空白提供了一种方法,它提供了前所未有的机会,能够以更高的分辨率、更高的通量、测序深度和更低的实验复杂性获得更准确的图谱,同时在数据分析软件的效率、准确性和可用性方面也带来了越来越大的挑战。本综述概述了ncRNA、NGS技术以及最新/最流行的计算方法和它们试图解决的挑战,这些对于更敏感、全面的ncRNA注释至关重要,有助于我们进一步了解这片仍未被充分探索的基因组领域。