He Shunmin, Su Hua, Liu Changning, Skogerbø Geir, He Housheng, He Dandan, Zhu Xiaopeng, Liu Tao, Zhao Yi, Chen Runsheng
Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, PR China.
BMC Genomics. 2008 May 21;9:236. doi: 10.1186/1471-2164-9-236.
Recent analysis of the mouse transcriptional data has revealed the existence of approximately 34,000 messenger-like non-coding RNAs (ml-ncRNAs). Whereas the functional properties of these ml-ncRNAs are beginning to be unravelled, no functional information is available for the large majority of these transcripts.
A few ml-ncRNA have been shown to have genomic loci that overlap with microRNA loci, leading us to suspect that a fraction of ml-ncRNA may encode microRNAs. We therefore developed an algorithm (PriMir) for specifically detecting potential microRNA-encoding transcripts in the entire set of 34,030 mouse full-length ml-ncRNAs. In combination with mouse-rat sequence conservation, this algorithm detected 97 (80 of them were novel) strong miRNA-encoding candidates, and for 52 of these we obtained experimental evidence for the existence of their corresponding mature microRNA by microarray and stem-loop RT-PCR. Sequence analysis of the microRNA-encoding RNAs revealed an internal motif, whose presence correlates strongly (R2 = 0.9, P-value = 2.2 x 10(-16)) with the occurrence of stem-loops with characteristics of known pre-miRNAs, indicating the presence of a larger number microRNA-encoding RNAs (from 300 up to 800) in the ml-ncRNAs population.
Our work highlights a unique group of ml-ncRNAs and offers clues to their functions.
最近对小鼠转录数据的分析揭示了大约34000种信使样非编码RNA(ml-ncRNAs)的存在。虽然这些ml-ncRNAs的功能特性开始被揭示,但对于这些转录本中的绝大多数,尚无功能信息。
已显示少数ml-ncRNA的基因组位点与微小RNA位点重叠,这使我们怀疑一部分ml-ncRNA可能编码微小RNA。因此,我们开发了一种算法(PriMir),用于在34030个小鼠全长ml-ncRNAs的整个集合中特异性检测潜在的编码微小RNA的转录本。结合小鼠-大鼠序列保守性,该算法检测到97个(其中80个是新的)强烈的编码miRNA的候选物,并且对于其中52个,我们通过微阵列和茎环RT-PCR获得了其相应成熟微小RNA存在的实验证据。对编码微小RNA的RNA的序列分析揭示了一个内部基序,其存在与具有已知前体miRNA特征的茎环的出现密切相关(R2 = 0.9,P值 = 2.2 x 10(-16)),表明在ml-ncRNAs群体中存在更多数量的编码微小RNA的RNA(从300到800)。
我们的工作突出了一组独特的ml-ncRNAs,并为它们的功能提供了线索。