Sprott Center for Stem Cell Research, Ottawa Hospital Research Institute, Ottawa, Canada.
PLoS One. 2011;6(6):e20561. doi: 10.1371/journal.pone.0020561. Epub 2011 Jun 15.
Many computational methods have been used to predict novel non-coding RNAs (ncRNAs), but none, to our knowledge, have explicitly investigated the impact of integrating existing cDNA-based Expressed Sequence Tag (EST) data that flank structural RNA predictions. To determine whether flanking EST data can assist in microRNA (miRNA) prediction, we identified genomic sites encoding putative miRNAs by combining functional RNA predictions with flanking ESTs data in a model consistent with miRNAs undergoing cleavage during maturation. In both human and mouse genomes, we observed that the inclusion of flanking ESTs adjacent to and not overlapping predicted miRNAs significantly improved the performance of various methods of miRNA prediction, including direct high-throughput sequencing of small RNA libraries. We analyzed the expression of hundreds of miRNAs predicted to be expressed during myogenic differentiation using a customized microarray and identified several known and predicted myogenic miRNA hairpins. Our results indicate that integrating ESTs flanking structural RNA predictions improves the quality of cleaved miRNA predictions and suggest that this strategy can be used to predict other non-coding RNAs undergoing cleavage during maturation.
许多计算方法已被用于预测新型非编码 RNA(ncRNA),但据我们所知,还没有任何方法明确研究整合现有基于 cDNA 的表达序列标签(EST)数据对结构 RNA 预测的影响。为了确定侧翼 EST 数据是否可以辅助 microRNA(miRNA)预测,我们通过将功能 RNA 预测与 miRNA 成熟过程中发生切割相一致的模型中的侧翼 EST 数据相结合,鉴定出编码假定 miRNA 的基因组位点。在人类和小鼠基因组中,我们观察到,包括预测 miRNA 相邻但不重叠的侧翼 EST,显著提高了 miRNA 预测的各种方法的性能,包括直接对小 RNA 文库进行高通量测序。我们使用定制的微阵列分析了数百个预测在成肌分化过程中表达的 miRNA 的表达情况,并鉴定出了一些已知和预测的成肌 miRNA 发夹。我们的结果表明,整合结构 RNA 预测的 EST 可以提高切割 miRNA 预测的质量,并表明该策略可用于预测其他在成熟过程中发生切割的非编码 RNA。