Pedersen Jakob Skou, Bejerano Gill, Siepel Adam, Rosenbloom Kate, Lindblad-Toh Kerstin, Lander Eric S, Kent Jim, Miller Webb, Haussler David
Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, USA.
PLoS Comput Biol. 2006 Apr;2(4):e33. doi: 10.1371/journal.pcbi.0020033. Epub 2006 Apr 21.
The discoveries of microRNAs and riboswitches, among others, have shown functional RNAs to be biologically more important and genomically more prevalent than previously anticipated. We have developed a general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functional RNAs encoded in the human genome and used it to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs. At a loose threshold for acceptance, this search resulted in a set of 48,479 candidate RNA structures. This screen finds a large number of known functional RNAs, including 195 miRNAs, 62 histone 3'UTR stem loops, and various types of known genetic recoding elements. Among the highest-scoring new predictions are 169 new miRNA candidates, as well as new candidate selenocysteine insertion sites, RNA editing hairpins, RNAs involved in transcript auto regulation, and many folds that form singletons or small functional RNA families of completely unknown function. While the rate of false positives in the overall set is difficult to estimate and is likely to be substantial, the results nevertheless provide evidence for many new human functional RNAs and present specific predictions to facilitate their further characterization.
微小RNA和核糖开关等的发现表明,功能性RNA在生物学上比以前预期的更为重要,在基因组中也更为普遍。我们开发了一种基于系统发育随机上下文无关文法的通用比较基因组学方法,用于识别人类基因组中编码的功能性RNA,并利用该方法在人类、黑猩猩、小鼠、大鼠、狗、鸡、斑马鱼和河豚基因组的全基因组八重比对中搜索深度保守的功能性RNA。在一个宽松的接受阈值下,该搜索产生了一组48479个候选RNA结构。这个筛选发现了大量已知的功能性RNA,包括195个微小RNA、62个组蛋白3'UTR茎环以及各种类型的已知基因重编码元件。得分最高的新预测包括169个新的微小RNA候选物,以及新的候选硒代半胱氨酸插入位点、RNA编辑发夹、参与转录自调控的RNA,以及许多形成单例或功能完全未知的小功能RNA家族的折叠结构。虽然总体假阳性率难以估计且可能很高,但这些结果仍然为许多新的人类功能性RNA提供了证据,并提出了具体的预测,以促进它们的进一步表征。