Huang Yong, Gu Xun
Department of Genetics, Development, and Cell Biology, Center for Bioinformatics and Biological Statistics, Iowa State University, Ames, IA 50011, USA.
BMC Genomics. 2007 Mar 6;8:66. doi: 10.1186/1471-2164-8-66.
Phylogenetically related miRNAs (miRNA families) convey important information of the function and evolution of miRNAs. Due to the special sequence features of miRNAs, pair-wise sequence identity between miRNA precursors alone is often inadequate for unequivocally judging the phylogenetic relationships between miRNAs. Most of the current methods for miRNA classification rely heavily on manual inspection and lack measurements of the reliability of the results.
In this study, we designed an analysis pipeline (the Phylogeny-Bootstrap-Cluster (PBC) pipeline) to identify miRNA families based on branch stability in the bootstrap trees derived from overlapping genome-wide miRNA sequence sets. We tested the PBC analysis pipeline with the miRNAs from six animal species, H. sapiens, M. musculus, G. gallus, D. rerio, D. melanogaster, and C. elegans. The resulting classification was compared with the miRNA families defined in miRBase. The two classifications were largely consistent.
The PBC analysis pipeline is an efficient method for classifying large numbers of heterogeneous miRNA sequences. It requires minimum human involvement and provides measurements of the reliability of the classification results.
系统发育相关的微小RNA(miRNA家族)传达了miRNA功能和进化的重要信息。由于miRNA具有特殊的序列特征,仅miRNA前体之间的成对序列同一性往往不足以明确判断miRNA之间的系统发育关系。当前大多数miRNA分类方法严重依赖人工检查,且缺乏对结果可靠性的衡量。
在本研究中,我们设计了一种分析流程(系统发育-自展-聚类(PBC)流程),基于从全基因组重叠miRNA序列集得出的自展树中的分支稳定性来识别miRNA家族。我们用来自六种动物物种(智人、小家鼠、原鸡、斑马鱼、黑腹果蝇和秀丽隐杆线虫)的miRNA测试了PBC分析流程。将所得分类结果与miRBase中定义的miRNA家族进行比较。两种分类在很大程度上是一致的。
PBC分析流程是一种用于对大量异质miRNA序列进行分类的有效方法。它所需的人工干预最少,并提供了分类结果可靠性的衡量标准。