Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.
RNA Biol. 2009 Apr-Jun;6(2):195-202. doi: 10.4161/rna.6.2.8050. Epub 2009 May 1.
Transfer RNA (tRNA) gene predictions are complicated by challenges such as structural variation, limited sequence conservation and the presence of highly reiterated short interspersed sequences (SINEs) that originally derived from tRNA genes or tRNA-like transcription units. Annotation of "tRNA genes" in sequenced genomes generally have not been accompanied by experimental verification of the expression status of predicted sequences.
To address this for mouse tRNA genes, we have employed two programs, tRNAScan-SE and ARAGORN, to predict the tRNA genes in the nuclear genome, resulting in diverse but overlapping predicted gene sets. From these, we removed known SINE repeats and sorted the genes into predicted families and single-copy genes. In particular, four families of intron-containing tRNA genes were predicted for the first time in mouse, with introns in positions and structures similar to the well characterized intron-containing tRNA genes in yeast. We verified the expression of the predicted tRNA genes by microarray analysis. We then confirmed the expression of appropriately sized RNA for the four intron-containing tRNA gene families, as well as the other 30 tRNA gene families creating an index of expression-verified mouse tRNAs.
These confirmed tRNA genes represent all anticodons and all known mammalian tRNA structural groups, as well as a variety of predicted "rogue" tRNA genes within families with altered anticodon identities.
转移 RNA(tRNA)基因预测受到结构变异、有限的序列保守性以及高度重复的短散在序列(SINEs)的存在等挑战的影响,这些 SINEs最初来自 tRNA 基因或 tRNA 样转录单元。在测序基因组中,“tRNA 基因”的注释通常没有伴随对预测序列表达状态的实验验证。
为了解决小鼠 tRNA 基因的这个问题,我们使用了两个程序 tRNAScan-SE 和 ARAGORN 来预测核基因组中的 tRNA 基因,从而产生了不同但重叠的预测基因集。从中,我们去除了已知的 SINE 重复,并将基因分类为预测的家族和单拷贝基因。特别是,在小鼠中首次预测了四个包含内含子的 tRNA 基因家族,内含子的位置和结构与酵母中特征明确的内含子 tRNA 基因相似。我们通过微阵列分析验证了预测的 tRNA 基因的表达。然后,我们确认了四个内含子 tRNA 基因家族以及其他 30 个 tRNA 基因家族的适当大小的 RNA 的表达,从而创建了一个表达验证的小鼠 tRNA 索引。
这些经过验证的 tRNA 基因代表了所有反密码子和所有已知的哺乳动物 tRNA 结构群,以及在具有改变的反密码子身份的家族内的各种预测的“流氓”tRNA 基因。