Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Cognitive and Mental Health Research Center, Beijing 100850, China.
Amino Acids. 2013 Feb;44(2):597-606. doi: 10.1007/s00726-012-1380-1. Epub 2012 Sep 4.
Mammalian bicistronic mRNA is a recently discovered mammalian gene structure. Several reported cases of mammalian bicistronic mRNA indicated that genes of this structure play roles in some important biological processes. However, a genome-wide computational identification of bicistronic mRNA in mammalian genome, such as human genome, is still lacking. Here we used a comparative genomics approach to identify the frequency of human bicistronic mRNA. We then validated the result by using a new support vector machine (SVM) model. We identified 43 human bicistronic mRNAs in 30 distinct genes. Our literature analysis shows that our method recovered 100 % (6/6) of the previously known bicistronic mRNAs which had been experimentally confirmed by other groups. Our graph theory-based analysis and GO analysis indicated that human bicistronic mRNAs are prone to produce different yet closely functionally related proteins. In addition, we also described and analyzed three different mechanisms of ORF fusion. Our method of identifying bicistronic mRNAs in human genome provides a model for the computational identification of characteristic gene structures in mammalian genomes. We anticipate that our data will facilitate further molecular characterization and functional study of human bicistronic mRNA.
哺乳动物双顺反子 mRNA 是最近发现的一种哺乳动物基因结构。已有几例报道表明,这种结构的基因在一些重要的生物学过程中发挥作用。然而,在哺乳动物基因组(如人类基因组)中进行全基因组范围的双顺反子 mRNA 的计算识别仍然缺乏。在这里,我们使用比较基因组学方法来识别人类双顺反子 mRNA 的频率。然后,我们使用新的支持向量机 (SVM) 模型验证了结果。我们在 30 个不同的基因中鉴定了 43 个人类双顺反子 mRNA。我们的文献分析表明,我们的方法恢复了之前已知的双顺反子 mRNA 的 100%(6/6),这些 mRNA 已经被其他小组通过实验证实。我们基于图论的分析和 GO 分析表明,人类双顺反子 mRNA 倾向于产生不同但密切相关的功能相关蛋白质。此外,我们还描述和分析了三种不同的 ORF 融合机制。我们在人类基因组中识别双顺反子 mRNA 的方法为哺乳动物基因组中特征基因结构的计算识别提供了一个模型。我们预计我们的数据将有助于进一步对人类双顺反子 mRNA 进行分子特征和功能研究。