Zhang Zhaolei, Gerstein Mark
Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520-8114, USA.
Genomics. 2003 May;81(5):468-80. doi: 10.1016/s0888-7543(03)00004-1.
The human nuclear genome encodes at least 79 mitochondrial ribosomal proteins (MRPs), which are imported into the mitochondria. Using a comprehensive approach, we find that 41 of these genes have given rise to 120 pseudogenes in the genome. The majority of the MRP pseudogenes are of processed origin and align with the entire coding region of the functional MRP mRNAs. One processed pseudogene was found to have originated from an alternatively spliced mRNA transcript. We also found two duplicated pseudogenes that are transcribed in the cell, as confirmed by screening the human EST database. We observed a significant negative correlation between the number of processed pseudogenes and the gene CDS length (R = -0.40; p < 0.001); that is, relatively shorter genes tend to have more processed pseudogenes. There is also a weaker correlation between the number of processed pseudogenes and the gene CDS GC content. Our study provides a catalogue of human MRP pseudogenes, which will be useful in the study of functional MRP genes. It also provides a molecular record of the evolution of these genes. More details are available at http://pseudogene.org/.
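The correlation analysis summarized above can be illustrated with a minimal sketch. It assumes a Pearson coefficient (the abstract does not say which coefficient was used), and the helper names `gc_content` and `pearson_r`, together with the toy values, are hypothetical: they are not the authors' code or data and are not meant to reproduce R = -0.40.

```python
from math import sqrt


def gc_content(seq: str) -> float:
    """Return the fraction of G or C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def pearson_r(xs, ys) -> float:
    """Return the Pearson correlation coefficient of two equal-length samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / sqrt(var_x * var_y)


# Made-up toy values for illustration only (NOT data from the study):
# CDS length in nucleotides and processed-pseudogene count per gene.
cds_lengths = [300, 450, 600, 900, 1200]
pseudogene_counts = [6, 4, 3, 1, 0]

r = pearson_r(cds_lengths, pseudogene_counts)
print(f"r = {r:.2f}")  # negative when shorter genes have more pseudogenes
```

The same `pearson_r` helper could be applied to per-gene values from `gc_content` to examine the weaker GC-content relationship; a significance test (the p-value) is outside the scope of this sketch.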