Pavlícek Adam, Paces Jan, Zíka Radek, Hejnar Jirí
Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo nam. 2, Prague 6, CZ-16637, Czech Republic
Gene. 2002 Oct 30;300(1-2):189-94. doi: 10.1016/s0378-1119(02)01047-8.
Deciphering the human genome includes reliable identification and structural characterization of individual retrotransposon elements. The most active group of autonomous transposable elements, the long interspersed nuclear elements (LINE), transpose themselves as well as other RNAs, including those of human endogenous retroviruses (HERV). During this transposition, however, the LINE-encoded reverse transcriptase (RT) often abortively dissociates from the RNA template, leaving a prematurely terminated, 5' truncated copy. We have analyzed the length distributions of LINEs and of processed pseudogenes derived from HERV-W. As expected, we have found that the majority of 5' truncated LINEs and HERV-W processed pseudogenes show a prevalence of very short elements terminated close to the 3' end. On the other hand, the number of complete elements is far above the expectation. The characteristic distribution in both cases indicates two important conclusions: (i) dissociation of LINE RT from the template cannot be fully explained by low processivity of RT modelled as a stochastic, Poisson-type process. (ii) Currently cited numbers of pseudogenes within the human genome are underestimated, since a large percentage of pseudogenes are terminated in the 3' untranslated region and remain undetectable in translated homology searches of protein databases against the human genome.
解析人类基因组包括对单个逆转录转座子元件进行可靠的识别和结构表征。最活跃的自主转座元件组,即长散在核元件(LINE),不仅能自我转座,还能使包括人类内源性逆转录病毒(HERV)的RNA在内的其他RNA转座。然而,在这种转座过程中,LINE编码的逆转录酶(RT)常常从RNA模板上过早解离,留下一个提前终止的、5'端截短的拷贝。我们分析了LINE以及源自HERV-W的加工假基因的长度分布。正如预期的那样,我们发现大多数5'端截短的LINE和HERV-W加工假基因显示出大量非常短的元件,这些元件在靠近3'端处终止。另一方面,完整元件的数量远远高于预期。这两种情况下的特征性分布表明了两个重要结论:(i)LINE RT从模板上的解离不能完全用将RT模拟为随机泊松型过程时的低持续合成能力来解释。(ii)目前人类基因组中引用的假基因数量被低估了,因为很大一部分假基因在3'非翻译区终止,在针对人类基因组的蛋白质数据库的翻译同源性搜索中仍然无法检测到。