McClure Marcella A, Richardson Hugh S, Clinton Rochelle A, Hepp Crystal M, Crowther Brad A, Donaldson Eric F
Department of Microbiology and the Center for Computational Biology, Montana State University at Bozeman, 109 Lewis Hall, Bozeman, MT 59717, USA.
Genomics. 2005 Apr;85(4):512-23. doi: 10.1016/j.ygeno.2004.12.006.
Retroid agents are genomes that encode the reverse transcriptase (RT) and replicate by way of an RNA intermediate. Some retroid agents are implicated in disease via insertional mutagenesis, while others have been found to encode proteins essential to primate reproduction or provide regulatory sequences for host cell processes. The Genome Parsing Suite (GPS), a generic multistep automated process, was developed to characterize all RT-like sequences in the human genome database and to annotate the gene complement of the retroid agents that encode these sequences. In this report the GPS analyzes all significant WU-tBLASTn hits returned for 30 representative RT queries. A total of 128,779 unique RT signals were identified, and 7594 of these were retrieved by RTs not previously reported in the human genome. We have identified 9652 full-length long interspersed nuclear elements (LINEs). Only 159 LINEs are without stop codons or frameshifts.
逆转录元件是编码逆转录酶(RT)并通过RNA中间体进行复制的基因组。一些逆转录元件通过插入诱变与疾病有关,而其他一些则被发现编码灵长类动物繁殖所必需的蛋白质或为宿主细胞过程提供调控序列。基因组解析套件(GPS)是一个通用的多步骤自动化程序,旨在表征人类基因组数据库中所有类似RT的序列,并注释编码这些序列的逆转录元件的基因组成。在本报告中,GPS分析了针对30个代表性RT查询返回的所有显著的WU-tBLASTn命中结果。总共鉴定出128,779个独特的RT信号,其中7594个是由人类基因组中先前未报道的RT检索到的。我们已经鉴定出9652个全长的长散在核元件(LINEs)。只有159个LINEs没有终止密码子或移码。