Suppr超能文献

人类基因组中内源性逆转录病毒阅读框的鉴定。

Identification of endogenous retroviral reading frames in the human genome.

作者信息

Villesen Palle, Aagaard Lars, Wiuf Carsten, Pedersen Finn Skou

机构信息

Bioinformatics Research Center, University of Aarhus, Høegh-Guldbergs Gade 10, Bldg. 090, DK-8000 Aarhus, Denmark.

出版信息

Retrovirology. 2004 Oct 11;1:32. doi: 10.1186/1742-4690-1-32.

Abstract

BACKGROUND

Human endogenous retroviruses (HERVs) comprise a large class of repetitive retroelements. Most HERVs are ancient and invaded our genome at least 25 million years ago, except for the evolutionary young HERV-K group. The far majority of the encoded genes are degenerate due to mutational decay and only a few non-HERV-K loci are known to retain intact reading frames. Additional intact HERV genes may exist, since retroviral reading frames have not been systematically annotated on a genome-wide scale.

RESULTS

By clustering of hits from multiple BLAST searches using known retroviral sequences we have mapped 1.1% of the human genome as retrovirus related. The coding potential of all identified HERV regions were analyzed by annotating viral open reading frames (vORFs) and we report 7836 loci as verified by protein homology criteria. Among 59 intact or almost-intact viral polyproteins scattered around the human genome we have found 29 envelope genes including two novel gammaretroviral types. One encodes a protein similar to a recently discovered zebrafish retrovirus (ZFERV) while another shows partial, C-terminal, homology to Syncytin (HERV-W/FRD).

CONCLUSIONS

This compilation of HERV sequences and their coding potential provide a useful tool for pursuing functional analysis such as RNA expression profiling and effects of viral proteins, which may, in turn, reveal a role for HERVs in human health and disease. All data are publicly available through a database at http://www.retrosearch.dk.

摘要

背景

人类内源性逆转录病毒(HERV)构成了一大类重复的逆转录元件。除了进化上较年轻的HERV-K组外,大多数HERV很古老,至少在2500万年前就侵入了我们的基因组。由于突变衰减,绝大多数编码基因都已退化,只有少数非HERV-K位点已知保留完整的阅读框。由于逆转录病毒阅读框尚未在全基因组范围内进行系统注释,可能还存在其他完整的HERV基因。

结果

通过使用已知逆转录病毒序列对多次BLAST搜索的命中结果进行聚类,我们将1.1%的人类基因组定位为与逆转录病毒相关。通过注释病毒开放阅读框(vORF)分析了所有已识别的HERV区域的编码潜力,我们报告了7836个经蛋白质同源性标准验证的位点。在分散于人类基因组中的59个完整或几乎完整的病毒多蛋白中,我们发现了29个包膜基因,包括两种新型γ逆转录病毒类型。其中一个编码的蛋白质类似于最近发现的斑马鱼逆转录病毒(ZFERV),另一个在C末端与合胞素(HERV-W/FRD)有部分同源性。

结论

HERV序列及其编码潜力的汇编为进行功能分析(如RNA表达谱分析和病毒蛋白的作用)提供了一个有用的工具,这反过来可能揭示HERV在人类健康和疾病中的作用。所有数据均可通过http://www.retrosearch.dk的数据库公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47b1/524368/815c9167d5b2/1742-4690-1-32-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验