Sokol Martin, Jessen Karen Margrethe, Pedersen Finn Skou
Department of Molecular Biology and Genetics, Aarhus University, Aarhus, DK-8000, Denmark.
Retrovirology. 2015 Apr 17;12:32. doi: 10.1186/s12977-015-0161-9.
Evidence suggests that some human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively ERVs) regulate the expression of neighboring genes in normal and disease states; e.g. the human globin locus is regulated by an ERV9 that coordinates long-range gene switching during hematopoiesis and activates also intergenic transcripts. While complex transcription regulation is associated with integration of certain exogenous retroviruses, comparable regulation sustained by ERVs is less understood.
We analyzed ERV transcription using ERV9 consensus sequences and publically available RNA-sequencing, chromatin immunoprecipitation with sequencing (ChIP-seq) and cap analysis gene expression (CAGE) data from ENCODE. We discovered previously undescribed and advanced transcription regulation mechanisms in several human reference cell lines. We show that regulation by ERVs involves long-ranging activations including complex RNA splicing patterns, and transcription of large unannotated regions ranging in size from several hundred kb to around 1 Mb. Moreover, regulation was found to be cooperatively sustained in some loci by multiple ERVs and also non-LTR repeats.
Our analyses show that endogenous retroviruses sustain advanced transcription regulation in human cell lines, which shows similarities to complex insertional mutagenesis effects exerted by exogenous retroviruses. By exposing previously undescribed regulation effects, this study should prove useful for understanding fundamental transcription mechanisms resulting from evolutionary acquisition of retroviral sequence in the human genome.
有证据表明,一些人类内源性逆转录病毒和内源性逆转录病毒样重复序列(此处统称为ERVs)在正常和疾病状态下调节邻近基因的表达;例如,人类珠蛋白基因座受ERV9调控,ERV9在造血过程中协调长距离基因转换,并激活基因间转录本。虽然复杂的转录调控与某些外源性逆转录病毒的整合有关,但对ERVs所维持的类似调控了解较少。
我们使用ERV9共有序列以及来自ENCODE的公开可用RNA测序、染色质免疫沉淀测序(ChIP-seq)和基因表达帽分析(CAGE)数据,分析了ERV转录。我们在几种人类参考细胞系中发现了以前未描述的先进转录调控机制。我们表明,ERVs的调控涉及长距离激活,包括复杂的RNA剪接模式,以及大小从几百kb到约1Mb不等的大量未注释区域的转录。此外,发现在某些基因座中,多种ERVs以及非LTR重复序列协同维持调控。
我们的分析表明,内源性逆转录病毒在人类细胞系中维持先进的转录调控,这与外源性逆转录病毒产生的复杂插入诱变效应相似。通过揭示以前未描述的调控效应,本研究应有助于理解人类基因组中逆转录病毒序列进化获得所产生的基本转录机制。