Hadasch R P, Bugert J J, Janssen W, Darai G
Institut für Medizinische Virologie, Universität Heidelberg, FRG.
Intervirology. 1993;36(1):32-43. doi: 10.1159/000150319.
The complete DNA nucleotide sequence of a HindIII/MluI genomic DNA fragment (0.045-0.075 viral map units) from molluscum contagiosum virus type 1 (MCV-1) was determined. The HindIII/MluI DNA fragment comprises 5,646 bp with a base composition of 64.4% G + C and 35.6% A + T. The DNA sequence contains many perfect direct repeats. A cluster of three repetitive DNA elements R1, R2 and R3, with a complex structural arrangement was detected between nucleotide positions 1802 and 2107. The unit length (box) of the repetitive DNA sequences was found to be 6 bp (15 boxes) and 9 bp (24 boxes) for R1 and R2, respectively. The repetitive DNA element R3 is organized in fifteen boxes (15 bp) in which a unit length of R1 is combined with a unit length of R2. The arrangement of the repetition R3 within the DNA sequences of this particular region of the MCV-1 genome was found to be (5 x R3) + (2 x R2) + (1 x R3) + (6 x R2) + (1 x R3) + (1 x R2) + (8 x R3). Twenty-three open reading frames (ORFs) of 60-1,175 amino acid (AA) residues were detected. The largest ORF (number 17) comprises 1,175 AA with a predicted molecular weight of 126 kD. This ORF harbors a promoter signal which is located 21 nucleotides upstream from the start codon and is very similar to the early promoter signals known for vaccinia virus. This putative protein contains glutamine-enriched regions between AA residues 427 and 682 which show homologies to the corresponding glutamine-enriched regions of a variety of cellular genes like human transcriptional initiation factor (TFIID: TATA box factor).
测定了1型传染性软疣病毒(MCV-1)HindIII/MluI基因组DNA片段(病毒图谱单位0.045 - 0.075)的完整DNA核苷酸序列。HindIII/MluI DNA片段包含5646 bp,碱基组成为64.4%的G + C和35.6%的A + T。该DNA序列包含许多完美的直接重复序列。在核苷酸位置1802和2107之间检测到一组由三个重复DNA元件R1、R2和R3组成的、具有复杂结构排列的序列。发现重复DNA序列的单位长度(框)对于R1和R2分别为6 bp(15个框)和9 bp(24个框)。重复DNA元件R3由十五个框(15 bp)组成,其中R1的一个单位长度与R2的一个单位长度组合在一起。发现MCV-1基因组这一特定区域的DNA序列中重复序列R3的排列为(5×R3)+(2×R2)+(1×R3)+(6×R2)+(1×R3)+(1×R2)+(8×R3)。检测到23个开放阅读框(ORF),其氨基酸(AA)残基数量在60 - 1175之间。最大的ORF(编号17)包含1175个AA,预测分子量为126 kD。该ORF含有一个启动子信号,位于起始密码子上游21个核苷酸处,与痘苗病毒已知的早期启动子信号非常相似。这个推定的蛋白质在AA残基427和682之间含有富含谷氨酰胺的区域,这些区域与多种细胞基因(如人类转录起始因子(TFIID:TATA框因子))的相应富含谷氨酰胺的区域具有同源性。