Edman J C, Hatton T W, Nam M, Turner R, Mei Q, Angus C W, Kovacs J A
Department of Laboratory Medicine, University of California, San Francisco 94143-0626, USA.
DNA Cell Biol. 1996 Nov;15(11):989-99. doi: 10.1089/dna.1996.15.989.
The major surface glycoprotein (MSG) of Pneumocystis carinii is encoded by a family of related but distinct genes distributed throughout the P. carinii genome. Previous reports on the structure of MSG genomic and mRNA sequences suggested a highly conserved 5'-untranslated region and a highly variable translated region. In the current study, we demonstrate that MSG is expressed from a single expression site and that different MSG genes are located downstream of this site. Isolation of a genomic clone containing the putative 5'-untranslated region demonstrated a single-base sequencing error in what had been considered the untranslated region. The corrected sequence reveals an extended open reading frame encoding a constant amino-terminal leader domain, with a typical signal peptide, for the MSG protein family. Since this constant amino-terminal domain is encoded by a single-copy genomic sequence, a recombination/gene conversion-mediated antigenic switching event is required to produce the known variability in expressed MSG sequences. Therefore, like some bacterial and protozoan pathogens, the opportunistic fungal pathogen P. carinii contains a constant genomic site dedicated to MSG expression and a switchable downstream region for the variable part of the MSG gene family.