Papazisi Leka, Gorton Timothy S, Kutish Gerald, Markham Philip F, Browning Glenn F, Nguyen Di Kim, Swartzell Steven, Madan Anup, Mahairas Greg, Geary Steven J
Department of Pathobiology and Veterinary Science, The University of Connecticut, Storrs, CT 06269-3089, USA.
Center of Excellence for Vaccine Research, The University of Connecticut, Storrs, CT 06269-3089, USA.
Microbiology (Reading). 2003 Sep;149(Pt 9):2307-2316. doi: 10.1099/mic.0.26427-0.
The complete genome of Mycoplasma gallisepticum strain R(low) has been sequenced. The genome is composed of 996,422 bp with an overall G+C content of 31 mol%. It contains 742 putative coding DNA sequences (CDSs), representing a 91 % coding density. Function has been assigned to 469 of the CDSs, while 150 encode conserved hypothetical proteins and 123 remain as unique hypothetical proteins. The genome contains two copies of the rRNA genes and 33 tRNA genes. The origin of replication has been localized based on sequence analysis in the region of the dnaA gene. The vlhA family (previously termed pMGA) contains 43 genes distributed among five loci containing 8, 2, 9, 12 and 12 genes. This family of genes constitutes 10.4% (103 kb) of the total genome. Two CDSs were identified immediately downstream of gapA and crmA encoding proteins that share homology to cytadhesins GapA and CrmA. Based on motif analysis it is predicted that 80 genes encode lipoproteins and 149 proteins contain multiple transmembrane domains. The authors have identified 75 proteins putatively involved in transport of biomolecules, 12 transposases, and a number of potential virulence factors. The completion of this sequence has spawned multiple projects directed at defining the biological basis of M. gallisepticum.
鸡毒支原体R(低)株的全基因组已被测序。该基因组由996,422个碱基对组成,总体G+C含量为31摩尔%。它包含742个推定的编码DNA序列(CDS),编码密度为91%。已为469个CDS赋予了功能,150个编码保守的假定蛋白,123个仍为独特的假定蛋白。基因组包含rRNA基因的两个拷贝和33个tRNA基因。基于dnaA基因区域的序列分析确定了复制起点。vlhA家族(以前称为pMGA)包含43个基因,分布在五个位点,分别含有8、2、9、12和12个基因。该基因家族占总基因组的10.4%(103 kb)。在gapA和crmA下游立即鉴定出两个CDS,它们编码与细胞粘附素GapA和CrmA具有同源性的蛋白。基于基序分析,预测有80个基因编码脂蛋白,149个蛋白含有多个跨膜结构域。作者鉴定出75个可能参与生物分子转运的蛋白、12个转座酶和一些潜在的毒力因子。该序列的完成催生了多个旨在确定鸡毒支原体生物学基础的项目。