Reva Oleg, Tümmler Burkhard
Biochemistry Department, University of Pretoria, Lynnwood Road, Hillcrest, 0002 Pretoria, South Africa.
Environ Microbiol. 2008 Mar;10(3):768-77. doi: 10.1111/j.1462-2920.2007.01500.x.
Long genes should be rare in archaea and eubacteria because of the demanding costs of time and resources for protein production. A search of 580 sequenced prokaryotic genomes, however, revealed 0.2% of all genes to be longer than 5 kb (absolute number: 3732 genes). Eighty giant bacterial genes of more than 20 kb in length were identified in 47 taxa that belong to the phyla Thermotogae (1), Chlorobi (3), Planctomycetes (1), Cyanobacteria (2), Firmicutes (7), Actinobacteria (9), Proteobacteria (23) or Euryarchaeota (1) (number of taxa in brackets). Giant genes are strain-specific, differ in their tetranucleotide usage from the bulk genome and occur preferentially in non-pathogenic environmental bacteria. The two longest bacterial genes known to date were detected in the green sulfur bacterium Chlorobium chlorochromatii CaD3; they encode proteins of 36 806 and 20 647 amino acids and are surpassed in length only by the human titin coding sequence. More than 90% of bacterial giant genes encode either a surface protein or a polyketide/non-ribosomal peptide synthetase. Most surface proteins are acidic and threonine-rich, lack cysteine and harbour multiple amino acid repeats. Giant proteins increase bacterial fitness by providing either weapons against animate competitors or shields against hostile environments.