Cole S T, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon S V, Eiglmeier K, Gas S, Barry C E, Tekaia F, Badcock K, Basham D, Brown D, Chillingworth T, Connor R, Davies R, Devlin K, Feltwell T, Gentles S, Hamlin N, Holroyd S, Hornsby T, Jagels K, Krogh A, McLean J, Moule S, Murphy L, Oliver K, Osborne J, Quail M A, Rajandream M A, Rogers J, Rutter S, Seeger K, Skelton J, Squares R, Squares S, Sulston J E, Taylor K, Whitehead S, Barrell B G
Sanger Centre, Wellcome Trust Genome Campus, Hinxton, UK.
Nature. 1998 Jun 11;393(6685):537-44. doi: 10.1038/31159.
Countless millions of people have died from tuberculosis, a chronic infectious disease caused by the tubercle bacillus. The complete genome sequence of the best-characterized strain of Mycobacterium tuberculosis, H37Rv, has been determined and analysed in order to improve our understanding of the biology of this slow-growing pathogen and to help the conception of new prophylactic and therapeutic interventions. The genome comprises 4,411,529 base pairs, contains around 4,000 genes, and has a very high guanine + cytosine content that is reflected in the biased amino-acid content of the proteins. M. tuberculosis differs radically from other bacteria in that a very large portion of its coding capacity is devoted to the production of enzymes involved in lipogenesis and lipolysis, and to two new families of glycine-rich proteins with a repetitive structure that may represent a source of antigenic variation.
数以百万计的人死于肺结核,这是一种由结核杆菌引起的慢性传染病。为了增进我们对这种生长缓慢的病原体生物学特性的理解,并有助于构思新的预防和治疗干预措施,已对结核分枝杆菌特征最明确的菌株H37Rv的全基因组序列进行了测定和分析。该基因组由4411529个碱基对组成,包含约4000个基因,鸟嘌呤+胞嘧啶含量非常高,这反映在蛋白质的氨基酸含量偏差上。结核分枝杆菌与其他细菌有根本区别,因为其很大一部分编码能力用于参与脂肪生成和脂肪分解的酶的产生,以及用于两个具有重复结构的富含甘氨酸的新蛋白质家族,这可能是抗原变异的一个来源。