Nelson D R, Kamataki T, Waxman D J, Guengerich F P, Estabrook R W, Feyereisen R, Gonzalez F J, Coon M J, Gunsalus I C, Gotoh O
Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill 27599.
DNA Cell Biol. 1993 Jan-Feb;12(1):1-51. doi: 10.1089/dna.1993.12.1.
We provide here a list of 221 P450 genes and 12 putative pseudogenes that have been characterized as of December 14, 1992. These genes have been described in 31 eukaryotes (including 11 mammalian and 3 plant species) and 11 prokaryotes. Of 36 gene families so far described, 12 families exist in all mammals examined to date. These 12 families comprise 22 mammalian subfamilies, of which 17 and 15 have been mapped in the human and mouse genomes, respectively. To date, each subfamily appears to represent a cluster of tightly linked genes. This revision supersedes the previous updates [Nebert et al., DNA 6, 1-11, 1987; Nebert et al., DNA 8, 1-13, 1989; Nebert et al., DNA Cell Biol. 10, 1-14, 1991], in which a nomenclature system based on divergent evolution of the superfamily was described. For the gene and cDNA, we recommend that the italicized root symbol "CYP" for human ("Cyp" for mouse), representing "cytochrome P450," be followed by an Arabic numeral denoting the family, a letter designating the subfamily (when two or more exist), and an Arabic numeral representing the individual gene within the subfamily. A hyphen should precede the final number in mouse genes. "P" ("p" in mouse) after the gene number denotes a pseudogene. If a gene is the sole member of a family, the subfamily letter and gene number need not be included. We suggest that the human nomenclature system be used for all species other than mouse. The mRNA and enzyme in all species (including mouse) should include all capital letters, without italics or hyphens. This nomenclature system is identical to that proposed in our 1991 update. Also included in this update is a listing of available database accession numbers for P450 DNA and protein sequences. We also discuss the likelihood that this ancient gene superfamily has existed for more than 3.5 billion years, and that the rate of P450 gene evolution appears to be quite nonlinear. Finally, we describe P450 genes that have been detected by expressed sequence tags (ESTs), as well as the relationship between the P450 and the nitric oxide synthase gene superfamilies, as a likely example of convergent evolution.
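The naming rules recommended in the abstract can be illustrated with a short Python sketch. It is not part of the original article: the `P450Gene` class and the functions `gene_symbol` and `product_symbol` are hypothetical names introduced here. Lowercasing the subfamily letter in mouse gene symbols is an assumption inferred from the "Cyp" root and is not stated in the abstract. Italics cannot be represented in a plain string and are ignored.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class P450Gene:
    """Hypothetical container for the parts of a P450 gene name."""
    family: int                      # Arabic numeral denoting the family
    subfamily: Optional[str] = None  # letter; omitted for a sole family member
    number: Optional[int] = None     # gene within the subfamily
    pseudogene: bool = False
    mouse: bool = False              # all other species use the human system


def gene_symbol(g: P450Gene) -> str:
    """Build the gene/cDNA symbol following the rules in the abstract."""
    if g.pseudogene and g.number is None:
        raise ValueError("the pseudogene marker follows a gene number")
    symbol = ("Cyp" if g.mouse else "CYP") + str(g.family)
    if g.subfamily is not None:
        # Assumption: mouse symbols use a lowercase subfamily letter.
        symbol += g.subfamily.lower() if g.mouse else g.subfamily.upper()
    if g.number is not None:
        if g.mouse:
            symbol += "-"            # hyphen precedes the final number in mouse
        symbol += str(g.number)
        if g.pseudogene:
            symbol += "p" if g.mouse else "P"
    return symbol


def product_symbol(g: P450Gene) -> str:
    """mRNA and enzyme names: all capitals, no hyphens, in every species."""
    return gene_symbol(g).replace("-", "").upper()


if __name__ == "__main__":
    print(gene_symbol(P450Gene(2, "D", 6)))                 # human gene
    print(gene_symbol(P450Gene(1, "A", 2, mouse=True)))     # mouse gene
    print(product_symbol(P450Gene(1, "A", 2, mouse=True)))  # mouse enzyme
    print(gene_symbol(P450Gene(19)))                        # sole family member
```

Under these assumptions, a human gene and its mouse counterpart get different gene symbols, while their mRNA and enzyme names are identical.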