Koes R E, Spelt C E, van den Elzen P J, Mol J N
Department of Genetics, Free University, Amsterdam, The Netherlands.
Gene. 1989 Sep 30;81(2):245-57. doi: 10.1016/0378-1119(89)90185-6.
Chalcone synthase-encoding genes (chs) in Petunia hybrida comprise a multigene family. Some of the chs genes have been grouped into a subfamily, based upon their strong cross-hybridization and tight genomic linkage. From genomic libraries eight 'complete' chs genes, two chs gene 5'-fragments and two chs gene 3'-fragments have been isolated. The nucleotide sequence of six complete chs genes is presented and discussed in relation to their evolutionary origin and expression in different tissues. Each member of the family consists of two exons separated by an intron of variable size and sequence, which is located at a conserved position. The chs gene fragments represent single exons. Homology between non-linked chs genes is approx. 80% at the DNA level and restricted to protein-coding sequences. Homology between subfamily members (which are tightly linked) is higher (90-99%) and extends into untranslated regions of the gene, strengthening the view that they arose by recent gene duplications. The chsD gene contains a mutated translation stop codon, suggesting that this is an inactive (pseudo)gene. None of the other members of the gene family exhibits characteristics of a pseudogene, indicating that if gene inactivation has occurred during their evolution, it must characteristics of a pseudogene, indicating that if gene inactivation has occurred during their evolution, it must have been a recent event. Homology at the protein level between some (expressed) chs genes is surprisingly low. The possibility that these genes encode proteins with slightly different enzymatic activities is discussed.
矮牵牛中查尔酮合酶编码基因(chs)构成一个多基因家族。基于一些chs基因强烈的交叉杂交和紧密的基因组连锁关系,其中一些基因已被归为一个亚家族。从基因组文库中分离出了8个“完整的”chs基因、2个chs基因5'端片段和2个chs基因3'端片段。本文给出了6个完整chs基因的核苷酸序列,并讨论了它们的进化起源以及在不同组织中的表达情况。该家族的每个成员都由两个外显子组成,中间被一个大小和序列可变的内含子隔开,内含子位于保守位置。chs基因片段代表单个外显子。非连锁的chs基因之间在DNA水平上的同源性约为80%,且仅限于蛋白质编码序列。亚家族成员(紧密连锁)之间的同源性更高(90 - 99%),并延伸到基因的非翻译区,这进一步支持了它们是通过近期基因复制产生的观点。chsD基因包含一个突变的翻译终止密码子,表明这是一个无活性的(假)基因。基因家族的其他成员均未表现出假基因的特征,这表明如果在它们的进化过程中发生了基因失活,那一定是最近发生的事件。一些(已表达的)chs基因在蛋白质水平上的同源性出奇地低。文中讨论了这些基因编码具有略有不同酶活性的蛋白质的可能性。