Jacq C, Alt-Mörbe J, Andre B, Arnold W, Bahr A, Ballesta J P, Bargues M, Baron L, Becker A, Biteau N, Blöcker H, Blugeon C, Boskovic J, Brandt P, Brückner M, Buitrago M J, Coster F, Delaveau T, del Rey F, Dujon B, Eide L G, Garcia-Cantalejo J M, Goffeau A, Gomez-Peris A, Zaccaria P
Laboratoire de Génétique Moléculaire, URA 1302 du CNRS, Ecole Normale Supérieure, Paris, France.
Nature. 1997 May 29;387(6632 Suppl):75-8.
The complete DNA sequence of the yeast Saccharomyces cerevisiae chromosome IV has been determined. Apart from chromosome XII, which contains the 1-2 Mb rDNA cluster, chromosome IV is the longest S. cerevisiae chromosome. It was split into three parts, which were sequenced by a consortium from the European Community, the Sanger Centre, and groups from St Louis and Stanford in the United States. The sequence of 1,531,974 base pairs contains 796 predicted or known genes, 318 (39.9%) of which have been previously identified. Of the 478 new genes, 225 (28.3%) are homologous to previously identified genes and 253 (32%) have unknown functions or correspond to spurious open reading frames (ORFs). On average there is one gene approximately every two kilobases. Superimposed on alternating regional variations in G+C composition, there is a large central domain with a lower G+C content that contains all the yeast transposon (Ty) elements and most of the tRNA genes. Chromosome IV shares with chromosomes II, V, XII, XIII and XV some long clustered duplications which partly explain its origin.
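The gene counts and percentages quoted above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch that uses only the figures stated in the abstract; the variable and function names are illustrative and do not come from the paper.

```python
# Figures as quoted in the abstract (names here are illustrative only).
SEQUENCE_BP = 1_531_974          # length of the chromosome IV sequence in base pairs
TOTAL_GENES = 796                # predicted or known genes
PREVIOUSLY_IDENTIFIED = 318      # genes identified before this work
NEW_HOMOLOGOUS = 225             # new genes homologous to known genes
NEW_UNKNOWN_OR_SPURIOUS = 253    # new genes of unknown function or spurious ORFs


def percent_of_total(count: int) -> float:
    """Return count as a percentage of all 796 genes."""
    return 100.0 * count / TOTAL_GENES


# The two categories of new genes should add up to the 478 new genes,
# and new plus previously identified genes should add up to the total.
new_genes = NEW_HOMOLOGOUS + NEW_UNKNOWN_OR_SPURIOUS
all_genes = PREVIOUSLY_IDENTIFIED + new_genes

# Average spacing: base pairs of sequence per gene.
bp_per_gene = SEQUENCE_BP / TOTAL_GENES

if __name__ == "__main__":
    print(f"new genes: {new_genes}, all genes: {all_genes}")
    for label, count in [
        ("previously identified", PREVIOUSLY_IDENTIFIED),
        ("new, homologous", NEW_HOMOLOGOUS),
        ("new, unknown or spurious", NEW_UNKNOWN_OR_SPURIOUS),
    ]:
        print(f"{label}: {percent_of_total(count):.2f}% of {TOTAL_GENES}")
    print(f"average spacing: {bp_per_gene:.0f} bp per gene")
```

The percentages in the abstract are all taken relative to the 796 total genes, not to the 478 new genes, and the average spacing of a little under 2 kb per gene is consistent with "one gene approximately every two kilobases".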