Voet M, Defoor E, Verhasselt P, Riles L, Robben J, Volckaert G
Katholieke Universiteit Leuven, Laboratory of Gene Technology, Belgium.
Yeast. 1997 Feb;13(2):177-82. doi: 10.1002/(SICI)1097-0061(199702)13:2<177::AID-YEA62>3.0.CO;2-2.
The nucleotide sequence of 22,803 bp on the left arm of chromosome VII was determined by polymerase chain reaction-based approaches to compensate for the unstable character of cosmid clones from this region of the chromosome. The coding density of the sequence is particularly high (more than 83%). Twelve open reading frames (ORFs) longer than 300 bp were found, two of which (at the left side) have been described previously (James et al., 1995) after sequencing of an overlapping cosmid. Four other ORFs correspond to published sequences of the known genes ARO2, RPL9A, TIP1 and MRF1. ARO2 codes for chorismate synthetase. RPL9A for protein L9 of the large ribosomal subunit and MRF1 for a mitochondrial translation release factor. The TIP1 product interacts with Sec20p and is thus involved in transport from endoplasmic reticulum to Golgi. Five of the remaining ORFs have not been identified previously, while the sixth (YGL142c) has been partially sequenced as it lies 5' upstream of MRF1. These six ORFs are relatively large (between 933 and 3657 nucleotides). YGL146c, YGL142c, YGL140c and YGL139w have no significant homology to any protein sequence presently available in the public databases, but show two, nine, nine and eight putative transmembrane spans, respectively. YGL144c has a serine active site signature of lipases. YGL141w has limited homology to several human proteins, one of which mediates complex formation between papillomavirus E6 oncoprotein and tumor suppressor protein p53.
通过基于聚合酶链反应的方法测定了VII号染色体左臂上22,803 bp的核苷酸序列,以弥补来自该染色体区域的黏粒克隆的不稳定特性。该序列的编码密度特别高(超过83%)。发现了12个长度超过300 bp的开放阅读框(ORF),其中两个(在左侧)在对一个重叠黏粒进行测序后,先前已有描述(James等人,1995年)。另外四个ORF与已知基因ARO2、RPL9A、TIP1和MRF1的已发表序列相对应。ARO2编码分支酸合成酶。RPL9A编码大核糖体亚基的L9蛋白,MRF1编码线粒体翻译释放因子。TIP1产物与Sec20p相互作用,因此参与从内质网到高尔基体的转运。其余的ORF中有五个以前未被鉴定,而第六个(YGL142c)由于位于MRF1的5'上游已进行了部分测序。这六个ORF相对较大(在933至3657个核苷酸之间)。YGL146c、YGL142c、YGL140c和YGL139w与公共数据库中目前可用的任何蛋白质序列均无显著同源性,但分别显示出两个、九个、九个和八个推定的跨膜结构域。YGL144c具有脂肪酶的丝氨酸活性位点特征。YGL141w与几种人类蛋白质有有限的同源性,其中一种介导乳头瘤病毒E6癌蛋白与肿瘤抑制蛋白p53之间的复合物形成。