Comprehensive sequence analysis of the 182 predicted open reading frames of yeast chromosome III.

Authors

Bork P, Ouzounis C, Sander C, Scharf M, Schneider R, Sonnhammer E

Affiliation

European Molecular Biology Laboratory, Heidelberg, Germany.

Publication

Protein Sci. 1992 Dec;1(12):1677-90. doi: 10.1002/pro.5560011216.

DOI:10.1002/pro.5560011216

PMID:1304897

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2142145/

Abstract

With the completion of the first phase of the European yeast genome sequencing project, the complete DNA sequence of chromosome III of Saccharomyces cerevisiae has become available (Oliver, S. G., et al., 1992, Nature 357, 38-46). We have tested the predictive power of computer sequence analysis of the 176 probable protein products of this chromosome, after exclusion of six problem cases. When the results of database similarity searches are pooled with prior knowledge, a likely function can be assigned to 42% of the proteins, and a predicted three-dimensional structure to a third of these (14% of the total). The function of the remaining 58% remains to be determined. Of these, about one-third have one or more probable transmembrane segments. Among the most interesting proteins with predicted functions are a new member of the type X polymerase family, a transcription factor with an N-terminal DNA-binding domain related to GAL4, a "fork head" DNA-binding domain previously known only in Drosophila and in mammals, and a putative methyltransferase. Our analysis increased the number of known significant sequence similarities on chromosome III by 13, to now 67. Although the near 40% success rate of identifying unknown protein function by sequence analysis is surprisingly high, the information gap between known protein sequences and unknown function is expected to widen and become a major bottleneck of genome projects in the near future. Based on the experience gained in this test study, we suggest that the development of an automated computer workbench for protein sequence analysis must be an important item in genome projects.
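The figures quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is not from the paper: the variable names are illustrative, and the rounded protein counts are approximations derived from the quoted percentages, since the abstract itself gives only percentages.

```python
# Figures stated in the abstract.
predicted_orfs = 182
excluded_problem_cases = 6
analyzed = predicted_orfs - excluded_problem_cases  # 176 probable protein products

function_share = 0.42               # share with a likely function assigned
structure_share = function_share / 3  # "a third of these", about 14% of the total
unknown_share = 1 - function_share    # about 58% with function still unknown

# Approximate counts implied by the percentages (an assumption: the abstract
# reports percentages only, so these rounded values are estimates).
approx_with_function = round(analyzed * function_share)
approx_with_structure = round(analyzed * structure_share)
approx_unknown = analyzed - approx_with_function
approx_transmembrane = round(approx_unknown / 3)  # "about one-third" of the unknowns

# Significant sequence similarities: 13 new ones bring the total to 67.
similarities_before = 67 - 13

print(f"analyzed proteins: {analyzed}")
print(f"likely function: ~{approx_with_function} ({function_share:.0%})")
print(f"predicted 3D structure: ~{approx_with_structure} ({structure_share:.0%})")
print(f"function unknown: ~{approx_unknown} ({unknown_share:.0%})")
print(f"unknown with transmembrane segments: ~{approx_transmembrane}")
print(f"similarities known before this analysis: {similarities_before}")
```

The check confirms that the numbers are internally consistent: 182 minus 6 gives the 176 analyzed proteins, and one third of 42% gives the quoted 14%.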

Similar articles

Comprehensive sequence analysis of the 182 predicted open reading frames of yeast chromosome III.

Protein Sci. 1992 Dec;1(12):1677-90. doi: 10.1002/pro.5560011216.

Sequence of a 12.7 kb segment of yeast chromosome II identifies a PDR-like gene and several new open reading frames.

Yeast. 1992 Sep;8(9):761-8. doi: 10.1002/yea.320080909.

[CHL15--a new gene controlling the replication of chromosomes in saccharomycetes yeast: cloning, physical mapping, sequencing, and sequence analysis].

Mol Biol (Mosk). 1993 May-Jun;27(3):569-88.

Nucleotide sequence and analysis of the centromeric region of yeast chromosome IX.

Yeast. 1995 Jan;11(1):61-78. doi: 10.1002/yea.320110109.

The sequence of a 36 kb segment on the left arm of yeast chromosome X identifies 24 open reading frames including NUC1, PRP21 (SPP91), CDC6, CRY2, the gene for S24, a homologue to the aconitase gene ACO1 and two homologues to chromosome III genes.

Yeast. 1994 Sep;10(9):1235-49. doi: 10.1002/yea.320100912.

Organization of the centromeric region of chromosome XIV in Saccharomyces cerevisiae.

Yeast. 1994 Apr;10(4):523-33. doi: 10.1002/yea.320100412.

A new essential gene located on Saccharomyces cerevisiae chromosome IX.

Yeast. 1995 Jul;11(9):885-90. doi: 10.1002/yea.320110910.

The sequence of a 13.5 kb DNA segment from the left arm of yeast chromosome XIV reveals MER1; RAP1; a new putative member of the DNA replication complex and a new putative serine/threonine phosphatase gene.

Yeast. 1995 Jan;11(1):85-91. doi: 10.1002/yea.320110111.

The complete sequence of a 6146 bp fragment of Saccharomyces cerevisiae chromosome III contains two new open reading frames.

Yeast. 1992 Jul;8(7):569-75. doi: 10.1002/yea.320080708.

Analysis of an 11.7 kb DNA fragment of chromosome XI reveals a new tRNA gene and four new open reading frames including a leucine zipper protein and a homologue to the yeast mitochondrial regulator ABF2.

Yeast. 1994 Jan;10(1):125-30. doi: 10.1002/yea.320100112.

Cited by

20 years of the SMART protein domain annotation resource.

Nucleic Acids Res. 2018 Jan 4;46(D1):D493-D496. doi: 10.1093/nar/gkx922.

Requirement of POL3 and POL4 on non-homologous and microhomology-mediated end joining in rad50/xrs2 mutants of Saccharomyces cerevisiae.

Mutagenesis. 2015 Nov;30(6):841-9. doi: 10.1093/mutage/gev046. Epub 2015 Jun 29.

Solving the Problem: Genome Annotation Standards before the Data Deluge.

Stand Genomic Sci. 2011 Oct 15;5(1):168-93. doi: 10.4056/sigs.2084864. Epub 2011 Oct 1.

DNA polymerase 4 of Saccharomyces cerevisiae is important for accurate repair of methyl-methanesulfonate-induced DNA damage.

Genetics. 2006 Jan;172(1):89-98. doi: 10.1534/genetics.105.049254. Epub 2005 Oct 11.

The past, present and future of genome-wide re-annotation.

Genome Biol. 2002;3(2):COMMENT2001. doi: 10.1186/gb-2002-3-2-comment2001. Epub 2002 Jan 31.

Spb1p is a yeast nucleolar protein associated with Nop1p and Nop58p that is able to bind S-adenosyl-L-methionine in vitro.

Mol Cell Biol. 2000 Feb;20(4):1370-81. doi: 10.1128/MCB.20.4.1370-1381.2000.

Fowlpox virus encodes nonessential homologs of cellular alpha-SNAP, PC-1, and an orphan human homolog of a secreted nematode protein.

J Virol. 1998 Aug;72(8):6742-51. doi: 10.1128/JVI.72.8.6742-6751.1998.

The Pol beta-14 dominant negative rat DNA polymerase beta mutator mutant commits errors during the gap-filling step of base excision repair in Saccharomyces cerevisiae.

J Bacteriol. 1998 May;180(9):2292-7. doi: 10.1128/JB.180.9.2292-2297.1998.

PWP2, a member of the WD-repeat family of proteins, is an essential Saccharomyces cerevisiae gene involved in cell separation.

Mol Gen Genet. 1996 Aug 27;252(1-2):101-14. doi: 10.1007/BF02173210.

Dominant negative rat DNA polymerase beta mutants interfere with base excision repair in Saccharomyces cerevisiae.

J Bacteriol. 1996 Feb;178(3):656-61. doi: 10.1128/jb.178.3.656-661.1996.

References

Identification of common molecular subsequences.

J Mol Biol. 1981 Mar 25;147(1):195-7. doi: 10.1016/0022-2836(81)90087-5.

An improved algorithm for matching biological sequences.

J Mol Biol. 1982 Dec 15;162(3):705-8. doi: 10.1016/0022-2836(82)90398-9.

A simple method for displaying the hydropathic character of a protein.

J Mol Biol. 1982 May 5;157(1):105-32. doi: 10.1016/0022-2836(82)90515-0.

Proteins.

Sci Am. 1985 Oct;253(4):88-99. doi: 10.1038/scientificamerican1085-88.

Structural principles of parallel beta-barrels in proteins.

Proc Natl Acad Sci U S A. 1988 May;85(10):3338-42. doi: 10.1073/pnas.85.10.3338.

Yeast gene SRP1 (serine-rich protein). Intragenic repeat structure and identification of a family of SRP1-related DNA sequences.

J Mol Biol. 1988 Aug 5;202(3):455-70. doi: 10.1016/0022-2836(88)90278-1.

Nucleotide sequence characterization of Ty 1-17, a class II transposon from yeast.

Nucleic Acids Res. 1985 Sep 25;13(18):6679-93. doi: 10.1093/nar/13.18.6679.

Pilin expression in Neisseria gonorrhoeae is under both positive and negative transcriptional control.

EMBO J. 1988 Dec 20;7(13):4367-78. doi: 10.1002/j.1460-2075.1988.tb03335.x.

Sequence motifs characteristic of DNA[cytosine-N4]methyltransferases: similarity to adenine and cytosine-C5 DNA-methylases.

Nucleic Acids Res. 1989 Dec 11;17(23):9823-32. doi: 10.1093/nar/17.23.9823.

Sequence of the D-aspartyl/L-isoaspartyl protein methyltransferase from human erythrocytes. Common sequence motifs for protein, DNA, RNA, and small molecule S-adenosylmethionine-dependent methyltransferases.

J Biol Chem. 1989 Nov 25;264(33):20131-9.
