Sapolsky R J, Brendel V, Karlin S
Department of Mathematics, Stanford University, CA 94305-2125.
Yeast. 1993 Dec;9(12):1287-98. doi: 10.1002/yea.320091202.
The recently published sequence of yeast chromosome III (YCIII) provides the longest continuous stretch of a eukaryotic DNA molecule sequenced to date (315 kb). The sequence contains 116 distinct AUG-initiated open reading frames of at least 200 codons in length, more than 50 of which had not been described previously nor bear significant similarity to known proteins. We have analysed the YCIII known and putative protein sequences with respect to significant statistical features which might reflect on structural and functional characteristics. The YCIII proteins have striking similarities and differences in their sequence attribute distributions compared to the corresponding distributions for all available yeast sequences and other protein collections. Nine examples of YCIII proteins with distinctive sequence features are discussed in detail.
最近发表的酵母三号染色体(YCIII)序列是迄今为止测序的最长的真核生物DNA分子连续片段(315 kb)。该序列包含116个不同的由AUG起始的开放阅读框,长度至少为200个密码子,其中50多个此前未曾描述过,也与已知蛋白质没有显著相似性。我们针对可能反映结构和功能特征的重要统计特征,分析了YCIII已知的和推测的蛋白质序列。与所有可用酵母序列及其他蛋白质集合的相应分布相比,YCIII蛋白质在其序列属性分布上具有显著的相似性和差异。详细讨论了九个具有独特序列特征的YCIII蛋白质实例。