Centre of Molecular and Environmental Biology (CBMA), Department of Biology, University of Minho, Braga, Portugal.
Yeast. 2009 Dec;26(12):675-92. doi: 10.1002/yea.1728.
Within this study, we have used a set of computational techniques to relate the genotypes and phenotypes of natural populations of Saccharomyces cerevisiae, using allelic information from 11 microsatellite loci and results from 24 phenotypic tests. A group of 103 strains was obtained from a larger S. cerevisiae winemaking strain collection by clustering with self-organizing maps. These strains were further characterized regarding their allelic combinations for 11 microsatellites and analysed in phenotypic screens that included taxonomic criteria (carbon and nitrogen assimilation tests, growth at different temperatures) and tests with biotechnological relevance (ethanol resistance, H(2)S or aromatic precursors formation). Phenotypic variability was rather high and each strain showed a unique phenotypic profile. The results, expressed as optical density (A(640)) after 22 h of growth, were in agreement with taxonomic data, although with some exceptions, since few strains were capable of consuming arabinose and ribose to a small extent. Based on microsatellite allelic information, naïve Bayesian classifier correctly assigned (AUC = 0.81, p < 10(-8)) most of the strains to the vineyard from where they were isolated, despite their close location (50-100 km). We also identified subgroups of strains with similar values of a phenotypic feature and microsatellite allelic pattern (AUC > 0.75). Subgroups were found for strains with low ethanol resistance, growth at 30 degrees C and growth in media containing galactose, raffinose or urea. The results demonstrate that computational approaches can be used to establish genotype-phenotype relations and to make predictions about a strain's biotechnological potential.
在这项研究中,我们使用了一组计算技术来关联酿酒酵母自然种群的基因型和表型,使用了来自 11 个微卫星位点的等位基因信息和 24 个表型测试的结果。通过自组织映射聚类,从一个更大的酿酒酵母酿造菌株集中获得了一组 103 株菌株。这些菌株进一步根据其 11 个微卫星的等位基因组合进行了特征描述,并在表型筛选中进行了分析,包括分类学标准(碳和氮同化测试、不同温度下的生长)和具有生物技术相关性的测试(乙醇抗性、H2S 或芳香前体形成)。表型变异性相当高,每个菌株都表现出独特的表型特征。以 22 小时生长后的光密度(A640)表示的结果与分类学数据一致,尽管存在一些例外,因为很少有菌株能够少量消耗阿拉伯糖和核糖。基于微卫星等位基因信息,朴素贝叶斯分类器正确地将(AUC=0.81,p<10(-8))大多数菌株分配到它们分离的葡萄园,尽管它们的位置很近(50-100 公里)。我们还确定了具有相似表型特征和微卫星等位基因模式(AUC>0.75)的菌株亚组。对于乙醇抗性低、30°C 生长和在含有半乳糖、棉子糖或尿素的培养基中生长的菌株,发现了亚组。结果表明,计算方法可用于建立基因型-表型关系,并对菌株的生物技术潜力进行预测。