Timouma Soukaina, Schwartz Jean-Marc, Delneri Daniela
Manchester Institute of Biotechnology, Faculty of Biology Medicine and Health, University of Manchester, M1 7DN Manchester, UK.
Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology Medicine and Health, University of Manchester, M13 9PT Manchester, UK.
Microorganisms. 2020 Oct 9;8(10):1554. doi: 10.3390/microorganisms8101554.
Genome-scale computational approaches are opening opportunities to model and predict favorable combination of traits for strain development. However, mining the genome of complex hybrids is not currently an easy task, due to the high level of redundancy and presence of homologous. For example, is an allopolyploid sterile yeast hybrid used in brewing to produce lager-style beers. The development of new yeast strains with valuable industrial traits such as improved maltose utilization or balanced flavor profiles are now a major ambition and challenge in craft brewing and distilling industries. Moreover, no genome annotation for most of these industrial strains have been published. Here, we developed HybridMine, a new user-friendly, open-source tool for functional annotation of hybrid aneuploid genomes of any species by predicting parental alleles including paralogs. Our benchmark studies showed that HybridMine produced biologically accurate results for hybrid genomes compared to other well-established software. As proof of principle, we carried out a comprehensive structural and functional annotation of complex yeast hybrids to enable system biology prediction studies. HybridMine is developed in Python, Perl, and Bash programming languages and is available in GitHub.
全基因组规模的计算方法为菌株开发中性状的建模和预测有利组合带来了机遇。然而,由于高度的冗余性和同源序列的存在,目前挖掘复杂杂种的基因组并非易事。例如,[此处原文缺失具体例子]是一种用于酿造拉格风格啤酒的异源多倍体不育酵母杂种。开发具有宝贵工业性状(如提高麦芽糖利用率或平衡风味特征)的新酵母菌株,如今是精酿啤酒和蒸馏行业的一项主要目标和挑战。此外,大多数这些工业菌株的基因组注释尚未发表。在此,我们开发了HybridMine,这是一种全新的用户友好型开源工具,可通过预测包括旁系同源物在内的亲本等位基因,对任何物种的杂种非整倍体基因组进行功能注释。我们的基准研究表明,与其他成熟软件相比,HybridMine对杂种基因组产生了生物学上准确的结果。作为原理验证,我们对复杂酵母杂种进行了全面的结构和功能注释,以开展系统生物学预测研究。HybridMine是用Python、Perl和Bash编程语言开发的,可在GitHub上获取。