Biology Department, Brookhaven National Laboratory, Upton, New York 11973, USA; email:
Departments of Plant and Microbial Biology and Molecular and Cell Biology, University of California, Berkeley, California 94720, USA.
Annu Rev Plant Biol. 2019 Apr 29;70:605-638. doi: 10.1146/annurev-arplant-050718-095841. Epub 2019 Mar 1.
Over 100 whole-genome sequences from algae are published or soon to be published. The rapidly increasing availability of these fundamental resources is changing how we understand one of the most diverse, complex, and understudied groups of photosynthetic eukaryotes. Genome sequences provide a window into the functional potential of individual algae, with phylogenomics and functional genomics as tools for contextualizing and transferring knowledge from reference organisms into less well-characterized systems. Remarkably, over half of the proteins encoded by algal genomes are of unknown function, highlighting the volume of functional capabilities yet to be discovered. In this review, we provide an overview of publicly available algal genomes, their associated protein inventories, and their quality, with a summary of the statuses of protein function understanding and predictions.
已有超过 100 个藻类的全基因组序列发表或即将发表。这些基础资源的快速增加正在改变我们对最具多样性、最复杂和研究最少的光合真核生物群体之一的理解方式。基因组序列为了解单个藻类的功能潜力提供了一个窗口,系统发生基因组学和功能基因组学作为工具,可以将参考生物的知识置于上下文中,并将其转移到研究较少的系统中。值得注意的是,藻类基因组编码的蛋白质中超过一半的功能未知,这突出了有待发现的功能能力的数量。在这篇综述中,我们概述了可公开获得的藻类基因组、它们相关的蛋白质目录及其质量,并总结了对蛋白质功能理解和预测的现状。