CMBI, NCMLS, Radboud University Medical Centre, Geert Grooteplein 28, 6525 GA Nijmegen, The Netherlands.
Brief Funct Genomics. 2013 Jul;12(4):366-80. doi: 10.1093/bfgp/elt008. Epub 2013 Apr 26.
Complete or draft genome sequences are increasingly available for microbial organisms. These data form a potentially valuable resource for genotype-phenotype association and gene function prediction, provided that phenotypes are consistently annotated for all the sequenced strains. In this review, we address the requirements for successful gene-trait matching. We outline a basic protocol for microbial functional genomics, covering genome assembly, annotation of genotypes (including single nucleotide polymorphisms, orthologous groups and prophages), data pre-processing, genotype-phenotype association, and visualization and interpretation of results. The association methodologies described herein can be applied to other data types, opening up possibilities to analyze transcriptome-phenotype associations and to correlate microbial population structure or activity, as measured by metagenomics, with environmental parameters.
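As an illustration of the genotype-phenotype association step, the following is a minimal sketch of gene-trait matching. It assumes genotypes are encoded as presence or absence of orthologous groups per strain and the phenotype is binary. The abstract does not name a specific statistical method, so the use of Fisher's exact test here is an assumption, as are the function names (`fisher_exact_two_sided`, `gene_trait_matching`) and the toy strains and genes, which are hypothetical and not data from the review.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    # The small tolerance guards against floating-point ties.
    return sum(
        p for p in (prob(x) for x in range(lo, hi + 1))
        if p <= p_obs * (1 + 1e-9)
    )


def gene_trait_matching(presence, phenotype):
    """Test each gene for association with a binary phenotype.

    presence:  dict mapping gene -> set of strains carrying the gene
    phenotype: dict mapping strain -> bool (trait present or absent)
    Returns a list of (gene, p_value) tuples sorted by ascending p-value.
    """
    results = []
    for gene, carriers in presence.items():
        a = sum(1 for s, t in phenotype.items() if s in carriers and t)
        b = sum(1 for s, t in phenotype.items() if s in carriers and not t)
        c = sum(1 for s, t in phenotype.items() if s not in carriers and t)
        d = sum(1 for s, t in phenotype.items() if s not in carriers and not t)
        results.append((gene, fisher_exact_two_sided(a, b, c, d)))
    return sorted(results, key=lambda r: r[1])


# Hypothetical toy data: eight strains, four of which show the trait.
phenotype = {f"s{i}": i <= 4 for i in range(1, 9)}
presence = {
    "geneA": {"s1", "s2", "s3", "s4"},   # present only in trait-positive strains
    "geneB": {"s1", "s5"},               # unrelated to the trait
    "geneC": set(phenotype),             # core gene, present in every strain
}

results = gene_trait_matching(presence, phenotype)
for gene, p in results:
    print(f"{gene}\t{p:.4f}")
```

In a real analysis with thousands of genes, the resulting p-values would need correction for multiple testing, and shared ancestry between strains can produce spurious associations; neither is handled in this sketch.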