Pan Shu, Nikolakakis Kiel, Adamczyk Paul A, Pan Min, Ruby Edward G, Reed Jennifer L
From the Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706.
the School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China, and.
J Biol Chem. 2017 Jun 16;292(24):10250-10261. doi: 10.1074/jbc.M116.763193. Epub 2017 Apr 26.
Whereas genomes can be rapidly sequenced, the functions of many genes are incompletely or erroneously annotated because of a lack of experimental evidence or prior functional knowledge in sequence databases. To address this weakness, we describe here a odel-nabled ene earch (MEGS) approach that (i) identifies metabolic functions either missing from an organism's genome annotation or incorrectly assigned to an ORF by using discrepancies between metabolic model predictions and experimental culturing data; (ii) designs functional selection experiments for these specific metabolic functions; and (iii) selects a candidate gene(s) responsible for these functions from a genomic library and directly interrogates this gene's function experimentally. To discover gene functions, MEGS uses genomic functional selections instead of relying on correlations across large experimental datasets or sequence similarity as do other approaches. When applied to the bioluminescent marine bacterium , MEGS successfully identified five genes that are responsible for four metabolic and transport reactions whose absence from a draft metabolic model of caused inaccurate modeling of high-throughput experimental data. This work demonstrates that MEGS provides a rapid and efficient integrated computational and experimental approach for annotating metabolic genes, including those that have previously been uncharacterized or misannotated.
虽然基因组可以快速测序,但由于缺乏实验证据或序列数据库中的先前功能知识,许多基因的功能注释不完整或错误。为了解决这一弱点,我们在此描述一种基于模型的基因搜索(MEGS)方法,该方法(i)通过利用代谢模型预测与实验培养数据之间的差异,识别生物体基因组注释中缺失的代谢功能或错误分配给开放阅读框(ORF)的代谢功能;(ii)为这些特定的代谢功能设计功能选择实验;(iii)从基因组文库中选择负责这些功能的候选基因,并通过实验直接探究该基因的功能。为了发现基因功能,MEGS使用基因组功能选择,而不是像其他方法那样依赖于大型实验数据集之间的相关性或序列相似性。当应用于发光海洋细菌时,MEGS成功鉴定出五个负责四种代谢和转运反应的基因,这些反应在的代谢模型草图中缺失导致了高通量实验数据的不准确建模。这项工作表明,MEGS为注释代谢基因提供了一种快速有效的综合计算和实验方法,包括那些以前未被表征或错误注释的基因。