Miller Ian J, Chevrette Marc G, Kwan Jason C
Pharmaceutical Sciences Division, School of Pharmacy, University of Wisconsin-Madison, Madison, WI 53705, USA.
Department of Genetics, Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA.
Mar Drugs. 2017 Jun 6;15(6):165. doi: 10.3390/md15060165.
Genome mining has become an increasingly powerful, scalable, and economically accessible tool for the study of natural product biosynthesis and drug discovery. However, there remain important biological and practical problems that can complicate or obscure biosynthetic analysis in genomic and metagenomic sequencing projects. Here, we focus on limitations of available technology as well as computational and experimental strategies to overcome them. We review the unique challenges and approaches in the study of symbiotic and uncultured systems, as well as those associated with biosynthetic gene cluster (BGC) assembly and product prediction. Finally, to explore sequencing parameters that affect the recovery and contiguity of large and repetitive BGCs assembled , we simulate Illumina and PacBio sequencing of the genome focusing on assembly of the salinilactam () BGC.
基因组挖掘已成为研究天然产物生物合成和药物发现的一种日益强大、可扩展且经济上可及的工具。然而,仍然存在一些重要的生物学和实际问题,这些问题可能会使基因组和宏基因组测序项目中的生物合成分析变得复杂或模糊不清。在这里,我们关注现有技术的局限性以及克服这些局限性的计算和实验策略。我们回顾了共生和未培养系统研究中的独特挑战和方法,以及与生物合成基因簇(BGC)组装和产物预测相关的挑战和方法。最后,为了探索影响大型重复BGC组装的回收率和连续性的测序参数,我们模拟了Illumina和PacBio对该基因组的测序,重点是盐内酰胺()BGC的组装。