Starcevic Antonio, Zucko Jurica, Simunkovic Jurica, Long Paul F, Cullum John, Hranueli Daslav
Faculty of Food Technology and Biotechnology, University of Zagreb, Zagreb, Croatia.
Nucleic Acids Res. 2008 Dec;36(21):6882-92. doi: 10.1093/nar/gkn685. Epub 2008 Oct 31.
The program package 'ClustScan' (Cluster Scanner) is designed for rapid, semi-automatic annotation of DNA sequences encoding modular biosynthetic enzymes, including polyketide synthases (PKS), non-ribosomal peptide synthetases (NRPS) and hybrid (PKS/NRPS) enzymes. The program displays the predicted chemical structures of the products and allows the structures to be exported in a standard format for analysis with other programs. Recent advances in the understanding of enzyme function are incorporated to make knowledge-based predictions about the stereochemistry of products. The program structure allows easy incorporation of additional knowledge about domain specificities and function. The results of analyses are presented in a graphical interface, which also allows the user to edit the predictions easily in the light of their own experience. The versatility of the package has been demonstrated by annotating biochemical pathways in microbial, invertebrate animal and metagenomic datasets. The speed and convenience of the package allow all PKS and NRPS clusters in a complete Actinobacteria genome to be annotated in 2-3 man-hours. The open architecture of ClustScan allows easy integration with other programs, facilitating further analysis of the results, which makes it useful to a broad range of researchers in the chemical and biological sciences.