InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, B-4000 Liège, Belgium.
Microbial Processes and Interactions, TERRA Teaching and Research Centre, Joint Research Unit BioEcoAgro UMRT 1158, Gembloux Agro-Bio Tech, University of Liège, B-5030 Gembloux, Belgium.
Bioinformatics. 2020 Aug 1;36(15):4345-4347. doi: 10.1093/bioinformatics/btaa517.
To support small and large-scale genome mining projects, we present Post-processing Analysis tooLbox for ANTIsmash Reports (Palantir), a dedicated software suite for handling and refining secondary metabolite biosynthetic gene cluster (BGC) data annotated with the popular antiSMASH pipeline. Palantir provides new functionalities building on NRPS/PKS predictions from antiSMASH, such as improved BGC annotation, module delineation and easy access to sub-sequences at different levels (cluster, gene, module and domain). Moreover, it can parse user-provided antiSMASH reports and reformat them for direct use or storage in a relational database.
Palantir is released both as a Perl API available on CPAN (https://metacpan.org/release/Bio-Palantir) and as a web application (http://palantir.uliege.be). As a practical use case, the web interface also features a database built from the mining of 1616 cyanobacterial genomes, of which 1488 were predicted to encode at least one BGC.
Supplementary data are available at Bioinformatics online.
为了支持小规模和大规模的基因组挖掘项目,我们提出了 Post-processing Analysis tooLbox for ANTIsmash Reports (Palantir),这是一个专门用于处理和优化带有流行 antiSMASH 管道注释的次级代谢生物合成基因簇 (BGC) 数据的软件套件。Palantir 基于 antiSMASH 的 NRPS/PKS 预测提供了新的功能,例如改进的 BGC 注释、模块划分以及轻松访问不同级别(簇、基因、模块和域)的子序列。此外,它可以解析用户提供的 antiSMASH 报告,并将其重新格式化,以便直接使用或存储在关系数据库中。
Palantir 既作为可在 CPAN 上获得的 Perl API(https://metacpan.org/release/Bio-Palantir)发布,也作为网络应用程序(http://palantir.uliege.be)发布。作为一个实际用例,网络界面还具有一个从 1616 个蓝藻基因组挖掘构建的数据库,其中 1488 个被预测至少编码一个 BGC。
补充数据可在 Bioinformatics 在线获得。