CAGECAT:比较基因簇分析工具箱,用于快速搜索和可视化同源基因簇。
CAGECAT: The CompArative GEne Cluster Analysis Toolbox for rapid search and visualisation of homologous gene clusters.
机构信息
Bioinformatics Group, Wageningen University and Research, 6708PB, Wageningen, The Netherlands.
School of Molecular Sciences, The University of Western Australia, Crawley, WA, 6009, Australia.
出版信息
BMC Bioinformatics. 2023 May 3;24(1):181. doi: 10.1186/s12859-023-05311-2.
BACKGROUND
Co-localized sets of genes that encode specialized functions are common across microbial genomes and occur in genomes of larger eukaryotes as well. Important examples include Biosynthetic Gene Clusters (BGCs) that produce specialized metabolites with medicinal, agricultural, and industrial value (e.g. antimicrobials). Comparative analysis of BGCs can aid in the discovery of novel metabolites by highlighting distribution and identifying variants in public genomes. Unfortunately, gene-cluster-level homology detection remains inaccessible, time-consuming and difficult to interpret.
RESULTS
The comparative gene cluster analysis toolbox (CAGECAT) is a rapid and user-friendly platform to mitigate difficulties in comparative analysis of whole gene clusters. The software provides homology searches and downstream analyses without the need for command-line or programming expertise. By leveraging remote BLAST databases, which always provide up-to-date results, CAGECAT can yield relevant matches that aid in the comparison, taxonomic distribution, or evolution of an unknown query. The service is extensible and interoperable and implements the cblaster and clinker pipelines to perform homology search, filtering, gene neighbourhood estimation, and dynamic visualisation of resulting variant BGCs. With the visualisation module, publication-quality figures can be customized directly from a web-browser, which greatly accelerates their interpretation via informative overlays to identify conserved genes in a BGC query.
CONCLUSION
Overall, CAGECAT is an extensible software that can be interfaced via a standard web-browser for whole region homology searches and comparison on continually updated genomes from NCBI. The public web server and installable docker image are open source and freely available without registration at: https://cagecat.bioinformatics.nl .
背景
编码专门功能的基因在微生物基因组中是共同的,在较大的真核生物基因组中也存在。重要的例子包括生物合成基因簇 (BGCs),它们产生具有医学、农业和工业价值的特殊代谢物(例如抗生素)。BGCs 的比较分析可以通过突出分布和识别公共基因组中的变体来帮助发现新的代谢物。不幸的是,基因簇级别的同源性检测仍然难以访问、耗时且难以解释。
结果
比较基因簇分析工具箱 (CAGECAT) 是一个快速且用户友好的平台,可以减轻整个基因簇比较分析的困难。该软件提供同源搜索和下游分析,而无需命令行或编程专业知识。通过利用远程 BLAST 数据库(始终提供最新结果),CAGECAT 可以产生相关的匹配项,有助于未知查询的比较、分类分布或进化。该服务是可扩展和互操作的,并实现了 cblaster 和 clinker 管道,以执行同源搜索、过滤、基因邻域估计以及生成的变体 BGC 的动态可视化。通过可视化模块,可以直接从网络浏览器自定义出版质量的图形,通过信息丰富的叠加来识别 BGC 查询中的保守基因,从而大大加快了它们的解释。
结论
总的来说,CAGECAT 是一个可扩展的软件,可以通过标准网络浏览器进行整个区域的同源搜索,并在 NCBI 上持续更新的基因组上进行比较。公共网络服务器和可安装的 docker 映像都是开源的,无需注册即可免费获得:https://cagecat.bioinformatics.nl。