• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pyPGCF:一个用于系统发育基因组分析、物种划分、细菌基因组核心和指纹蛋白识别的 Python 软件,这些蛋白对植物很重要。

pyPGCF: A Python Software for Phylogenomic Analysis, Species Demarcation, Identification of Core, and Fingerprint Proteins of Bacterial Genomes That Are Important for Plants.

机构信息

Bioinformatics Laboratory, Department of Biochemistry and Biotechnology, University of Thessaly, Biopolis, Greece.

Department of Biochemistry, University of Cambridge, Cambridge, UK.

出版信息

Methods Mol Biol. 2024;2788:139-155. doi: 10.1007/978-1-0716-3782-1_8.

DOI:10.1007/978-1-0716-3782-1_8
PMID:38656512
Abstract

This computational protocol describes how to use pyPGCF, a python software package that runs in the linux environment, in order to analyze bacterial genomes and perform: (i) phylogenomic analysis, (ii) species demarcation, (iii) identification of the core proteins of a bacterial genus and its individual species, (iv) identification of species-specific fingerprint proteins that are found in all strains of a species and, at the same time, are absent from all other species of the genus, (v) functional annotation of the core and fingerprint proteins with eggNOG, and (vi) identification of secondary metabolite biosynthetic gene clusters (smBGCs) with antiSMASH. This software has already been implemented to analyze bacterial genera and species that are important for plants (e.g., Pseudomonas, Bacillus, Streptomyces). In addition, we provide a test dataset and example commands showing how to analyze 165 genomes from 55 species of the genus Bacillus. The main advantages of pyPGCF are that: (i) it uses adjustable orthology cut-offs, (ii) it identifies species-specific fingerprints, and (iii) its computational cost scales linearly with the number of genomes being analyzed. Therefore, pyPGCF is able to deal with a very large number of bacterial genomes, in reasonable timescales, using widely available levels of computing power.

摘要

本计算方案描述了如何使用 pyPGCF,这是一个在 Linux 环境下运行的 Python 软件包,用于分析细菌基因组并执行以下操作:(i)系统发育基因组分析,(ii)物种划分,(iii)鉴定细菌属的核心蛋白及其各个物种,(iv)鉴定在该物种的所有菌株中都存在且同时不存在于该属的所有其他物种中的物种特异性指纹蛋白,(v)用 eggNOG 对核心和指纹蛋白进行功能注释,以及(vi)用 antiSMASH 鉴定次生代谢物生物合成基因簇(smBGCs)。该软件已经被用于分析对植物重要的细菌属和种(例如,假单胞菌、芽孢杆菌、链霉菌)。此外,我们提供了一个测试数据集和示例命令,展示了如何分析来自 55 种芽孢杆菌属的 165 个基因组。pyPGCF 的主要优点是:(i)它使用可调整的同源性截止值,(ii)它鉴定物种特异性指纹,以及(iii)其计算成本与被分析的基因组数量呈线性比例。因此,pyPGCF 能够在合理的时间内,使用广泛可用的计算能力处理大量的细菌基因组。

相似文献

1
pyPGCF: A Python Software for Phylogenomic Analysis, Species Demarcation, Identification of Core, and Fingerprint Proteins of Bacterial Genomes That Are Important for Plants.pyPGCF:一个用于系统发育基因组分析、物种划分、细菌基因组核心和指纹蛋白识别的 Python 软件,这些蛋白对植物很重要。
Methods Mol Biol. 2024;2788:139-155. doi: 10.1007/978-1-0716-3782-1_8.
2
A Guide to Phylogenomic Inference.系统发育基因组推断指南。
Methods Mol Biol. 2024;2802:267-345. doi: 10.1007/978-1-0716-3838-5_11.
3
UBCG: Up-to-date bacterial core gene set and pipeline for phylogenomic tree reconstruction.UBCG:用于系统发育树重建的最新细菌核心基因集和管道。
J Microbiol. 2018 Apr;56(4):280-285. doi: 10.1007/s12275-018-8014-6. Epub 2018 Feb 28.
4
zDB: bacterial comparative genomics made easy.zDB:轻松进行细菌比较基因组学研究。
mSystems. 2024 Jul 23;9(7):e0047324. doi: 10.1128/msystems.00473-24. Epub 2024 Jun 28.
5
A panoramic view of the genomic landscape of the genus .属的基因组景观全景图。
Microb Genom. 2023 Jun;9(6). doi: 10.1099/mgen.0.001028.
6
panX: pan-genome analysis and exploration.panX:泛基因组分析与探索。
Nucleic Acids Res. 2018 Jan 9;46(1):e5. doi: 10.1093/nar/gkx977.
7
The complete genomic sequence of Streptomyces spectabilis NRRL-2792 and identification of secondary metabolite biosynthetic gene clusters.壮观链霉菌 NRRL-2792 的全基因组序列及次级代谢产物生物合成基因簇的鉴定。
J Ind Microbiol Biotechnol. 2019 Aug;46(8):1217-1223. doi: 10.1007/s10295-019-02172-8. Epub 2019 Jun 13.
8
Step-by-Step Bacterial Genome Comparison.分步式细菌基因组比较。
Methods Mol Biol. 2024;2802:107-134. doi: 10.1007/978-1-0716-3838-5_5.
9
ITEP: an integrated toolkit for exploration of microbial pan-genomes.ITEP:用于探索微生物泛基因组的集成工具包。
BMC Genomics. 2014 Jan 3;15:8. doi: 10.1186/1471-2164-15-8.
10
Estimating Pangenomes with Roary.用 Roary 估计泛基因组。
Mol Biol Evol. 2020 Mar 1;37(3):933-939. doi: 10.1093/molbev/msz284.

本文引用的文献

1
A Comparative Analysis of the Core Proteomes within and among the and Evolutionary Groups Reveals the Patterns of Lineage- and Species-Specific Adaptations.对[具体名称未给出]进化组内和组间核心蛋白质组的比较分析揭示了谱系特异性和物种特异性适应模式。
Microorganisms. 2022 Aug 26;10(9):1720. doi: 10.3390/microorganisms10091720.
2
The Notable Achievements and the Prospects of Bacterial Pathogen Genomics.细菌病原体基因组学的显著成就与前景
Microorganisms. 2022 May 17;10(5):1040. doi: 10.3390/microorganisms10051040.
3
eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale.
eggNOG-mapper v2:宏基因组尺度的功能注释、直系同源物分配和结构域预测。
Mol Biol Evol. 2021 Dec 9;38(12):5825-5829. doi: 10.1093/molbev/msab293.
4
antiSMASH 6.0: improving cluster detection and comparison capabilities.antiSMASH 6.0:提高簇检测和比较能力。
Nucleic Acids Res. 2021 Jul 2;49(W1):W29-W35. doi: 10.1093/nar/gkab335.
5
Sensitive protein alignments at tree-of-life scale using DIAMOND.使用 DIAMOND 进行生命之树尺度上的敏感蛋白质比对。
Nat Methods. 2021 Apr;18(4):366-368. doi: 10.1038/s41592-021-01101-x. Epub 2021 Apr 7.
6
IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era.IQ-TREE 2:基因组时代系统发育推断的新模型和有效方法。
Mol Biol Evol. 2020 May 1;37(5):1530-1534. doi: 10.1093/molbev/msaa015.
7
OrthoFinder: phylogenetic orthology inference for comparative genomics.OrthoFinder:用于比较基因组学的系统发育直系同源推断。
Genome Biol. 2019 Nov 14;20(1):238. doi: 10.1186/s13059-019-1832-y.
8
OMA standalone: orthology inference among public and custom genomes and transcriptomes.OMA 独立版:公共和定制基因组和转录组之间的同源推断。
Genome Res. 2019 Jul;29(7):1152-1163. doi: 10.1101/gr.243212.118. Epub 2019 Jun 24.
9
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.高通量 ANI 分析 9 万余组原核基因组揭示了清晰的物种界限。
Nat Commun. 2018 Nov 30;9(1):5114. doi: 10.1038/s41467-018-07641-9.
10
eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses.eggNOG 5.0:一个基于 5090 个生物体和 2502 种病毒的层次化、功能和系统发育注释的同源资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D309-D314. doi: 10.1093/nar/gky1085.