• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PIRATE:一种快速且可扩展的细菌分歧直系同源聚类泛基因组工具包。

PIRATE: A fast and scalable pangenomics toolbox for clustering diverged orthologues in bacteria.

机构信息

The Milner Centre for Evolution, Department of Biology and Biochemistry, Claverton Down, University of Bath, Bath BA2 7AY, UK.

出版信息

Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz119.

DOI:10.1093/gigascience/giz119
PMID:31598686
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6785682/
Abstract

BACKGROUND

Cataloguing the distribution of genes within natural bacterial populations is essential for understanding evolutionary processes and the genetic basis of adaptation. Advances in whole genome sequencing technologies have led to a vast expansion in the amount of bacterial genomes deposited in public databases. There is a pressing need for software solutions which are able to cluster, catalogue and characterise genes, or other features, in increasingly large genomic datasets.

RESULTS

Here we present a pangenomics toolbox, PIRATE (Pangenome Iterative Refinement and Threshold Evaluation), which identifies and classifies orthologous gene families in bacterial pangenomes over a wide range of sequence similarity thresholds. PIRATE builds upon recent scalable software developments to allow for the rapid interrogation of thousands of isolates. PIRATE clusters genes (or other annotated features) over a wide range of amino acid or nucleotide identity thresholds and uses the clustering information to rapidly identify paralogous gene families and putative fission/fusion events. Furthermore, PIRATE orders the pangenome using a directed graph, provides a measure of allelic variation, and estimates sequence divergence for each gene family.

CONCLUSIONS

We demonstrate that PIRATE scales linearly with both number of samples and computation resources, allowing for analysis of large genomic datasets, and compares favorably to other popular tools. PIRATE provides a robust framework for analysing bacterial pangenomes, from largely clonal to panmictic species.

摘要

背景

编目自然细菌种群中基因的分布对于理解进化过程和适应的遗传基础至关重要。全基因组测序技术的进步导致公共数据库中储存的细菌基因组数量大幅增加。现在迫切需要能够对越来越大的基因组数据集进行聚类、编目和特征分析的软件解决方案。

结果

在这里,我们提出了一个泛基因组工具包 PIRATE(泛基因组迭代细化和阈值评估),它可以在广泛的序列相似性阈值范围内识别和分类细菌泛基因组中的直系同源基因家族。PIRATE 建立在最近的可扩展软件发展的基础上,允许快速询问数千个分离物。PIRATE 在广泛的氨基酸或核苷酸身份阈值范围内对基因(或其他注释特征)进行聚类,并使用聚类信息快速识别旁系基因家族和潜在的分裂/融合事件。此外,PIRATE 使用有向图对泛基因组进行排序,提供等位基因变异的度量,并估计每个基因家族的序列分歧。

结论

我们证明 PIRATE 可以线性扩展到样本数量和计算资源,允许对大型基因组数据集进行分析,并与其他流行工具相比具有优势。PIRATE 为分析从主要克隆到泛化的细菌泛基因组提供了一个稳健的框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/5afcfe150270/giz119fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/54c1124fb344/giz119fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/e6f3e70d36d1/giz119fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/c96dfc5acad6/giz119fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/a46e62a62f3c/giz119fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/5afcfe150270/giz119fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/54c1124fb344/giz119fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/e6f3e70d36d1/giz119fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/c96dfc5acad6/giz119fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/a46e62a62f3c/giz119fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d63b/6785682/5afcfe150270/giz119fig5.jpg

相似文献

1
PIRATE: A fast and scalable pangenomics toolbox for clustering diverged orthologues in bacteria.PIRATE:一种快速且可扩展的细菌分歧直系同源聚类泛基因组工具包。
Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz119.
2
Are we there yet? Driving the road to evolutionary graph-pangenomics.我们到了吗?驶向进化图泛基因组学之路。
Curr Opin Plant Biol. 2022 Apr;66:102195. doi: 10.1016/j.pbi.2022.102195. Epub 2022 Feb 23.
3
Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications.泛基因组学时代的到来:从细菌到动植物的应用。
Trends Genet. 2020 Feb;36(2):132-145. doi: 10.1016/j.tig.2019.11.006. Epub 2019 Dec 24.
4
PPanGGOLiN: Depicting microbial diversity via a partitioned pangenome graph.PPanGGOLiN:通过分区泛基因组图描绘微生物多样性。
PLoS Comput Biol. 2020 Mar 19;16(3):e1007732. doi: 10.1371/journal.pcbi.1007732. eCollection 2020 Mar.
5
Hierarchical sets: analyzing pangenome structure through scalable set visualizations.分层集:通过可扩展的集合可视化分析泛基因组结构。
Bioinformatics. 2017 Jun 1;33(11):1604-1612. doi: 10.1093/bioinformatics/btx034.
6
A graph-based approach for the visualisation and analysis of bacterial pangenomes.基于图的方法用于细菌泛基因组的可视化和分析。
BMC Bioinformatics. 2022 Oct 8;23(1):416. doi: 10.1186/s12859-022-04898-2.
7
GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis.GET_HOMOLOGUES,一个用于可扩展且稳健的微生物泛基因组分析的通用软件包。
Appl Environ Microbiol. 2013 Dec;79(24):7696-701. doi: 10.1128/AEM.02411-13. Epub 2013 Oct 4.
8
Methods and Developments in Graphical Pangenomics.图形泛基因组学的方法与进展
J Indian Inst Sci. 2021;101(3):485-498. doi: 10.1007/s41745-021-00255-z. Epub 2021 Aug 24.
9
Fast and flexible bacterial genomic epidemiology with PopPUNK.使用 PopPUNK 进行快速灵活的细菌基因组流行病学研究。
Genome Res. 2019 Feb;29(2):304-316. doi: 10.1101/gr.241455.118. Epub 2019 Jan 24.
10
Towards plant pangenomics.迈向植物泛基因组学。
Plant Biotechnol J. 2016 Apr;14(4):1099-105. doi: 10.1111/pbi.12499. Epub 2015 Nov 23.

引用本文的文献

1
is a novel symbiont of marine invertebrates that exhibits broad patterns of phylosymbiosis.是一种海洋无脊椎动物的新型共生体,呈现出广泛的系统共生模式。
bioRxiv. 2025 Aug 22:2025.08.22.671635. doi: 10.1101/2025.08.22.671635.
2
Genome-wide association analysis provides insights into the genomics and extracellular expression of proteases.全基因组关联分析为蛋白酶的基因组学和细胞外表达提供了见解。
Virulence. 2025 Dec;16(1):2543116. doi: 10.1080/21505594.2025.2543116. Epub 2025 Aug 14.
3
Linkage-based ortholog refinement in bacterial pangenomes with CLARC.

本文引用的文献

1
Fast and flexible bacterial genomic epidemiology with PopPUNK.使用 PopPUNK 进行快速灵活的细菌基因组流行病学研究。
Genome Res. 2019 Feb;29(2):304-316. doi: 10.1101/gr.241455.118. Epub 2019 Jan 24.
2
Population genomics of bacterial host adaptation.细菌宿主适应性的群体基因组学。
Nat Rev Genet. 2018 Sep;19(9):549-565. doi: 10.1038/s41576-018-0032-z.
3
Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria.猪:一种快速、大规模的细菌基因间区泛基因组分析工具。
使用CLARC在细菌泛基因组中基于连锁的直系同源基因优化
Nucleic Acids Res. 2025 Jun 20;53(12). doi: 10.1093/nar/gkaf488.
4
The complete genome of isolated from coastal waters and Pacific oysters in England.从英国沿海水域和太平洋牡蛎中分离出的(基因组)的完整基因组。 注:原句似乎不太完整,“isolated from coastal waters and Pacific oysters in England”前面应该还有相关主体,比如某种生物或物质等,这里暂且按照完整基因组是从相关对象中分离出的来翻译。
Microbiol Resour Announc. 2025 Jul 10;14(7):e0131824. doi: 10.1128/mra.01318-24. Epub 2025 Jun 9.
5
Treatment of Infections with T4-Related Bacteriophages Belonging to Class : Selecting Phage on the Basis of Their Generalized Transduction Capability.属于类的T4相关噬菌体感染的治疗:基于其广义转导能力选择噬菌体
Viruses. 2025 May 14;17(5):701. doi: 10.3390/v17050701.
6
Linear dicentric chromosomes in bacterial natural isolates reveal common constraints for replicon fusion.细菌自然分离株中的线性双着丝粒染色体揭示了复制子融合的常见限制因素。
mBio. 2025 Jun 11;16(6):e0104625. doi: 10.1128/mbio.01046-25. Epub 2025 May 20.
7
Genomic characterization of from canine otitis highlights the need for a One Health approach to this opportunistic pathogen.犬中耳炎病原体的基因组特征凸显了对这种机会性病原体采取“同一健康”方法的必要性。 (注:原文“from canine otitis”表述不太完整准确,推测可能是想说“pathogen from canine otitis”之类的,这里按合理推测进行了补充翻译。)
Microb Genom. 2025 May;11(5). doi: 10.1099/mgen.0.001407.
8
Methicillin-sensitive lineages contribute towards poor patient outcomes in orthopaedic device-related infections.对甲氧西林敏感的菌系在骨科器械相关感染中会导致患者预后不良。
Microb Genom. 2025 Apr;11(4). doi: 10.1099/mgen.0.001390.
9
Comprehensive genomic and evolutionary analysis of biofilm matrix clusters and proteins in the genus.该属生物膜基质簇和蛋白质的综合基因组与进化分析。
mSystems. 2025 May 20;10(5):e0006025. doi: 10.1128/msystems.00060-25. Epub 2025 Apr 10.
10
Genomic investigation of MRSA bacteremia relapse reveals diverse genomic profiles but convergence in bacteremia-associated genes.耐甲氧西林金黄色葡萄球菌菌血症复发的基因组研究揭示了不同的基因组图谱,但在菌血症相关基因方面存在趋同现象。
medRxiv. 2025 Mar 28:2025.03.24.25324140. doi: 10.1101/2025.03.24.25324140.
Gigascience. 2018 Apr 1;7(4):1-11. doi: 10.1093/gigascience/giy015.
4
panX: pan-genome analysis and exploration.panX:泛基因组分析与探索。
Nucleic Acids Res. 2018 Jan 9;46(1):e5. doi: 10.1093/nar/gkx977.
5
Phandango: an interactive viewer for bacterial population genomics.凡丹戈:一种用于细菌群体基因组学的交互式查看器。
Bioinformatics. 2018 Jan 15;34(2):292-293. doi: 10.1093/bioinformatics/btx610.
6
CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community.CLIMB(微生物生物信息学的云基础设施):面向医学微生物学领域的在线资源。
Microb Genom. 2016 Sep 20;2(9):e000086. doi: 10.1099/mgen.0.000086. eCollection 2016 Sep.
7
Microreact: visualizing and sharing data for genomic epidemiology and phylogeography.微反应:可视化和共享基因组流行病学和系统地理学数据。
Microb Genom. 2016 Nov 30;2(11):e000093. doi: 10.1099/mgen.0.000093. eCollection 2016 Nov.
8
Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary.使用Scoary在微生物全基因组关联研究中对基因进行快速评分
Genome Biol. 2016 Nov 25;17(1):238. doi: 10.1186/s13059-016-1108-8.
9
Roary: rapid large-scale prokaryote pan genome analysis.Roary:快速大规模原核生物泛基因组分析
Bioinformatics. 2015 Nov 15;31(22):3691-3. doi: 10.1093/bioinformatics/btv421. Epub 2015 Jul 20.
10
Fast and sensitive protein alignment using DIAMOND.使用 DIAMOND 进行快速灵敏的蛋白质比对。
Nat Methods. 2015 Jan;12(1):59-60. doi: 10.1038/nmeth.3176. Epub 2014 Nov 17.