• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PlasClass 可改善质粒序列分类。

PlasClass improves plasmid sequence classification.

机构信息

Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.

Department of Life Sciences, Ben-Gurion University of the Negev and the National Institute for Biotechnology in the Negev, Marcus Family Campus, Beer-Sheva, Israel.

出版信息

PLoS Comput Biol. 2020 Apr 3;16(4):e1007781. doi: 10.1371/journal.pcbi.1007781. eCollection 2020 Apr.

DOI:10.1371/journal.pcbi.1007781
PMID:32243433
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7159247/
Abstract

Many bacteria contain plasmids, but separating between contigs that originate on the plasmid and those that are part of the bacterial genome can be difficult. This is especially true in metagenomic assembly, which yields many contigs of unknown origin. Existing tools for classifying sequences of plasmid origin give less reliable results for shorter sequences, are trained using a fraction of the known plasmids, and can be difficult to use in practice. We present PlasClass, a new plasmid classifier. It uses a set of standard classifiers trained on the most current set of known plasmid sequences for different sequence lengths. We tested PlasClass sequence classification on held-out data and simulations, as well as publicly available bacterial isolates and plasmidome samples and plasmids assembled from metagenomic samples. PlasClass outperforms the state-of-the-art plasmid classification tool on shorter sequences, which constitute the majority of assembly contigs, allowing it to achieve higher F1 scores in classifying sequences from a wide range of datasets. PlasClass also uses significantly less time and memory. PlasClass can be used to easily classify plasmid and bacterial genome sequences in metagenomic or isolate assemblies. It is available under the MIT license from: https://github.com/Shamir-Lab/PlasClass.

摘要

许多细菌都含有质粒,但将源自质粒的 contigs 与细菌基因组的 contigs 区分开来可能很困难。在宏基因组组装中尤其如此,因为它会产生许多来源未知的 contigs。现有的用于分类质粒来源序列的工具在较短的序列上给出的结果不太可靠,它们是使用已知质粒的一小部分进行训练的,并且在实际中可能难以使用。我们提出了 PlasClass,这是一种新的质粒分类器。它使用一组针对不同序列长度的最新已知质粒序列进行训练的标准分类器。我们在保留数据和模拟数据、公开的细菌分离株和质粒组样本以及从宏基因组样本组装的质粒上测试了 PlasClass 的序列分类。PlasClass 在较短的序列上优于最先进的质粒分类工具,这些序列构成了大多数组装 contigs,使它能够在对来自各种数据集的序列进行分类时获得更高的 F1 分数。PlasClass 还使用的时间和内存显著减少。PlasClass 可用于轻松分类宏基因组或分离株组装中的质粒和细菌基因组序列。它可从以下网址以 MIT 许可证获得:https://github.com/Shamir-Lab/PlasClass。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da2e/7159247/39ae868d5ac6/pcbi.1007781.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da2e/7159247/39ae868d5ac6/pcbi.1007781.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da2e/7159247/39ae868d5ac6/pcbi.1007781.g001.jpg

相似文献

1
PlasClass improves plasmid sequence classification.PlasClass 可改善质粒序列分类。
PLoS Comput Biol. 2020 Apr 3;16(4):e1007781. doi: 10.1371/journal.pcbi.1007781. eCollection 2020 Apr.
2
SCAPP: an algorithm for improved plasmid assembly in metagenomes.SCAPP:一种用于提高宏基因组中质粒组装的算法。
Microbiome. 2021 Jun 25;9(1):144. doi: 10.1186/s40168-021-01068-z.
3
3CAC: improving the classification of phages and plasmids in metagenomic assemblies using assembly graphs.3CAC:利用组装图提高宏基因组组装中噬菌体和质粒的分类。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii56-ii61. doi: 10.1093/bioinformatics/btac468.
4
PLASMe: a tool to identify PLASMid contigs from short-read assemblies using transformer.PLASMe:一种使用变压器从短读组装中识别 PLASMid 连续体的工具。
Nucleic Acids Res. 2023 Aug 25;51(15):e83. doi: 10.1093/nar/gkad578.
5
Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures.利用序列特征鉴定质体组中可接合和可移动的质粒片段。
Microb Genom. 2020 Nov;6(11). doi: 10.1099/mgen.0.000459. Epub 2020 Oct 19.
6
Platon: identification and characterization of bacterial plasmid contigs in short-read draft assemblies exploiting protein sequence-based replicon distribution scores.柏拉图:利用基于蛋白质序列的复制子分布分数,在短读长草图组装中鉴定和描述细菌质粒片段。
Microb Genom. 2020 Oct;6(10). doi: 10.1099/mgen.0.000398.
7
GraphPlas: Refined Classification of Plasmid Sequences Using Assembly Graphs.GraphPlas:利用组装图谱进行质粒序列的精细化分类。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):57-67. doi: 10.1109/TCBB.2021.3082915. Epub 2022 Feb 3.
8
PlasmidHunter: accurate and fast prediction of plasmid sequences using gene content profile and machine learning.PlasmidHunter:使用基因内容特征和机器学习进行准确快速的质粒序列预测。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae322.
9
MOB-suite: software tools for clustering, reconstruction and typing of plasmids from draft assemblies.MOB-suite:用于从草图组装中对质粒进行聚类、重建和分型的软件工具。
Microb Genom. 2018 Aug;4(8). doi: 10.1099/mgen.0.000206. Epub 2018 Jul 27.
10
HOTSPOT: hierarchical host prediction for assembled plasmid contigs with transformer.热点:使用 Transformer 进行组装质粒 contigs 的分层宿主预测。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad283.

引用本文的文献

1
Investigation of mobile genetic elements and their association with antibiotic resistance genes in clinical pathogens worldwide.全球临床病原体中移动遗传元件及其与抗生素抗性基因的关联研究。
PLoS One. 2025 Aug 18;20(8):e0330304. doi: 10.1371/journal.pone.0330304. eCollection 2025.
2
Giant extrachromosomal element "Inocle" potentially expands the adaptive capacity of the human oral microbiome.巨大的染色体外元件“Inocle”可能会扩大人类口腔微生物群的适应能力。
Nat Commun. 2025 Aug 11;16(1):7397. doi: 10.1038/s41467-025-62406-5.
3
Draft genome sequence of sp. GlSt437 isolated from the Styx Glacier, Antarctica.

本文引用的文献

1
PLSDB: a resource of complete bacterial plasmids.PLSDB:一个完整的细菌质粒资源库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D195-D202. doi: 10.1093/nar/gky1050.
2
Simulating Illumina metagenomic data with InSilicoSeq.用 InSilicoSeq 模拟 Illumina 宏基因组数据。
Bioinformatics. 2019 Feb 1;35(3):521-522. doi: 10.1093/bioinformatics/bty630.
3
PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures.PlasFlow:利用基因组特征预测宏基因组数据中的质粒序列。
从南极洲冥河冰川分离出的GlSt437菌的基因组序列草图。
Microbiol Resour Announc. 2025 Sep 11;14(9):e0024625. doi: 10.1128/mra.00246-25. Epub 2025 Aug 11.
4
Circling in on plasmids: benchmarking plasmid detection and reconstruction tools for short-read data from diverse species.聚焦质粒:对来自不同物种的短读长数据的质粒检测和重建工具进行基准测试
bioRxiv. 2025 Aug 5:2025.07.28.667252. doi: 10.1101/2025.07.28.667252.
5
Metagenomics-Toolkit: the flexible and efficient cloud-based metagenomics workflow featuring machine learning-enabled resource allocation.宏基因组学工具包:基于云的灵活高效宏基因组学工作流程,具有支持机器学习的资源分配功能。
NAR Genom Bioinform. 2025 Jul 17;7(3):lqaf093. doi: 10.1093/nargab/lqaf093. eCollection 2025 Sep.
6
Global health risks lurking in livestock resistome.潜伏在牲畜耐药基因组中的全球健康风险。
Sci Adv. 2025 Jun 27;11(26):eadt8073. doi: 10.1126/sciadv.adt8073.
7
Integrating comparative genomics and risk classification by assessing virulence, antimicrobial resistance, and plasmid spread in microbial communities with gSpreadComp.通过使用gSpreadComp评估微生物群落中的毒力、抗微生物耐药性和质粒传播,整合比较基因组学和风险分类。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf072.
8
Genomic analysis of carbapenem-resistant blood isolates from nationwide surveillance in South Korea.韩国全国监测中耐碳青霉烯类血液分离株的基因组分析。
Front Microbiol. 2025 May 6;16:1562222. doi: 10.3389/fmicb.2025.1562222. eCollection 2025.
9
Metagenomics as a Transformative Tool for Antibiotic Resistance Surveillance: Highlighting the Impact of Mobile Genetic Elements with a Focus on the Complex Role of Phages.宏基因组学作为抗生素耐药性监测的变革性工具:强调移动遗传元件的影响,重点关注噬菌体的复杂作用。
Antibiotics (Basel). 2025 Mar 12;14(3):296. doi: 10.3390/antibiotics14030296.
10
Global analysis of the metaplasmidome: ecological drivers and spread of antibiotic resistance genes across ecosystems.宏质粒组的全球分析:抗生素抗性基因在生态系统中的生态驱动因素及传播
Microbiome. 2025 Mar 19;13(1):77. doi: 10.1186/s40168-025-02062-5.
Nucleic Acids Res. 2018 Apr 6;46(6):e35. doi: 10.1093/nar/gkx1321.
4
Characteristics of ARG-carrying plasmidome in the cultivable microbial community from wastewater treatment system under high oxytetracycline concentration.高浓度土霉素废水处理系统可培养微生物群落中携带 ARG 的质粒组特征。
Appl Microbiol Biotechnol. 2018 Feb;102(4):1847-1858. doi: 10.1007/s00253-018-8738-6. Epub 2018 Jan 13.
5
On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data.从全基因组短读测序数据重建质粒的(不)可能性。
Microb Genom. 2017 Aug 18;3(10):e000128. doi: 10.1099/mgen.0.000128. eCollection 2017 Oct.
6
Recycler: an algorithm for detecting plasmids from de novo assembly graphs.Recycler:一种从从头组装图中检测质粒的算法。
Bioinformatics. 2017 Feb 15;33(4):475-482. doi: 10.1093/bioinformatics/btw651.
7
SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.SPAdes:一种新的基因组组装算法及其在单细胞测序中的应用
J Comput Biol. 2012 May;19(5):455-77. doi: 10.1089/cmb.2012.0021. Epub 2012 Apr 16.
8
cBar: a computer program to distinguish plasmid-derived from chromosome-derived sequence fragments in metagenomics data.cBar:一种用于区分宏基因组数据中质粒衍生和染色体衍生序列片段的计算机程序。
Bioinformatics. 2010 Aug 15;26(16):2051-2. doi: 10.1093/bioinformatics/btq299. Epub 2010 Jun 10.