• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
cBar: a computer program to distinguish plasmid-derived from chromosome-derived sequence fragments in metagenomics data.cBar:一种用于区分宏基因组数据中质粒衍生和染色体衍生序列片段的计算机程序。
Bioinformatics. 2010 Aug 15;26(16):2051-2. doi: 10.1093/bioinformatics/btq299. Epub 2010 Jun 10.
2
PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning.PPR-Meta:一种使用深度学习从宏基因组片段中识别噬菌体和质粒的工具。
Gigascience. 2019 Jun 1;8(6). doi: 10.1093/gigascience/giz066.
3
Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering.基于分类和聚类的宏基因组序列增强的 Glimmer 基因预测。
Nucleic Acids Res. 2012 Jan;40(1):e9. doi: 10.1093/nar/gkr1067. Epub 2011 Nov 18.
4
PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures.PlasFlow:利用基因组特征预测宏基因组数据中的质粒序列。
Nucleic Acids Res. 2018 Apr 6;46(6):e35. doi: 10.1093/nar/gkx1321.
5
3CAC: improving the classification of phages and plasmids in metagenomic assemblies using assembly graphs.3CAC:利用组装图提高宏基因组组装中噬菌体和质粒的分类。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii56-ii61. doi: 10.1093/bioinformatics/btac468.
6
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
7
SCAPP: an algorithm for improved plasmid assembly in metagenomes.SCAPP:一种用于提高宏基因组中质粒组装的算法。
Microbiome. 2021 Jun 25;9(1):144. doi: 10.1186/s40168-021-01068-z.
8
Eu-Detect: an algorithm for detecting eukaryotic sequences in metagenomic data sets.Eu-Detect:一种用于在宏基因组数据集检测真核序列的算法。
J Biosci. 2011 Sep;36(4):709-17. doi: 10.1007/s12038-011-9105-2.
9
MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics.MetCap:一种用于大规模靶向宏基因组学的生物信息学探针设计流程
BMC Bioinformatics. 2015 Feb 28;16(1):65. doi: 10.1186/s12859-015-0501-8.
10
Comparison of de-novo assembly tools for plasmid metagenome analysis.比较用于质粒宏基因组分析的从头组装工具。
Genes Genomics. 2019 Sep;41(9):1077-1083. doi: 10.1007/s13258-019-00839-1. Epub 2019 Jun 11.

引用本文的文献

1
Circling in on plasmids: benchmarking plasmid detection and reconstruction tools for short-read data from diverse species.聚焦质粒:对来自不同物种的短读长数据的质粒检测和重建工具进行基准测试
bioRxiv. 2025 Aug 5:2025.07.28.667252. doi: 10.1101/2025.07.28.667252.
2
DeepMicroClass sorts metagenomic contigs into prokaryotes, eukaryotes and viruses.DeepMicroClass将宏基因组重叠群分类为原核生物、真核生物和病毒。
NAR Genom Bioinform. 2024 May 6;6(2):lqae044. doi: 10.1093/nargab/lqae044. eCollection 2024 Jun.
3
Enhancing Clinical Utility: Utilization of International Standards and Guidelines for Metagenomic Sequencing in Infectious Disease Diagnosis.提高临床实用性:在传染病诊断中应用宏基因组测序的国际标准和指南。
Int J Mol Sci. 2024 Mar 15;25(6):3333. doi: 10.3390/ijms25063333.
4
Diverse plasmid systems and their ecology across human gut metagenomes revealed by PlasX and MobMess.通过 PlasX 和 MobMess 揭示人类肠道宏基因组中的多种质粒系统及其生态。
Nat Microbiol. 2024 Mar;9(3):830-847. doi: 10.1038/s41564-024-01610-3. Epub 2024 Mar 4.
5
A cryptic plasmid is among the most numerous genetic elements in the human gut.隐秘质粒是人类肠道中数量最多的遗传元件之一。
Cell. 2024 Feb 29;187(5):1206-1222.e16. doi: 10.1016/j.cell.2024.01.039.
6
Forecasting the dynamics of a complex microbial community using integrated meta-omics.利用整合宏基因组学预测复杂微生物群落的动态变化。
Nat Ecol Evol. 2024 Jan;8(1):32-44. doi: 10.1038/s41559-023-02241-3. Epub 2023 Nov 13.
7
plASgraph2: using graph neural networks to detect plasmid contigs from an assembly graph.plASgraph2:使用图神经网络从组装图中检测质粒重叠群。
Front Microbiol. 2023 Oct 6;14:1267695. doi: 10.3389/fmicb.2023.1267695. eCollection 2023.
8
PLASMe: a tool to identify PLASMid contigs from short-read assemblies using transformer.PLASMe:一种使用变压器从短读组装中识别 PLASMid 连续体的工具。
Nucleic Acids Res. 2023 Aug 25;51(15):e83. doi: 10.1093/nar/gkad578.
9
Metagenomic analysis reveals gut plasmids as diagnosis markers for colorectal cancer.宏基因组分析揭示肠道质粒可作为结直肠癌的诊断标志物。
Front Microbiol. 2023 May 22;14:1130446. doi: 10.3389/fmicb.2023.1130446. eCollection 2023.
10
Plasmer: an Accurate and Sensitive Bacterial Plasmid Prediction Tool Based on Machine Learning of Shared k-mers and Genomic Features.Plasmer:一种基于共享 k-mers 和基因组特征的机器学习的准确且灵敏的细菌质粒预测工具。
Microbiol Spectr. 2023 Jun 15;11(3):e0464522. doi: 10.1128/spectrum.04645-22. Epub 2023 May 16.

本文引用的文献

1
Introducing the bacterial 'chromid': not a chromosome, not a plasmid.介绍细菌“chromid”:既不是染色体,也不是质粒。
Trends Microbiol. 2010 Apr;18(4):141-8. doi: 10.1016/j.tim.2009.12.010. Epub 2010 Jan 18.
2
Modal codon usage: assessing the typical codon usage of a genome.模体密码子用法:评估基因组的典型密码子用法。
Mol Biol Evol. 2010 Apr;27(4):800-10. doi: 10.1093/molbev/msp281. Epub 2009 Dec 17.
3
TACOA: taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach.TACOA:使用核化最近邻方法对环境基因组片段进行分类学分类。
BMC Bioinformatics. 2009 Feb 11;10:56. doi: 10.1186/1471-2105-10-56.
4
Barcodes for genomes and applications.基因组条形码及其应用。
BMC Bioinformatics. 2008 Dec 17;9:546. doi: 10.1186/1471-2105-9-546.
5
Binning sequences using very sparse labels within a metagenome.在宏基因组内使用非常稀疏的标签对序列进行分箱。
BMC Bioinformatics. 2008 Apr 28;9:215. doi: 10.1186/1471-2105-9-215.
6
A continuous process to extract plasmid DNA based on alkaline lysis.一种基于碱裂解法提取质粒DNA的连续过程。
Nat Protoc. 2008;3(2):176-80. doi: 10.1038/nprot.2007.526.
7
What's in the mix: phylogenetic classification of metagenome sequence samples.混合样本中的成分:宏基因组序列样本的系统发育分类
Curr Opin Microbiol. 2007 Oct;10(5):499-503. doi: 10.1016/j.mib.2007.08.004. Epub 2007 Oct 22.
8
Data mining in bioinformatics using Weka.使用Weka进行生物信息学中的数据挖掘。
Bioinformatics. 2004 Oct 12;20(15):2479-81. doi: 10.1093/bioinformatics/bth261. Epub 2004 Apr 8.
9
Duck hepatitis B virus nucleocapsids formed by N-terminally extended or C-terminally truncated core proteins disintegrate during viral DNA maturation.由N端延伸或C端截短的核心蛋白形成的鸭乙型肝炎病毒核衣壳在病毒DNA成熟过程中解体。
J Virol. 1998 Nov;72(11):9116-20. doi: 10.1128/JVI.72.11.9116-9120.1998.
10
Comparison of the predicted and observed secondary structure of T4 phage lysozyme.T4噬菌体溶菌酶预测二级结构与观察到的二级结构的比较。
Biochim Biophys Acta. 1975 Oct 20;405(2):442-51. doi: 10.1016/0005-2795(75)90109-9.

cBar:一种用于区分宏基因组数据中质粒衍生和染色体衍生序列片段的计算机程序。

cBar: a computer program to distinguish plasmid-derived from chromosome-derived sequence fragments in metagenomics data.

机构信息

Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA 30602, USA.

出版信息

Bioinformatics. 2010 Aug 15;26(16):2051-2. doi: 10.1093/bioinformatics/btq299. Epub 2010 Jun 10.

DOI:10.1093/bioinformatics/btq299
PMID:20538725
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2916713/
Abstract

SUMMARY

Huge amount of metagenomic sequence data have been produced as a result of the rapidly increasing efforts worldwide in studying microbial communities as a whole. Most, if not all, sequenced metagenomes are complex mixtures of chromosomal and plasmid sequence fragments from multiple organisms, possibly from different kingdoms. Computational methods for prediction of genomic elements such as genes are significantly different for chromosomes and plasmids, hence raising the need for separation of chromosomal from plasmid sequences in a metagenome. We present a program for classification of a metagenome set into chromosomal and plasmid sequences, based on their distinguishing pentamer frequencies. On a large training set consisting of all the sequenced prokaryotic chromosomes and plasmids, the program achieves approximately 92% in classification accuracy. On a large set of simulated metagenomes with sequence lengths ranging from 300 bp to 100 kbp, the program has classification accuracy from 64.45% to 88.75%. On a large independent test set, the program achieves 88.29% classification accuracy.

AVAILABILITY

The program has been implemented as a standalone prediction program, cBar, which is available at http://csbl.bmb.uga.edu/~ffzhou/cBar.

摘要

摘要

由于全球范围内对微生物群落进行整体研究的努力迅速增加,产生了大量的宏基因组序列数据。如果不是所有的话,那么大多数测序的宏基因组都是来自多个生物体(可能来自不同的生物界)的染色体和质粒序列片段的复杂混合物。用于预测基因等基因组元件的计算方法对于染色体和质粒有很大的不同,因此需要将宏基因组中的染色体与质粒序列分离。我们提出了一种基于五聚体频率区分的宏基因组分类程序,用于将宏基因组集分类为染色体和质粒序列。在由所有已测序的原核染色体和质粒组成的大型训练集上,该程序的分类准确性约为 92%。在包含长度为 300bp 至 100kbp 的序列的大型模拟宏基因组集上,该程序的分类准确性为 64.45%至 88.75%。在大型独立测试集上,该程序的分类准确性达到 88.29%。

可用性

该程序已实现为一个独立的预测程序 cBar,可在 http://csbl.bmb.uga.edu/~ffzhou/cBar 上获得。