• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于宏基因组测序reads 精确分类学分配的统计框架。

A statistical framework for accurate taxonomic assignment of metagenomic sequencing reads.

机构信息

Department of Statistics, Northwestern University, Evanston, Illinois, United States of America.

出版信息

PLoS One. 2012;7(10):e46450. doi: 10.1371/journal.pone.0046450. Epub 2012 Oct 1.

DOI:10.1371/journal.pone.0046450
PMID:23049702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3462201/
Abstract

The advent of next-generation sequencing technologies has greatly promoted the field of metagenomics which studies genetic material recovered directly from an environment. Characterization of genomic composition of a metagenomic sample is essential for understanding the structure of the microbial community. Multiple genomes contained in a metagenomic sample can be identified and quantitated through homology searches of sequence reads with known sequences catalogued in reference databases. Traditionally, reads with multiple genomic hits are assigned to non-specific or high ranks of the taxonomy tree, thereby impacting on accurate estimates of relative abundance of multiple genomes present in a sample. Instead of assigning reads one by one to the taxonomy tree as many existing methods do, we propose a statistical framework to model the identified candidate genomes to which sequence reads have hits. After obtaining the estimated proportion of reads generated by each genome, sequence reads are assigned to the candidate genomes and the taxonomy tree based on the estimated probability by taking into account both sequence alignment scores and estimated genome abundance. The proposed method is comprehensively tested on both simulated datasets and two real datasets. It assigns reads to the low taxonomic ranks very accurately. Our statistical approach of taxonomic assignment of metagenomic reads, TAMER, is implemented in R and available at http://faculty.wcas.northwestern.edu/hji403/MetaR.htm.

摘要

下一代测序技术的出现极大地推动了宏基因组学领域的发展,该领域研究直接从环境中回收的遗传物质。对宏基因组样本的基因组组成进行特征描述对于理解微生物群落的结构至关重要。通过将序列读取与参考数据库中编目的已知序列进行同源搜索,可以识别和定量包含在宏基因组样本中的多个基因组。传统上,具有多个基因组命中的读取被分配给非特异性或分类树的高级别,从而影响对样本中存在的多个基因组相对丰度的准确估计。我们提出了一种统计框架,而不是像许多现有方法那样逐个将读取分配给分类树,该框架用于对具有命中的鉴定候选基因组进行建模。在获得每个基因组生成的读取的估计比例之后,根据考虑序列比对得分和估计的基因组丰度的估计概率,将读取分配给候选基因组和分类树。所提出的方法在模拟数据集和两个真实数据集上进行了全面测试。它非常准确地将读取分配给低分类等级。我们的宏基因组读取分类分配的统计方法 TAMER 是用 R 实现的,可在 http://faculty.wcas.northwestern.edu/hji403/MetaR.htm 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/f8fa9c6ce92f/pone.0046450.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/7fe8f3846baa/pone.0046450.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/3785b230f1be/pone.0046450.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/9da1efded2e9/pone.0046450.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/f8fa9c6ce92f/pone.0046450.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/7fe8f3846baa/pone.0046450.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/3785b230f1be/pone.0046450.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/9da1efded2e9/pone.0046450.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3462201/f8fa9c6ce92f/pone.0046450.g004.jpg

相似文献

1
A statistical framework for accurate taxonomic assignment of metagenomic sequencing reads.一种用于宏基因组测序reads 精确分类学分配的统计框架。
PLoS One. 2012;7(10):e46450. doi: 10.1371/journal.pone.0046450. Epub 2012 Oct 1.
2
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
3
Re-purposing software for functional characterization of the microbiome.重新利用软件对微生物组进行功能特征分析。
Microbiome. 2021 Jan 9;9(1):4. doi: 10.1186/s40168-020-00971-1.
4
RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets.RIEMS:一种用于对宏基因组学数据集的 reads 进行灵敏且全面的分类学分类的软件流程。
BMC Bioinformatics. 2015 Mar 3;16(1):69. doi: 10.1186/s12859-015-0503-6.
5
An improved statistical model for taxonomic assignment of metagenomics.一种用于宏基因组学分类归属的改进统计模型。
BMC Genet. 2018 Oct 29;19(1):98. doi: 10.1186/s12863-018-0680-1.
6
Strain-level metagenomic assignment and compositional estimation for long reads with MetaMaps.使用 MetaMaps 对长读进行菌株水平宏基因组分配和组成估计。
Nat Commun. 2019 Jul 11;10(1):3066. doi: 10.1038/s41467-019-10934-2.
7
Infer Metagenomic Abundance and Reveal Homologous Genomes Based on the Structure of Taxonomy Tree.基于分类树结构推断宏基因组丰度并揭示同源基因组
IEEE/ACM Trans Comput Biol Bioinform. 2015 Sep-Oct;12(5):1112-22. doi: 10.1109/TCBB.2015.2415814.
8
Mora: abundance aware metagenomic read re-assignment for disentangling similar strains.莫拉:用于区分相似菌株的丰度感知宏基因组读数重新分配法
BMC Bioinformatics. 2024 Apr 23;25(1):161. doi: 10.1186/s12859-024-05768-9.
9
MBMC: An Effective Markov Chain Approach for Binning Metagenomic Reads from Environmental Shotgun Sequencing Projects.MBMC:一种用于对环境鸟枪法测序项目中的宏基因组读数进行分箱的有效马尔可夫链方法。
OMICS. 2016 Aug;20(8):470-9. doi: 10.1089/omi.2016.0081. Epub 2016 Jul 22.
10
MTR: taxonomic annotation of short metagenomic reads using clustering at multiple taxonomic ranks.MTR:使用多种分类等级的聚类对短宏基因组reads 进行分类注释。
Bioinformatics. 2011 Jan 15;27(2):196-203. doi: 10.1093/bioinformatics/btq649. Epub 2010 Dec 1.

引用本文的文献

1
Virus detection by short read high throughput sequencing in a high virus low cellular background.在高病毒低细胞背景下通过短读长高通量测序进行病毒检测。
NPJ Vaccines. 2025 Mar 29;10(1):61. doi: 10.1038/s41541-025-01104-1.
2
Report of the third conference on next-generation sequencing for adventitious virus detection in biologics for humans and animals.第三届人类和动物用生物制品中外源病毒经下一代测序检测会议报告。
Biologicals. 2023 Aug;83:101696. doi: 10.1016/j.biologicals.2023.101696. Epub 2023 Jul 19.
3
Exodus: sequencing-based pipeline for quantification of pooled variants.

本文引用的文献

1
Deep sequencing of the oral microbiome reveals signatures of periodontal disease.口腔微生物组的深度测序揭示了牙周病的特征。
PLoS One. 2012;7(6):e37919. doi: 10.1371/journal.pone.0037919. Epub 2012 Jun 4.
2
Fast and accurate taxonomic assignments of metagenomic sequences using MetaBin.利用 MetaBin 实现宏基因组序列的快速、准确分类学赋值。
PLoS One. 2012;7(4):e34030. doi: 10.1371/journal.pone.0034030. Epub 2012 Apr 4.
3
Accurate genome relative abundance estimation based on shotgun metagenomic reads.基于高通量宏基因组测序reads 的精确基因组相对丰度估计
Exodus:基于测序的池变异定量分析流水线。
Bioinformatics. 2022 Jun 13;38(12):3288-3290. doi: 10.1093/bioinformatics/btac319.
4
Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.宏基因组学音乐——应用、分析流程及其相关工具的综述。
Funct Integr Genomics. 2022 Feb;22(1):3-26. doi: 10.1007/s10142-021-00810-y. Epub 2021 Oct 18.
5
Immediate Effects of Ammonia Shock on Transcription and Composition of a Biogas Reactor Microbiome.氨冲击对沼气反应器微生物群落转录和组成的即时影响。
Front Microbiol. 2019 Sep 6;10:2064. doi: 10.3389/fmicb.2019.02064. eCollection 2019.
6
Long-term investigation of microbial community composition and transcription patterns in a biogas plant undergoing ammonia crisis.长期研究在氨危机中运行的沼气厂中的微生物群落组成和转录模式。
Microb Biotechnol. 2019 Mar;12(2):305-323. doi: 10.1111/1751-7915.13313. Epub 2018 Oct 31.
7
An improved statistical model for taxonomic assignment of metagenomics.一种用于宏基因组学分类归属的改进统计模型。
BMC Genet. 2018 Oct 29;19(1):98. doi: 10.1186/s12863-018-0680-1.
8
Considerations for Optimization of High-Throughput Sequencing Bioinformatics Pipelines for Virus Detection.考虑优化高通量测序生物信息学管道以用于病毒检测。
Viruses. 2018 Sep 27;10(10):528. doi: 10.3390/v10100528.
9
Combining 16S rRNA gene variable regions enables high-resolution microbial community profiling.结合 16S rRNA 基因可变区可实现高分辨率微生物群落分析。
Microbiome. 2018 Jan 26;6(1):17. doi: 10.1186/s40168-017-0396-x.
10
A clinician's guide to microbiome analysis.临床医生微生物组分析指南。
Nat Rev Gastroenterol Hepatol. 2017 Oct;14(10):585-595. doi: 10.1038/nrgastro.2017.97. Epub 2017 Aug 9.
PLoS One. 2011;6(12):e27992. doi: 10.1371/journal.pone.0027992. Epub 2011 Dec 6.
4
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2012 Jan;40(Database issue):D13-25. doi: 10.1093/nar/gkr1184. Epub 2011 Dec 2.
5
The oral metagenome in health and disease.口腔宏基因组学在健康与疾病中的研究
ISME J. 2012 Jan;6(1):46-56. doi: 10.1038/ismej.2011.85. Epub 2011 Jun 30.
6
Taxonomic classification of metagenomic shotgun sequences with CARMA3.基于 CARMA3 的宏基因组鸟枪法测序的分类学分类
Nucleic Acids Res. 2011 Aug;39(14):e91. doi: 10.1093/nar/gkr225. Epub 2011 May 17.
7
Mixture models for analysis of the taxonomic composition of metagenomes.用于宏基因组分类组成分析的混合模型。
Bioinformatics. 2011 Jun 15;27(12):1618-24. doi: 10.1093/bioinformatics/btr266. Epub 2011 May 5.
8
Enterotypes of the human gut microbiome.人类肠道微生物组的肠型。
Nature. 2011 May 12;473(7346):174-80. doi: 10.1038/nature09944. Epub 2011 Apr 20.
9
Taxonomic metagenome sequence assignment with structured output models.使用结构化输出模型进行分类宏基因组序列分配。
Nat Methods. 2011 Mar;8(3):191-2. doi: 10.1038/nmeth0311-191.
10
MTR: taxonomic annotation of short metagenomic reads using clustering at multiple taxonomic ranks.MTR:使用多种分类等级的聚类对短宏基因组reads 进行分类注释。
Bioinformatics. 2011 Jan 15;27(2):196-203. doi: 10.1093/bioinformatics/btq649. Epub 2010 Dec 1.