• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重建宏基因组数据中的 16S rRNA 基因。

Reconstructing 16S rRNA genes in metagenomic data.

机构信息

Computer Science and Engineering, Michigan State Univerisity, 428 South Shaw Rd East Lansing, MI 48824, USA and Center for Microbial Ecology, Michigan State University, East Lansing, MI 48824, USA.

出版信息

Bioinformatics. 2015 Jun 15;31(12):i35-43. doi: 10.1093/bioinformatics/btv231.

DOI:10.1093/bioinformatics/btv231
PMID:26072503
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4765874/
Abstract

UNLABELLED

Metagenomic data, which contains sequenced DNA reads of uncultured microbial species from environmental samples, provide a unique opportunity to thoroughly analyze microbial species that have never been identified before. Reconstructing 16S ribosomal RNA, a phylogenetic marker gene, is usually required to analyze the composition of the metagenomic data. However, massive volume of dataset, high sequence similarity between related species, skewed microbial abundance and lack of reference genes make 16S rRNA reconstruction difficult. Generic de novo assembly tools are not optimized for assembling 16S rRNA genes. In this work, we introduce a targeted rRNA assembly tool, REAGO (REconstruct 16S ribosomal RNA Genes from metagenOmic data). It addresses the above challenges by combining secondary structure-aware homology search, zproperties of rRNA genes and de novo assembly. Our experimental results show that our tool can correctly recover more rRNA genes than several popular generic metagenomic assembly tools and specially designed rRNA construction tools.

AVAILABILITY AND IMPLEMENTATION

The source code of REAGO is freely available at https://github.com/chengyuan/reago.

摘要

未标记

宏基因组数据包含了从环境样本中未培养微生物物种的测序 DNA 片段,为彻底分析以前从未鉴定过的微生物物种提供了独特的机会。通常需要重建 16S 核糖体 RNA(一种系统发育标记基因)来分析宏基因组数据的组成。然而,数据集的巨大体积、相关物种之间的高序列相似性、微生物丰度的倾斜以及缺乏参考基因使得 16S rRNA 的重建变得困难。通用的从头组装工具不能针对 16S rRNA 基因进行优化。在这项工作中,我们引入了一种靶向 rRNA 组装工具,REAGO(从宏基因组数据中重建 16S 核糖体 RNA 基因)。它通过结合二级结构感知同源搜索、rRNA 基因的 z 属性和从头组装来解决上述挑战。我们的实验结果表明,与几个流行的通用宏基因组组装工具和专门设计的 rRNA 构建工具相比,我们的工具可以更准确地恢复更多的 rRNA 基因。

可用性和实现

REAGO 的源代码可在 https://github.com/chengyuan/reago 上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/68a1e31273bd/btv231f7p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/025ebd124dcd/btv231f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/e10822397e88/btv231f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/786c3ff1919e/btv231f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/5f7463810264/btv231f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/f625ce3dfe7b/btv231f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/5a8b24462cbf/btv231f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/68a1e31273bd/btv231f7p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/025ebd124dcd/btv231f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/e10822397e88/btv231f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/786c3ff1919e/btv231f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/5f7463810264/btv231f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/f625ce3dfe7b/btv231f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/5a8b24462cbf/btv231f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f66f/4765874/68a1e31273bd/btv231f7p.jpg

相似文献

1
Reconstructing 16S rRNA genes in metagenomic data.重建宏基因组数据中的 16S rRNA 基因。
Bioinformatics. 2015 Jun 15;31(12):i35-43. doi: 10.1093/bioinformatics/btv231.
2
Tax4Fun: predicting functional profiles from metagenomic 16S rRNA data.Tax4Fun:从宏基因组16S rRNA数据预测功能概况。
Bioinformatics. 2015 Sep 1;31(17):2882-4. doi: 10.1093/bioinformatics/btv287. Epub 2015 May 7.
3
VITCOMIC2: visualization tool for the phylogenetic composition of microbial communities based on 16S rRNA gene amplicons and metagenomic shotgun sequencing.VITCOMIC2:基于16S rRNA基因扩增子和宏基因组鸟枪法测序的微生物群落系统发育组成可视化工具。
BMC Syst Biol. 2018 Mar 19;12(Suppl 2):30. doi: 10.1186/s12918-018-0545-2.
4
Comparing bacterial communities inferred from 16S rRNA gene sequencing and shotgun metagenomics.比较从16S rRNA基因测序和鸟枪法宏基因组学推断出的细菌群落。
Pac Symp Biocomput. 2011:165-76. doi: 10.1142/9789814335058_0018.
5
DBH: A de Bruijn graph-based heuristic method for clustering large-scale 16S rRNA sequences into OTUs.DBH:一种基于德布鲁因图的启发式方法,用于将大规模16S rRNA序列聚类为操作分类单元。
J Theor Biol. 2017 Jul 21;425:80-87. doi: 10.1016/j.jtbi.2017.04.019. Epub 2017 Apr 26.
6
A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis.SL1p 管道用于 16S rRNA 基因测序分析的综合评估。
Microbiome. 2017 Aug 14;5(1):100. doi: 10.1186/s40168-017-0314-2.
7
PanFP: pangenome-based functional profiles for microbial communities.PanFP:基于全基因组的微生物群落功能概况
BMC Res Notes. 2015 Sep 26;8:479. doi: 10.1186/s13104-015-1462-8.
8
The All-Species Living Tree project: a 16S rRNA-based phylogenetic tree of all sequenced type strains.全物种生命树项目:基于16S rRNA的所有已测序模式菌株的系统发育树。
Syst Appl Microbiol. 2008 Sep;31(4):241-50. doi: 10.1016/j.syapm.2008.07.001. Epub 2008 Aug 9.
9
Piphillin predicts metagenomic composition and dynamics from DADA2-corrected 16S rDNA sequences.Piphillin 可根据 DADA2 校正的 16S rDNA 序列预测宏基因组组成和动态。
BMC Genomics. 2020 Jan 17;21(1):56. doi: 10.1186/s12864-019-6427-1.
10
Phylogeny-based classification of microbial communities.基于系统发育的微生物群落分类。
Bioinformatics. 2014 Feb 15;30(4):449-56. doi: 10.1093/bioinformatics/btt700. Epub 2013 Dec 24.

引用本文的文献

1
GenomeFISH: genome-based fluorescence in situ hybridization for strain-level visualization of microbial communities.基因组荧光原位杂交技术:基于基因组的荧光原位杂交技术用于微生物群落菌株水平的可视化分析。
ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wraf138.
2
Consistent microbial insights across sequencing methods in soil studies: the role of reference taxonomies.土壤研究中不同测序方法的一致微生物见解:参考分类法的作用
mSystems. 2025 Jul 22;10(7):e0105924. doi: 10.1128/msystems.01059-24. Epub 2025 Jun 10.
3
Improved maximum growth rate prediction from microbial genomes by integrating phylogenetic information.

本文引用的文献

1
A scalable and accurate targeted gene assembly tool (SAT-Assembler) for next-generation sequencing data.一种用于下一代测序数据的可扩展且准确的靶向基因组装工具(SAT组装器)。
PLoS Comput Biol. 2014 Aug 14;10(8):e1003737. doi: 10.1371/journal.pcbi.1003737. eCollection 2014 Aug.
2
RNA-CODE: a noncoding RNA classification tool for short reads in NGS data lacking reference genomes.RNA-CODE:一种在缺乏参考基因组的 NGS 数据中对短读进行非编码 RNA 分类的工具。
PLoS One. 2013 Oct 25;8(10):e77596. doi: 10.1371/journal.pone.0077596. eCollection 2013.
3
Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities.
通过整合系统发育信息改进从微生物基因组预测最大生长速率
Nat Commun. 2025 May 7;16(1):4256. doi: 10.1038/s41467-025-59558-9.
4
Discovery of a phylogenetically novel tropical marine Gammaproteobacteria elucidated from assembled genomes and the proposed transfer of the genus from the family to fam. nov.从组装基因组中阐明一种系统发育上新的热带海洋γ-变形菌,并提议将该属从**科**转移至**新科**。 (注:原文中family处的两个**科**需根据具体文献中原本的两个科名来准确填写)
Front Microbiol. 2025 Mar 28;16:1437936. doi: 10.3389/fmicb.2025.1437936. eCollection 2025.
5
Long-read metagenomics gives a more accurate insight into the microbiota of long-ripened gouda cheeses.长读长宏基因组学能更准确地洞察长期成熟的高德干酪的微生物群落。
Front Microbiol. 2025 Mar 24;16:1543079. doi: 10.3389/fmicb.2025.1543079. eCollection 2025.
6
Augmenting microbial phylogenomic signal with tailored marker gene sets.用定制的标记基因集增强微生物系统发育信号。
bioRxiv. 2025 Mar 15:2025.03.13.643052. doi: 10.1101/2025.03.13.643052.
7
Multidisciplinary methodologies used in the study of cable bacteria.用于研究电缆细菌的多学科方法。
FEMS Microbiol Rev. 2025 Jan 14;49. doi: 10.1093/femsre/fuae030.
8
Metagenomics reveals the genetic diversity between sublineages of UCYN-A and their algal host plastids.宏基因组学揭示了UCYN-A亚谱系与其藻类宿主质体之间的遗传多样性。
ISME Commun. 2024 Dec 4;4(1):ycae150. doi: 10.1093/ismeco/ycae150. eCollection 2024 Jan.
9
Strain-resolved de-novo metagenomic assembly of viral genomes and microbial 16S rRNAs.病毒基因组和微生物16S rRNA的菌株解析从头宏基因组组装
Microbiome. 2024 Oct 1;12(1):187. doi: 10.1186/s40168-024-01904-y.
10
Taxonomic variation, plastic degradation, and antibiotic resistance traits of plastisphere communities in the maturation pond of a wastewater treatment plant.污水处理厂成熟池中塑料球层群落的分类变化、塑性降解和抗生素抗性特征。
Appl Environ Microbiol. 2024 Oct 23;90(10):e0071524. doi: 10.1128/aem.00715-24. Epub 2024 Sep 27.
利用古菌和细菌合成群落进行比较宏基因组和 rRNA 微生物多样性特征分析。
Environ Microbiol. 2013 Jun;15(6):1882-99. doi: 10.1111/1462-2920.12086. Epub 2013 Feb 6.
4
MetAMOS: a modular and open source metagenomic assembly and analysis pipeline.MetAMOS:一个模块化的开源宏基因组组装与分析流程。
Genome Biol. 2013 Jan 15;14(1):R2. doi: 10.1186/gb-2013-14-1-r2.
5
Stitching gene fragments with a network matching algorithm improves gene assembly for metagenomics.利用网络匹配算法拼接基因片段可提高宏基因组基因组装质量。
Bioinformatics. 2012 Sep 15;28(18):i363-i369. doi: 10.1093/bioinformatics/bts388.
6
MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.MetaVelvet:Velvet 组装器的扩展,用于从短序列读取进行从头宏基因组组装。
Nucleic Acids Res. 2012 Nov 1;40(20):e155. doi: 10.1093/nar/gks678. Epub 2012 Jul 19.
7
Reconstruction of ribosomal RNA genes from metagenomic data.从宏基因组数据中重建核糖体 RNA 基因。
PLoS One. 2012;7(6):e39948. doi: 10.1371/journal.pone.0039948. Epub 2012 Jun 27.
8
Readjoiner: a fast and memory efficient string graph-based sequence assembler.Readjoiner:一种快速且内存高效的基于字符串图的序列拼接器。
BMC Bioinformatics. 2012 May 6;13:82. doi: 10.1186/1471-2105-13-82.
9
IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth.IDBA-UD:一个用于具有高度不均匀深度的单细胞和宏基因组测序数据的从头组装程序。
Bioinformatics. 2012 Jun 1;28(11):1420-8. doi: 10.1093/bioinformatics/bts174. Epub 2012 Apr 11.
10
Efficient de novo assembly of large genomes using compressed data structures.利用压缩数据结构进行高效的从头基因组组装。
Genome Res. 2012 Mar;22(3):549-56. doi: 10.1101/gr.126953.111. Epub 2011 Dec 7.