• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

组装还是不组装-一个经过验证的比较宏转录组学工作流程(CoMW)。

To assemble or not to resemble-A validated Comparative Metatranscriptomics Workflow (CoMW).

机构信息

Department of Environmental Science, Aarhus University RISØ Campus, Frederiksborgvej 399, 4000 Roskilde, Denmark.

AZTI, Herrera Kaia, Portualdea z/g, 20110 Pasaia, Basque Country, Spain.

出版信息

Gigascience. 2019 Aug 1;8(8). doi: 10.1093/gigascience/giz096.

DOI:10.1093/gigascience/giz096
PMID:31363751
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6667343/
Abstract

BACKGROUND

Metatranscriptomics has been used widely for investigation and quantification of microbial communities' activity in response to external stimuli. By assessing the genes expressed, metatranscriptomics provides an understanding of the interactions between different major functional guilds and the environment. Here, we present a de novo assembly-based Comparative Metatranscriptomics Workflow (CoMW) implemented in a modular, reproducible structure. Metatranscriptomics typically uses short sequence reads, which can either be directly aligned to external reference databases ("assembly-free approach") or first assembled into contigs before alignment ("assembly-based approach"). We also compare CoMW (assembly-based implementation) with an assembly-free alternative workflow, using simulated and real-world metatranscriptomes from Arctic and temperate terrestrial environments. We evaluate their accuracy in precision and recall using generic and specialized hierarchical protein databases.

RESULTS

CoMW provided significantly fewer false-positive results, resulting in more precise identification and quantification of functional genes in metatranscriptomes. Using the comprehensive database M5nr, the assembly-based approach identified genes with only 0.6% false-positive results at thresholds ranging from inclusive to stringent compared with the assembly-free approach, which yielded up to 15% false-positive results. Using specialized databases (carbohydrate-active enzyme and nitrogen cycle), the assembly-based approach identified and quantified genes with 3-5 times fewer false-positive results. We also evaluated the impact of both approaches on real-world datasets.

CONCLUSIONS

We present an open source de novo assembly-based CoMW. Our benchmarking findings support assembling short reads into contigs before alignment to a reference database because this provides higher precision and minimizes false-positive results.

摘要

背景

宏转录组学已广泛应用于研究和量化微生物群落对外界刺激的活性。通过评估表达的基因,宏转录组学可以了解不同主要功能类群与环境之间的相互作用。在这里,我们提出了一种基于从头组装的比较宏转录组学工作流程(CoMW),该工作流程采用模块化、可重复的结构实现。宏转录组学通常使用短序列读段,可以直接与外部参考数据库对齐(“无组装方法”),也可以在对齐之前组装成 contigs(“基于组装的方法”)。我们还使用来自北极和温带陆地环境的模拟和真实宏转录组数据,比较了 CoMW(基于组装的实现)与无组装的替代工作流程。我们使用通用和专门的层次蛋白质数据库来评估它们在精度和召回率方面的准确性。

结果

CoMW 提供的假阳性结果明显较少,从而更精确地识别和定量宏转录组中的功能基因。使用综合数据库 M5nr,与无组装方法相比,基于组装的方法在从包容到严格的阈值范围内识别基因的假阳性结果仅为 0.6%,而无组装方法的假阳性结果高达 15%。使用专门的数据库(碳水化合物活性酶和氮循环),基于组装的方法识别和定量基因的假阳性结果减少了 3-5 倍。我们还评估了这两种方法对真实数据集的影响。

结论

我们提出了一种开源的基于从头组装的 CoMW。我们的基准测试结果支持在将短读段与参考数据库对齐之前将其组装成 contigs,因为这可以提供更高的精度并最大限度地减少假阳性结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/ddb4a655de6d/giz096fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/6842db103acd/giz096fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/15afb7549486/giz096fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/8c98c6ff81f2/giz096fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/ddb4a655de6d/giz096fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/6842db103acd/giz096fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/15afb7549486/giz096fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/8c98c6ff81f2/giz096fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb42/6667343/ddb4a655de6d/giz096fig4.jpg

相似文献

1
To assemble or not to resemble-A validated Comparative Metatranscriptomics Workflow (CoMW).组装还是不组装-一个经过验证的比较宏转录组学工作流程(CoMW)。
Gigascience. 2019 Aug 1;8(8). doi: 10.1093/gigascience/giz096.
2
Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads.快速而简单的基于蛋白质比对的微生物组测序读段中直系同源基因家族组装方法。
Microbiome. 2017 Jan 25;5(1):11. doi: 10.1186/s40168-017-0233-2.
3
Metagenomics workflow for hybrid assembly, differential coverage binning, metatranscriptomics and pathway analysis (MUFFIN).宏基因组学工作流程用于混合组装、差异覆盖分箱、宏转录组学和途径分析(MUFFIN)。
PLoS Comput Biol. 2021 Feb 9;17(2):e1008716. doi: 10.1371/journal.pcbi.1008716. eCollection 2021 Feb.
4
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
5
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
6
GRASPx: efficient homolog-search of short peptide metagenome database through simultaneous alignment and assembly.GRASPx:通过同时比对和组装实现短肽宏基因组数据库的高效同源搜索
BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):283. doi: 10.1186/s12859-016-1119-1.
7
STAble: a novel approach to de novo assembly of RNA-seq data and its application in a metabolic model network based metatranscriptomic workflow.STAble:一种新的 RNA-seq 数据从头组装方法及其在基于代谢模型网络的宏转录组学工作流中的应用。
BMC Bioinformatics. 2018 Jul 9;19(Suppl 7):184. doi: 10.1186/s12859-018-2174-6.
8
Depth dependent metatranscriptomes of the marine pico-/nanoplanktonic communities in the Gulf of Aqaba/Eilat during seasonal deep mixing.亚喀巴湾/埃拉特湾季节性深度混合期间海洋微微型/微型浮游生物群落的深度依赖性宏转录组
Mar Genomics. 2014 Dec;18 Pt B:93-5. doi: 10.1016/j.margen.2014.06.005. Epub 2014 Jun 28.
9
Functional analysis of metagenomes and metatranscriptomes using SEED and KEGG.使用 SEED 和 KEGG 对宏基因组和宏转录组进行功能分析。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S21. doi: 10.1186/1471-2105-12-S1-S21.
10
Metagenome and Metatranscriptome Analyses Using Protein Family Profiles.利用蛋白质家族谱进行宏基因组和宏转录组分析。
PLoS Comput Biol. 2016 Jul 11;12(7):e1004991. doi: 10.1371/journal.pcbi.1004991. eCollection 2016 Jul.

引用本文的文献

1
Microbial genetic potential differs among cryospheric habitats of the Damma glacier.微生物遗传潜能在达玛冰川的冰冻圈生境中存在差异。
Microb Genom. 2024 Oct;10(10). doi: 10.1099/mgen.0.001301.
2
Environmental community transcriptomics: strategies and struggles.环境群落转录组学:策略与挑战
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae033.
3
Total RNA analysis of the active microbiome on moving bed biofilm reactor carriers under incrementally increasing micropollutant concentrations.在逐渐增加的微污染物浓度下,移动床生物膜反应器载体上活性微生物组的总 RNA 分析。

本文引用的文献

1
Total RNA sequencing reveals multilevel microbial community changes and functional responses to wood ash application in agricultural and forest soil.总 RNA 测序揭示了农业和森林土壤中施木灰导致的微生物群落多水平变化和功能响应。
FEMS Microbiol Ecol. 2020 Mar 1;96(3). doi: 10.1093/femsec/fiaa016.
2
NCycDB: a curated integrative database for fast and accurate metagenomic profiling of nitrogen cycling genes.NCycDB:一个经过精心整理的综合数据库,用于快速准确地对氮循环基因进行宏基因组分析。
Bioinformatics. 2019 Mar 15;35(6):1040-1048. doi: 10.1093/bioinformatics/bty741.
3
Taxonomic and Functional Compositions Impacted by the Quality of Metatranscriptomic Assemblies.
FEMS Microbiol Ecol. 2024 Aug 13;100(9). doi: 10.1093/femsec/fiae098.
4
Investigating eukaryotic and prokaryotic diversity and functional potential in the cold and alkaline ikaite columns in Greenland.研究格陵兰岛寒冷碱性柱星石中的真核生物和原核生物多样性及功能潜力。
Front Microbiol. 2024 Apr 9;15:1358787. doi: 10.3389/fmicb.2024.1358787. eCollection 2024.
5
Searching for new plastic-degrading enzymes from the plastisphere of alpine soils using a metagenomic mining approach.利用宏基因组挖掘方法从高山土壤的塑料球层中寻找新的塑料降解酶。
PLoS One. 2024 Apr 5;19(4):e0300503. doi: 10.1371/journal.pone.0300503. eCollection 2024.
6
The Feather Moss Hylocomium splendens Affects the Transcriptional Profile of a Symbiotic Cyanobacterium in Relation to Acquisition and Turnover of Key Nutrients.羽藓华丽干藓影响共生蓝藻关键养分获取和周转相关的转录组特征。
Microb Ecol. 2024 Mar 1;87(1):49. doi: 10.1007/s00248-024-02363-6.
7
Abrupt permafrost thaw triggers activity of copiotrophs and microbiome predators.突然的永久冻土融化会引发富营养生物和微生物捕食者的活动。
FEMS Microbiol Ecol. 2023 Oct 17;99(11). doi: 10.1093/femsec/fiad123.
8
Distinct taxonomic and functional profiles of high Arctic and alpine permafrost-affected soil microbiomes.北极高纬度地区和高山永久冻土影响的土壤微生物群落的独特分类学和功能特征。
Environ Microbiome. 2023 Jun 16;18(1):54. doi: 10.1186/s40793-023-00509-6.
9
Reverse engineering environmental metatranscriptomes clarifies best practices for eukaryotic assembly.反向工程环境宏转录组阐明了真核生物组装的最佳实践。
BMC Bioinformatics. 2023 Mar 3;24(1):74. doi: 10.1186/s12859-022-05121-y.
10
The hitchhikers' guide to RNA sequencing and functional analysis.RNA 测序和功能分析的搭便车指南。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac529.
分类学和功能组成受宏转录组组装质量的影响。
Front Microbiol. 2018 Jun 20;9:1235. doi: 10.3389/fmicb.2018.01235. eCollection 2018.
4
SAMSA2: a standalone metatranscriptome analysis pipeline.SAMSA2:一个独立的宏转录组分析管道。
BMC Bioinformatics. 2018 May 21;19(1):175. doi: 10.1186/s12859-018-2189-z.
5
Benchmarking taxonomic assignments based on 16S rRNA gene profiling of the microbiota from commonly sampled environments.基于常见采样环境中微生物群 16S rRNA 基因谱的分类分配基准测试。
Gigascience. 2018 May 1;7(5). doi: 10.1093/gigascience/giy054.
6
Trees, fungi and bacteria: tripartite metatranscriptomics of a root microbiome responding to soil contamination.树木、真菌和细菌:响应土壤污染的根微生物组的三分体宏转录组学。
Microbiome. 2018 Mar 21;6(1):53. doi: 10.1186/s40168-018-0432-5.
7
Metatranscriptome of human faecal microbial communities in a cohort of adult men.人类粪便微生物群落的宏转录组研究。
Nat Microbiol. 2018 Mar;3(3):356-366. doi: 10.1038/s41564-017-0084-4. Epub 2018 Jan 15.
8
Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.宏基因组解读的批判性评估——宏基因组学软件的一项基准测试
Nat Methods. 2017 Nov;14(11):1063-1071. doi: 10.1038/nmeth.4458. Epub 2017 Oct 2.
9
IMP: a pipeline for reproducible reference-independent integrated metagenomic and metatranscriptomic analyses.IMP:一个用于可重复的、独立于参考序列的综合宏基因组和宏转录组分析的流程。
Genome Biol. 2016 Dec 16;17(1):260. doi: 10.1186/s13059-016-1116-8.
10
SWORD-a highly efficient protein database search.SWORD——一种高效的蛋白质数据库搜索工具。
Bioinformatics. 2016 Sep 1;32(17):i680-i684. doi: 10.1093/bioinformatics/btw445.