• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

宏基因组测序数据的整合基因预测与肽段组装

Integrated gene prediction and peptide assembly of metagenomic sequencing data.

作者信息

Thippabhotla Sirisha, Liu Ben, Podgorny Adam, Yooseph Shibu, Yang Youngik, Zhang Jun, Zhong Cuncong

机构信息

Department of Electrical Engineering and Computer Science, The University of Kansas, Lawrence, KS 66045, USA.

Center for Computational Biology, The University of Kansas, Lawrence, KS 66045, USA.

出版信息

NAR Genom Bioinform. 2023 Mar 11;5(1):lqad023. doi: 10.1093/nargab/lqad023. eCollection 2023 Mar.

DOI:10.1093/nargab/lqad023
PMID:36915411
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10006731/
Abstract

Metagenomics is the study of all genomic content contained in given microbial communities. Metagenomic functional analysis aims to quantify protein families and reconstruct metabolic pathways from the metagenome. It plays a central role in understanding the interaction between the microbial community and its host or environment. functional analysis, which allows the discovery of novel protein families, remains challenging for high-complexity communities. There are currently three main approaches for recovering novel genes or proteins: nucleotide assembly, gene calling and peptide assembly. Unfortunately, their information dependency has been overlooked, and each has been formulated as an independent problem. In this work, we develop a sophisticated workflow called integrated Metagenomic Protein Predictor (iMPP), which leverages the information dependencies for better functional analysis. iMPP contains three novel modules: a hybrid assembly graph generation module, a graph-based gene calling module, and a peptide assembly-based refinement module. iMPP significantly improved the existing gene calling sensitivity on unassembled metagenomic reads, achieving a 92-97% recall rate at a high precision level (>85%). iMPP further allowed for more sensitive and accurate peptide assembly, recovering more reference proteins and delivering more hypothetical protein sequences. The high performance of iMPP can provide a more comprehensive and unbiased view of the microbial communities under investigation. iMPP is freely available from https://github.com/Sirisha-t/iMPP.

摘要

宏基因组学是对特定微生物群落中所含全部基因组内容的研究。宏基因组功能分析旨在对蛋白质家族进行定量,并从宏基因组中重建代谢途径。它在理解微生物群落与其宿主或环境之间的相互作用中起着核心作用。功能分析能够发现新的蛋白质家族,但对于高复杂性群落而言,仍然具有挑战性。目前有三种主要方法用于发现新基因或蛋白质:核苷酸组装、基因识别和肽段组装。遗憾的是,它们之间的信息依赖性被忽视了,并且每种方法都被当作一个独立的问题来处理。在这项工作中,我们开发了一种复杂的工作流程,称为集成宏基因组蛋白质预测器(iMPP),它利用信息依赖性来进行更好的功能分析。iMPP包含三个新颖的模块:一个混合组装图生成模块、一个基于图的基因识别模块和一个基于肽段组装的优化模块。iMPP显著提高了对未组装宏基因组读数的现有基因识别灵敏度,在高精度水平(>85%)下实现了92 - 97%的召回率。iMPP还能实现更灵敏、准确的肽段组装,找回更多的参考蛋白质并提供更多的假设蛋白质序列。iMPP的高性能能够为所研究的微生物群落提供更全面、无偏差的视角。可从https://github.com/Sirisha-t/iMPP免费获取iMPP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b7d513d566d3/lqad023fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b991bd5d57fa/lqad023fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b54b1975522a/lqad023fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b9a8d46dc0a2/lqad023fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/5a8a079f2576/lqad023fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/ce09e1a4f1f9/lqad023fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/1b210eedde06/lqad023fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/f5a9fb49aad6/lqad023fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b7d513d566d3/lqad023fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b991bd5d57fa/lqad023fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b54b1975522a/lqad023fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b9a8d46dc0a2/lqad023fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/5a8a079f2576/lqad023fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/ce09e1a4f1f9/lqad023fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/1b210eedde06/lqad023fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/f5a9fb49aad6/lqad023fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a35c/10006731/b7d513d566d3/lqad023fig8.jpg

相似文献

1
Integrated gene prediction and peptide assembly of metagenomic sequencing data.宏基因组测序数据的整合基因预测与肽段组装
NAR Genom Bioinform. 2023 Mar 11;5(1):lqad023. doi: 10.1093/nargab/lqad023. eCollection 2023 Mar.
2
De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline.使用原位生成的参考序列和基于新型BLAST的组装流程对高度多态的宏基因组数据进行从头组装。
BMC Bioinformatics. 2017 Apr 26;18(1):223. doi: 10.1186/s12859-017-1630-z.
3
METAMVGL: a multi-view graph-based metagenomic contig binning algorithm by integrating assembly and paired-end graphs.METAMVGL:一种基于多视图图的宏基因组序列拼接 bin 算法,通过整合组装图和配对末端图。
BMC Bioinformatics. 2021 Jul 22;22(Suppl 10):378. doi: 10.1186/s12859-021-04284-4.
4
DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data.DRAGoM:宏基因组数据中非编码RNA的分类与定量分析
Front Genet. 2021 May 5;12:669495. doi: 10.3389/fgene.2021.669495. eCollection 2021.
5
MetaGT: A pipeline for assembly of metatranscriptomes with the aid of metagenomic data.MetaGT:一种借助宏基因组数据组装宏转录组的流程。
Front Microbiol. 2022 Oct 28;13:981458. doi: 10.3389/fmicb.2022.981458. eCollection 2022.
6
Xander: employing a novel method for efficient gene-targeted metagenomic assembly.赞德:采用一种新颖的方法实现高效的靶向宏基因组组装。
Microbiome. 2015 Aug 5;3:32. doi: 10.1186/s40168-015-0093-6. eCollection 2015.
7
GraphBin: refined binning of metagenomic contigs using assembly graphs.GraphBin:使用组装图对宏基因组序列进行精细化分箱。
Bioinformatics. 2020 Jun 1;36(11):3307-3313. doi: 10.1093/bioinformatics/btaa180.
8
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.评估时间序列饮用水宏基因组的组装和分类策略。
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
9
Benchmarking Community-Wide Estimates of Growth Potential from Metagenomes Using Codon Usage Statistics.基于密码子使用统计数据对宏基因组进行社区级生长潜力估计的基准测试。
mSystems. 2022 Oct 26;7(5):e0074522. doi: 10.1128/msystems.00745-22. Epub 2022 Oct 3.
10
Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes.分子长读长测序有助于复杂土壤宏基因组的组装和基因组分箱。
mSystems. 2016 Jun 28;1(3). doi: 10.1128/mSystems.00045-16. eCollection 2016 May-Jun.

本文引用的文献

1
Prospective correlation between the patient microbiome with response to and development of immune-mediated adverse effects to immunotherapy in lung cancer.肺癌免疫治疗中患者微生物组与免疫介导不良反应的发生和发展的前瞻性相关性。
BMC Cancer. 2021 Jul 13;21(1):808. doi: 10.1186/s12885-021-08530-z.
2
DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data.DRAGoM:宏基因组数据中非编码RNA的分类与定量分析
Front Genet. 2021 May 5;12:669495. doi: 10.3389/fgene.2021.669495. eCollection 2021.
3
Relating Gut Microbiome and Its Modulating Factors to Immunotherapy in Solid Tumors: A Systematic Review.
实体瘤中肠道微生物群及其调节因子与免疫治疗的关系:一项系统综述
Front Oncol. 2021 Mar 18;11:642110. doi: 10.3389/fonc.2021.642110. eCollection 2021.
4
Engineering bacteria for cancer therapy.用于癌症治疗的工程菌。
Emerg Top Life Sci. 2019 Nov 11;3(5):623-629. doi: 10.1042/ETLS20190096.
5
HASLR: Fast Hybrid Assembly of Long Reads.HASLR:长读段的快速混合组装
iScience. 2020 Aug 21;23(8):101389. doi: 10.1016/j.isci.2020.101389. Epub 2020 Jul 25.
6
Recent Advances in Metabolic Pathways of Sulfate Reduction in Intestinal Bacteria.肠道细菌硫酸盐还原代谢途径的最新研究进展。
Cells. 2020 Mar 12;9(3):698. doi: 10.3390/cells9030698.
7
Biosynthesis of the Stress-Protectant and Chemical Chaperon Ectoine: Biochemistry of the Transaminase EctB.应激保护剂和化学伴侣埃克托因的生物合成:转氨酶EctB的生物化学
Front Microbiol. 2019 Dec 10;10:2811. doi: 10.3389/fmicb.2019.02811. eCollection 2019.
8
Microbial communities in the tropical air ecosystem follow a precise diel cycle.热带空气生态系统中的微生物群落遵循精确的昼夜周期。
Proc Natl Acad Sci U S A. 2019 Nov 12;116(46):23299-23308. doi: 10.1073/pnas.1908493116. Epub 2019 Oct 28.
9
Contribution of gut microbiota to metabolism of dietary glycine betaine in mice and in vitro colonic fermentation.肠道微生物群对小鼠和体外结肠发酵中膳食甜菜碱甘氨酸代谢的贡献。
Microbiome. 2019 Jul 10;7(1):103. doi: 10.1186/s40168-019-0718-2.
10
Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold.蛋白质水平的组装使宏基因组样本中蛋白质序列的回收率提高了许多倍。
Nat Methods. 2019 Jul;16(7):603-606. doi: 10.1038/s41592-019-0437-4. Epub 2019 Jun 24.