• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于社区分析的功能关联预测。

Functional association prediction by community profiling.

机构信息

Indiana University, 150 S. Woodlawn Ave, Bloomington, IN 47405, United States.

Indiana University, 150 S. Woodlawn Ave, Bloomington, IN 47405, United States.

出版信息

Methods. 2017 Oct 1;129:8-17. doi: 10.1016/j.ymeth.2017.04.018. Epub 2017 Apr 26.

DOI:10.1016/j.ymeth.2017.04.018
PMID:28454776
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5643221/
Abstract

Recent years have witnessed unprecedented accumulation of DNA sequences and therefore protein sequences (predicted from DNA sequences), due to the advances of sequencing technology. One of the major sources of the hypothetical proteins is the metagenomics research. Current annotation of metagenomes (collections of short metagenomic sequences or assemblies) relies on similarity searches against known gene/protein families, based on which functional profiles of microbial communities can be built. This practice, however, leaves out the hypothetical proteins, which may outnumber the known proteins for many microbial communities. On the other hand, we may ask: what can we gain from the large number of metagenomes made available by the metagenomic studies, for the annotation of metagenomic sequences as well as functional annotation of hypothetical proteins in general? Here we propose a community profiling approach for predicting functional associations between proteins: two proteins are predicted to be associated if they share similar presence and absence profiles (called community profiles) across microbial communities. Community profiling is conceptually similar to the phylogenetic profiling approach to functional prediction, however with fundamental differences. We tested different profile construction methods, the selection of reference metagenomes, and correlation metrics, among others, to optimize the performance of this new approach. We demonstrated that the community profiling approach alone slightly outperforms the phylogenetic profiling approach for associating proteins in species that are well represented by sequenced genomes, and combining phylogenetic and community profiling further improves (though only marginally) the prediction of functional association. Further we showed that community profiling method significantly outperforms phylogenetic profiling, revealing more functional associations, when applied to a more recently sequenced bacterial genome.

摘要

近年来,由于测序技术的进步,DNA 序列和蛋白质序列(由 DNA 序列预测得到)的积累前所未有。假设蛋白的主要来源之一是宏基因组学研究。目前,宏基因组(短宏基因组序列或组装的集合)的注释依赖于基于相似性搜索已知基因/蛋白质家族的方法,在此基础上可以构建微生物群落的功能谱。然而,这种做法忽略了假设蛋白,对于许多微生物群落,假设蛋白的数量可能超过已知蛋白。另一方面,我们可能会问:从宏基因组研究提供的大量宏基因组中,我们可以为宏基因组序列的注释以及一般假设蛋白的功能注释获得什么?在这里,我们提出了一种用于预测蛋白质之间功能关联的群落分析方法:如果两个蛋白质在微生物群落中具有相似的存在和缺失模式(称为群落模式),则预测它们存在关联。群落分析在概念上类似于功能预测的系统发育分析方法,但存在根本差异。我们测试了不同的模式构建方法、参考宏基因组的选择和相关指标等,以优化这种新方法的性能。我们证明,在具有测序基因组充分代表性的物种中,群落分析方法本身在关联蛋白质方面略优于系统发育分析方法,而将系统发育和群落分析结合起来进一步提高(尽管只是略有提高)功能关联的预测。此外,当应用于最近测序的细菌基因组时,我们发现群落分析方法显著优于系统发育分析方法,揭示了更多的功能关联。

相似文献

1
Functional association prediction by community profiling.基于社区分析的功能关联预测。
Methods. 2017 Oct 1;129:8-17. doi: 10.1016/j.ymeth.2017.04.018. Epub 2017 Apr 26.
2
Comparative analysis of functional metagenomic annotation and the mappability of short reads.功能宏基因组注释与短读长可映射性的比较分析。
PLoS One. 2014 Aug 22;9(8):e105776. doi: 10.1371/journal.pone.0105776. eCollection 2014.
3
A bioinformatics pipeline integrating predictive metagenomics profiling for the analysis of 16S rDNA/rRNA sequencing data originated from foods.一个整合了预测宏基因组分析的生物信息学流程,用于分析源自食品的 16S rDNA/rRNA 测序数据。
Food Microbiol. 2018 Dec;76:279-286. doi: 10.1016/j.fm.2018.05.009. Epub 2018 May 24.
4
Identification and Resolution of Microdiversity through Metagenomic Sequencing of Parallel Consortia.通过平行群落的宏基因组测序鉴定和解决微多样性
Appl Environ Microbiol. 2015 Oct 23;82(1):255-67. doi: 10.1128/AEM.02274-15. Print 2016 Jan 1.
5
An accurate and fast alignment-free method for profiling microbial communities.一种用于分析微生物群落的准确且快速的无比对方法。
J Bioinform Comput Biol. 2017 Jun;15(3):1740001. doi: 10.1142/S0219720017400017. Epub 2017 Mar 7.
6
Beyond classification: gene-family phylogenies from shotgun metagenomic reads enable accurate community analysis.超越分类:来自鸟枪法宏基因组读取的基因家族系统发育树可实现精确的群落分析。
BMC Genomics. 2013 Jun 22;14:419. doi: 10.1186/1471-2164-14-419.
7
A multi-source domain annotation pipeline for quantitative metagenomic and metatranscriptomic functional profiling.用于定量宏基因组和宏转录组功能分析的多源域注释管道。
Microbiome. 2018 Aug 28;6(1):149. doi: 10.1186/s40168-018-0532-2.
8
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut.比较不同的组装和注释工具在分析肠道中模拟病毒宏基因组群落中的应用。
BMC Genomics. 2014 Jan 18;15:37. doi: 10.1186/1471-2164-15-37.
9
Analysis and comparison of very large metagenomes with fast clustering and functional annotation.快速聚类和功能注释的超大宏基因组分析与比较。
BMC Bioinformatics. 2009 Oct 28;10:359. doi: 10.1186/1471-2105-10-359.
10
MG-RAST, a Metagenomics Service for Analysis of Microbial Community Structure and Function.MG-RAST,一种用于分析微生物群落结构和功能的宏基因组学服务。
Methods Mol Biol. 2016;1399:207-33. doi: 10.1007/978-1-4939-3369-3_13.

引用本文的文献

1
Metabolomics as an Emerging Tool in the Search for Astrobiologically Relevant Biomarkers.代谢组学作为寻找具有天体生物学相关性生物标志物的新兴工具。
Astrobiology. 2020 Oct;20(10):1251-1261. doi: 10.1089/ast.2019.2135. Epub 2020 Jun 17.
2
A repository of microbial marker genes related to human health and diseases for host phenotype prediction using microbiome data.一个与人类健康和疾病相关的微生物标记基因库,用于利用微生物组数据预测宿主表型。
Pac Symp Biocomput. 2019;24:236-247.
3
Machine learning methods and systems for data-driven discovery in biomedical informatics.用于生物医学信息学中数据驱动发现的机器学习方法和系统。
Methods. 2017 Oct 1;129:1-2. doi: 10.1016/j.ymeth.2017.09.011.

本文引用的文献

1
A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics.一种以图形为中心的宏蛋白质组学中宏基因组引导的肽和蛋白质鉴定方法。
PLoS Comput Biol. 2016 Dec 5;12(12):e1005224. doi: 10.1371/journal.pcbi.1005224. eCollection 2016 Dec.
2
An assessment of US microbiome research.美国微生物组研究评估。
Nat Microbiol. 2016 Jan 11;1:15015. doi: 10.1038/nmicrobiol.2015.15.
3
Taking it Personally: Personalized Utilization of the Human Microbiome in Health and Disease.从个人角度出发:人类微生物组在健康和疾病中的个性化利用。
Cell Host Microbe. 2016 Jan 13;19(1):12-20. doi: 10.1016/j.chom.2015.12.016.
4
Entropy-scaling search of massive biological data.海量生物数据的熵尺度搜索
Cell Syst. 2015 Aug 26;1(2):130-140. doi: 10.1016/j.cels.2015.08.004.
5
Fast and sensitive protein alignment using DIAMOND.使用 DIAMOND 进行快速灵敏的蛋白质比对。
Nat Methods. 2015 Jan;12(1):59-60. doi: 10.1038/nmeth.3176. Epub 2014 Nov 17.
6
Binning metagenomic contigs by coverage and composition.根据覆盖度和组成对宏基因组 contigs 进行 binning。
Nat Methods. 2014 Nov;11(11):1144-6. doi: 10.1038/nmeth.3103. Epub 2014 Sep 14.
7
Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.在不使用参考基因组的情况下,对复杂宏基因组样本中的基因组和遗传元件进行鉴定和组装。
Nat Biotechnol. 2014 Aug;32(8):822-8. doi: 10.1038/nbt.2939. Epub 2014 Jul 6.
8
An integrated catalog of reference genes in the human gut microbiome.人类肠道微生物组参考基因综合目录。
Nat Biotechnol. 2014 Aug;32(8):834-41. doi: 10.1038/nbt.2942. Epub 2014 Jul 6.
9
The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).SEED 与利用子系统技术进行快速微生物基因组注释(RAST)。
Nucleic Acids Res. 2014 Jan;42(Database issue):D206-14. doi: 10.1093/nar/gkt1226. Epub 2013 Nov 29.
10
Metaproteomics of cellulose methanisation under thermophilic conditions reveals a surprisingly high proteolytic activity.嗜热条件下纤维素甲烷化的宏蛋白质组学揭示了令人惊讶的高蛋白水解活性。
ISME J. 2014 Jan;8(1):88-102. doi: 10.1038/ismej.2013.120. Epub 2013 Aug 15.