Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters.

作者信息

Baltoumas Fotis A, Karatzas Evangelos, Paez-Espino David, Venetsianou Nefeli K, Aplakidou Eleni, Oulas Anastasis, Finn Robert D, Ovchinnikov Sergey, Pafilis Evangelos, Kyrpides Nikos C, Pavlopoulos Georgios A

机构信息

Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece.

Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States.

出版信息

Front Bioinform. 2023 Mar 3;3:1157956. doi: 10.3389/fbinf.2023.1157956. eCollection 2023.

Abstract

Metagenomics has enabled accessing the genetic repertoire of natural microbial communities. Metagenome shotgun sequencing has become the method of choice for studying and classifying microorganisms from various environments. To this end, several methods have been developed to process and analyze the sequence data from raw reads to end-products such as predicted protein sequences or families. In this article, we provide a thorough review to simplify such processes and discuss the alternative methodologies that can be followed in order to explore biodiversity at the protein family level. We provide details for analysis tools and we comment on their scalability as well as their advantages and disadvantages. Finally, we report the available data repositories and recommend various approaches for protein family annotation related to phylogenetic distribution, structure prediction and metadata enrichment.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae34/10029925/66caa4079906/fbinf-03-1157956-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索