Filis Georgios, Bezantakou Dimitra, Rigkos Konstantinos, Noti Despina, Saridis Pavlos, Zarafeta Dimitra, Skretas Georgios
Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, 16672, Greece.
Institute of Chemical Biology, National Hellenic Research Foundation, Athens, 11635, Greece.
Adv Sci (Weinh). 2025 May;12(19):e2414877. doi: 10.1002/advs.202414877. Epub 2025 Mar 25.
The vast majority of microbial diversity remains unculturable, limiting access to novel biotechnological resources. Advances in metagenomics have expanded the understanding of microbial communities, yet targeted protein discovery remains challenging. This study introduces ProteoSeeker, a command-line tool for streamlined metagenomic protein identification and annotation. ProteoSeeker operates in two primary modes: i) Seek mode, which screens the proteins according to user-defined protein families, and ii) Taxonomy mode, which uncovers the taxonomy of the host organisms. By automating key steps, ProteoSeeker reduces computational complexity, enabling time-efficient and comprehensive metagenomic analysis for both specialized and nonspecialized users. The efficiency of ProteoSeeker to achieve targeted enzyme discovery is demonstrated by identifying extremophilic enzymes with desired biochemical features, such as amylases for starch hydrolysis and carbonic anhydrases for CO₂ capture applications. By democratizing functional metagenomics, ProteoSeeker is anticipated to accelerate biotechnology, synthetic biology, and biomedical research and innovation.
绝大多数微生物多样性仍无法培养,这限制了对新型生物技术资源的获取。宏基因组学的进展扩展了对微生物群落的理解,但靶向蛋白质发现仍然具有挑战性。本研究介绍了ProteoSeeker,这是一种用于简化宏基因组蛋白质鉴定和注释的命令行工具。ProteoSeeker以两种主要模式运行:i)搜索模式,根据用户定义的蛋白质家族筛选蛋白质;ii)分类模式,揭示宿主生物体的分类。通过自动化关键步骤,ProteoSeeker降低了计算复杂性,使专业和非专业用户都能进行高效且全面的宏基因组分析。通过鉴定具有所需生化特性的嗜极酶,如用于淀粉水解的淀粉酶和用于二氧化碳捕获应用的碳酸酐酶,证明了ProteoSeeker实现靶向酶发现的效率。通过使功能宏基因组学民主化,预计ProteoSeeker将加速生物技术、合成生物学以及生物医学研究与创新。