• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于比较基因组学的微生物表型预测

Prediction of microbial phenotypes based on comparative genomics.

作者信息

Feldbauer Roman, Schulz Frederik, Horn Matthias, Rattei Thomas

出版信息

BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S1. doi: 10.1186/1471-2105-16-S14-S1. Epub 2015 Oct 2.

DOI:10.1186/1471-2105-16-S14-S1
PMID:26451672
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4603748/
Abstract

The accessibility of almost complete genome sequences of uncultivable microbial species from metagenomes necessitates computational methods predicting microbial phenotypes solely based on genomic data. Here we investigate how comparative genomics can be utilized for the prediction of microbial phenotypes. The PICA framework facilitates application and comparison of different machine learning techniques for phenotypic trait prediction. We have improved and extended PICA's support vector machine plug-in and suggest its applicability to large-scale genome databases and incomplete genome sequences. We have demonstrated the stability of the predictive power for phenotypic traits, not perturbed by the rapid growth of genome databases. A new software tool facilitates the in-depth analysis of phenotype models, which associate expected and unexpected protein functions with particular traits. Most of the traits can be reliably predicted in only 60-70% complete genomes. We have established a new phenotypic model that predicts intracellular microorganisms. Thereby we could demonstrate that also independently evolved phenotypic traits, characterized by genome reduction, can be reliably predicted based on comparative genomics. Our results suggest that the extended PICA framework can be used to automatically annotate phenotypes in near-complete microbial genome sequences, as generated in large numbers in current metagenomics studies.

摘要

来自宏基因组的不可培养微生物物种几乎完整的基因组序列的可获取性,使得仅基于基因组数据预测微生物表型的计算方法成为必要。在此,我们研究如何利用比较基因组学来预测微生物表型。PICA框架有助于应用和比较用于表型特征预测的不同机器学习技术。我们改进并扩展了PICA的支持向量机插件,并表明其适用于大规模基因组数据库和不完整基因组序列。我们已经证明了表型特征预测能力的稳定性,不受基因组数据库快速增长的干扰。一个新的软件工具便于对表型模型进行深入分析,该模型将预期和意外的蛋白质功能与特定特征联系起来。大多数特征在基因组仅60 - 70%完整时就能可靠预测。我们建立了一个预测细胞内微生物的新表型模型。由此我们可以证明,基于比较基因组学,以基因组缩减为特征的独立进化的表型特征也能可靠预测。我们的结果表明,扩展的PICA框架可用于自动注释当前宏基因组学研究中大量生成的近乎完整的微生物基因组序列中的表型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/925efe69963d/1471-2105-16-S14-S1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/d8ea416ef84b/1471-2105-16-S14-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/72113dfdcf3e/1471-2105-16-S14-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/33706ee72267/1471-2105-16-S14-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/4d960ac66379/1471-2105-16-S14-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/06026847d683/1471-2105-16-S14-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/925efe69963d/1471-2105-16-S14-S1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/d8ea416ef84b/1471-2105-16-S14-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/72113dfdcf3e/1471-2105-16-S14-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/33706ee72267/1471-2105-16-S14-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/4d960ac66379/1471-2105-16-S14-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/06026847d683/1471-2105-16-S14-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58eb/4603748/925efe69963d/1471-2105-16-S14-S1-6.jpg

相似文献

1
Prediction of microbial phenotypes based on comparative genomics.基于比较基因组学的微生物表型预测
BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S1. doi: 10.1186/1471-2105-16-S14-S1. Epub 2015 Oct 2.
2
Mapping genomic features to functional traits through microbial whole genome sequences.通过微生物全基因组序列将基因组特征映射到功能性状。
Int J Bioinform Res Appl. 2014;10(4-5):461-78. doi: 10.1504/IJBRA.2014.062995.
3
ICoVeR - an interactive visualization tool for verification and refinement of metagenomic bins.ICoVeR - 一种用于宏基因组分箱验证与优化的交互式可视化工具。
BMC Bioinformatics. 2017 May 2;18(1):233. doi: 10.1186/s12859-017-1653-5.
4
Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.优化和评估宏基因组组装微生物基因组的重建。
BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.
5
Recovering complete and draft population genomes from metagenome datasets.从宏基因组数据集中恢复完整和草图的种群基因组。
Microbiome. 2016 Mar 8;4:8. doi: 10.1186/s40168-016-0154-5.
6
Ten years of maintaining and expanding a microbial genome and metagenome analysis system.十年的微生物基因组和宏基因组分析系统的维护和扩展。
Trends Microbiol. 2015 Nov;23(11):730-741. doi: 10.1016/j.tim.2015.07.012. Epub 2015 Oct 14.
7
Large-scale metagenomic sequence clustering on map-reduce clusters.在MapReduce集群上进行大规模宏基因组序列聚类
J Bioinform Comput Biol. 2013 Feb;11(1):1340001. doi: 10.1142/S0219720013400015. Epub 2012 Dec 25.
8
MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics.MetCap:一种用于大规模靶向宏基因组学的生物信息学探针设计流程
BMC Bioinformatics. 2015 Feb 28;16(1):65. doi: 10.1186/s12859-015-0501-8.
9
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data.VirFinder:一种新型的基于 k-mer 的工具,用于从组装的宏基因组数据中识别病毒序列。
Microbiome. 2017 Jul 6;5(1):69. doi: 10.1186/s40168-017-0283-5.
10
CAMITAX: Taxon labels for microbial genomes.CAMITAX:微生物基因组的分类标签。
Gigascience. 2020 Jan 1;9(1). doi: 10.1093/gigascience/giz154.

引用本文的文献

1
Predicting bacterial phenotypic traits through improved machine learning using high-quality, curated datasets.通过使用高质量的、经过整理的数据集改进机器学习来预测细菌表型特征。
Commun Biol. 2025 Jun 7;8(1):897. doi: 10.1038/s42003-025-08313-3.
2
MAGs-centric crack: how long will, spore-positive and most , microsymbionts remain recalcitrant to axenic growth?以宏基因组组装基因组为中心的难题:孢子阳性且大多数微共生体对无菌生长保持顽固抗性的情况会持续多久?
Front Microbiol. 2024 Jul 31;15:1367490. doi: 10.3389/fmicb.2024.1367490. eCollection 2024.
3
Gut microbiome diversity within is negatively associated with human obesity.

本文引用的文献

1
Unusual biology across a group comprising more than 15% of domain Bacteria.跨越超过 15%的域细菌群体的不寻常生物学。
Nature. 2015 Jul 9;523(7559):208-11. doi: 10.1038/nature14486. Epub 2015 Jun 15.
2
Sequencing and beyond: integrating molecular 'omics' for microbial community profiling.测序及其他:整合分子“组学”技术进行微生物群落分析
Nat Rev Microbiol. 2015 Jun;13(6):360-72. doi: 10.1038/nrmicro3451. Epub 2015 Apr 27.
3
Intranuclear bacteria: inside the cellular control center of eukaryotes.核内细菌:真核细胞的细胞控制中心内。
肠道微生物多样性与人体肥胖呈负相关。
mSystems. 2024 Aug 20;9(8):e0062724. doi: 10.1128/msystems.00627-24. Epub 2024 Jul 16.
4
Exploring the landscape of symbiotic diversity and distribution in unicellular ciliated protists.探索单细胞纤毛原生动物共生多样性和分布的格局。
Microbiome. 2024 May 24;12(1):96. doi: 10.1186/s40168-024-01809-w.
5
Genome annotation: From human genetics to biodiversity genomics.基因组注释:从人类遗传学到生物多样性基因组学
Cell Genom. 2023 Aug 1;3(8):100375. doi: 10.1016/j.xgen.2023.100375. eCollection 2023 Aug 9.
6
: A Toolset for a Trait-Based Representation of Microbial Genomes.一种用于基于特征的微生物基因组表示的工具集。
Front Bioinform. 2022 Jul 22;2:918853. doi: 10.3389/fbinf.2022.918853. eCollection 2022.
7
MediaDive: the expert-curated cultivation media database.Mediadive:专家策划的栽培培养基数据库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1531-D1538. doi: 10.1093/nar/gkac803.
8
Inferring microbiota functions from taxonomic genes: a review.从分类基因推断微生物群落功能:综述。
Gigascience. 2022 Jan 12;11(1). doi: 10.1093/gigascience/giab090.
9
BacDive in 2022: the knowledge base for standardized bacterial and archaeal data.BacDive 在 2022 年:标准化细菌和古菌数据知识库。
Nucleic Acids Res. 2022 Jan 7;50(D1):D741-D746. doi: 10.1093/nar/gkab961.
10
Genome-centric metagenomics reveals insights into the evolution and metabolism of a new free-living group in Rhizobiales.基于基因组的宏基因组学揭示了 Rhizobiales 中新的自由生活群体的进化和代谢的新见解。
BMC Microbiol. 2021 Oct 28;21(1):294. doi: 10.1186/s12866-021-02354-4.
Trends Cell Biol. 2015 Jun;25(6):339-46. doi: 10.1016/j.tcb.2015.01.002. Epub 2015 Feb 10.
4
Update on RefSeq microbial genomes resources.RefSeq微生物基因组资源更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D599-605. doi: 10.1093/nar/gku1062. Epub 2014 Dec 15.
5
An ontology for microbial phenotypes.微生物表型的本体论。
BMC Microbiol. 2014 Nov 30;14:294. doi: 10.1186/s12866-014-0294-3.
6
Expanded microbial genome coverage and improved protein family annotation in the COG database.COG数据库中微生物基因组覆盖范围的扩大及蛋白质家族注释的改进。
Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223. Epub 2014 Nov 26.
7
The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements.2015年的OMA直系同源数据库:功能预测、对植物的更好支持、共线性视图及其他改进
Nucleic Acids Res. 2015 Jan;43(Database issue):D240-9. doi: 10.1093/nar/gku1158. Epub 2014 Nov 15.
8
STRING v10: protein-protein interaction networks, integrated over the tree of life.STRING v10:整合了整个生命之树的蛋白质-蛋白质相互作用网络。
Nucleic Acids Res. 2015 Jan;43(Database issue):D447-52. doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.
9
Carbohydrate metabolism in Archaea: current insights into unusual enzymes and pathways and their regulation.古菌中的碳水化合物代谢:对不常见酶和途径及其调控的最新认识。
Microbiol Mol Biol Rev. 2014 Mar;78(1):89-175. doi: 10.1128/MMBR.00041-13.
10
eggNOG v4.0: nested orthology inference across 3686 organisms.eggNOG v4.0:跨越 3686 个生物体的嵌套同源推断。
Nucleic Acids Res. 2014 Jan;42(Database issue):D231-9. doi: 10.1093/nar/gkt1253. Epub 2013 Dec 1.