• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

优化大规模微生物群落样本的生物群落标记:利用神经网络和迁移学习

Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning.

作者信息

Wang Nan, Wang Teng, Ning Kang

机构信息

Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China.

出版信息

Environ Sci Ecotechnol. 2023 Jul 26;17:100304. doi: 10.1016/j.ese.2023.100304. eCollection 2024 Jan.

DOI:10.1016/j.ese.2023.100304
PMID:37635952
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10457426/
Abstract

Microbiome research has generated an extensive amount of data, resulting in a wealth of publicly accessible samples. Accurate annotation of these samples is crucial for effectively utilizing microbiome data across scientific disciplines. However, a notable challenge arises from the lack of essential annotations, particularly regarding collection location and sample biome information, which significantly hinders environmental microbiome research. In this study, we introduce Meta-Sorter, a novel approach utilizing neural networks and transfer learning, to enhance biome labeling for thousands of microbiome samples in the MGnify database that have incomplete information. Our findings demonstrate that Meta-Sorter achieved a remarkable accuracy rate of 96.7% in classifying samples among the 16,507 lacking detailed biome annotations. Notably, Meta-Sorter provides precise classifications for representative environmental samples that were previously ambiguously labeled as "Marine" in MGnify, thereby elucidating their specific origins in benthic and water column environments. Moreover, Meta-Sorter effectively distinguishes samples derived from human-environment interactions, enabling clear differentiation between environmental and human-related studies. By improving the completeness of biome label information for numerous microbial community samples, our research facilitates more accurate knowledge discovery across diverse disciplines, with particular implications for environmental research.

摘要

微生物组研究已产生了大量数据,形成了丰富的可公开获取的样本。准确注释这些样本对于跨学科有效利用微生物组数据至关重要。然而,一个显著的挑战源于缺乏必要的注释,特别是关于采集地点和样本生物群落信息,这严重阻碍了环境微生物组研究。在本研究中,我们引入了Meta-Sorter,这是一种利用神经网络和迁移学习的新方法,用于增强MGnify数据库中数千个信息不完整的微生物组样本的生物群落标签。我们的研究结果表明,Meta-Sorter在对16507个缺乏详细生物群落注释的样本进行分类时,准确率达到了96.7%。值得注意的是,Meta-Sorter为MGnify中先前被模糊标记为“海洋”的代表性环境样本提供了精确分类,从而阐明了它们在底栖和水柱环境中的具体来源。此外,Meta-Sorter有效地区分了源自人类与环境相互作用的样本,能够清晰地区分环境研究和与人类相关的研究。通过提高众多微生物群落样本的生物群落标签信息的完整性,我们的研究促进了跨学科更准确的知识发现,对环境研究具有特别重要的意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/1e4a0b9205d1/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/bfcc01cb544c/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/c7808127c606/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/663519d0e013/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/5db26b3e7bde/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/d5ab4012914f/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/c91f0a4ce7f2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/1f3bb591b765/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/ec2ce1a01084/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/1e4a0b9205d1/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/bfcc01cb544c/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/c7808127c606/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/663519d0e013/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/5db26b3e7bde/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/d5ab4012914f/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/c91f0a4ce7f2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/1f3bb591b765/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/ec2ce1a01084/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c7/10457426/1e4a0b9205d1/gr8.jpg

相似文献

1
Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning.优化大规模微生物群落样本的生物群落标记:利用神经网络和迁移学习
Environ Sci Ecotechnol. 2023 Jul 26;17:100304. doi: 10.1016/j.ese.2023.100304. eCollection 2024 Jan.
2
Big Data for a Small World: A Review on Databases and Resources for Studying Microbiomes.小世界中的大数据:微生物群落研究的数据库与资源综述
J Indian Inst Sci. 2023 Apr 5:1-17. doi: 10.1007/s41745-023-00370-z.
3
MGnify: the microbiome sequence data analysis resource in 2023.MGnify:2023 年的微生物组序列数据分析资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D753-D759. doi: 10.1093/nar/gkac1080.
4
Meta-Prism 2.0: Enabling algorithm and web server for ultra-fast, memory-efficient, and accurate analysis among millions of microbial community samples.Meta-Prism 2.0:用于超快速、内存高效且准确分析数百万个微生物群落样本的算法和网络服务器。
Gigascience. 2022 Jul 28;11. doi: 10.1093/gigascience/giac073.
5
EXPERT: transfer learning-enabled context-aware microbial community classification.专家:基于迁移学习的上下文感知微生物群落分类。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac396.
6
Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights.大型宏基因组数据集的机器学习荟萃分析:工具与生物学见解
PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. doi: 10.1371/journal.pcbi.1004977. eCollection 2016 Jul.
7
MarineMetagenomeDB: a public repository for curated and standardized metadata for marine metagenomes.海洋宏基因组数据库:一个用于整理和标准化海洋宏基因组元数据的公共存储库。
Environ Microbiome. 2022 Nov 18;17(1):57. doi: 10.1186/s40793-022-00449-7.
8
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
9
BIOME-Preserve: A novel storage and transport medium for preserving anaerobic microbiota samples for culture recovery.BIOME-Preserve:一种用于保存厌氧微生物群样本以进行培养恢复的新型储存和运输介质。
PLoS One. 2022 Jan 21;17(1):e0261820. doi: 10.1371/journal.pone.0261820. eCollection 2022.
10
A geographically-diverse collection of 418 human gut microbiome pathway genome databases.一个具有地域多样性的 418 个人类肠道微生物组途径基因组数据库集合。
Sci Data. 2017 Apr 11;4:170035. doi: 10.1038/sdata.2017.35.

引用本文的文献

1
Techniques for learning and transferring knowledge for microbiome-based classification and prediction: review and assessment.基于微生物组的分类和预测的知识学习与转移技术:综述与评估
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf015.

本文引用的文献

1
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2021 Jan 8;49(D1):D10-D17. doi: 10.1093/nar/gkaa892.
2
Ultrafast and accurate 16S rRNA microbial community analysis using Kraken 2.使用 Kraken 2 进行快速准确的 16S rRNA 微生物群落分析。
Microbiome. 2020 Aug 28;8(1):124. doi: 10.1186/s40168-020-00900-2.
3
Distribution of antibiotic resistance genes in an agriculturally disturbed lake in China: Their links with microbial communities, antibiotics, and water quality.
抗生素抗性基因在中国受农业干扰的湖泊中的分布:它们与微生物群落、抗生素和水质的关系。
J Hazard Mater. 2020 Jul 5;393:122426. doi: 10.1016/j.jhazmat.2020.122426. Epub 2020 Feb 28.
4
MGnify: the microbiome analysis resource in 2020.MGnify:2020 年的微生物组分析资源。
Nucleic Acids Res. 2020 Jan 8;48(D1):D570-D578. doi: 10.1093/nar/gkz1035.
5
Home chemical and microbial transitions across urbanization.家庭化学物质和微生物在城市化过程中的转变。
Nat Microbiol. 2020 Jan;5(1):108-115. doi: 10.1038/s41564-019-0593-4. Epub 2019 Nov 4.
6
GMrepo: a database of curated and consistently annotated human gut metagenomes.GMrepo:一个经过精心整理和一致注释的人类肠道宏基因组数据库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D545-D553. doi: 10.1093/nar/gkz764.
7
Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2.使用QIIME 2进行可重复、交互式、可扩展和可延伸的微生物组数据科学研究。
Nat Biotechnol. 2019 Aug;37(8):852-857. doi: 10.1038/s41587-019-0209-9.
8
Qiita: rapid, web-enabled microbiome meta-analysis.Qiita:快速、支持网络的微生物组元分析。
Nat Methods. 2018 Oct;15(10):796-798. doi: 10.1038/s41592-018-0141-9. Epub 2018 Oct 1.
9
The influence of ethnicity and geography on human gut microbiome composition.种族和地理位置对人类肠道微生物群组成的影响。
Nat Med. 2018 Oct;24(10):1495-1496. doi: 10.1038/s41591-018-0210-8.
10
Structure and function of the global topsoil microbiome.全球表土微生物组的结构与功能。
Nature. 2018 Aug;560(7717):233-237. doi: 10.1038/s41586-018-0386-6. Epub 2018 Aug 1.