• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 AMDirT 促进古代宏基因组数据的可访问、快速和适当处理。

Facilitating accessible, rapid, and appropriate processing of ancient metagenomic data with AMDirT.

机构信息

Cluster of Excellence "Balance of the Microverse", Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.

出版信息

F1000Res. 2024 May 28;12:926. doi: 10.12688/f1000research.134798.2. eCollection 2023.

DOI:10.12688/f1000research.134798.2
PMID:39262445
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11387932/
Abstract

BACKGROUND

Access to sample-level metadata is important when selecting public metagenomic sequencing datasets for reuse in new biological analyses. The Standards, Precautions, and Advances in Ancient Metagenomics community (SPAAM, https://spaam-community.org) has previously published AncientMetagenomeDir, a collection of curated and standardised sample metadata tables for metagenomic and microbial genome datasets generated from ancient samples. However, while sample-level information is useful for identifying relevant samples for inclusion in new projects, Next Generation Sequencing (NGS) library construction and sequencing metadata are also essential for appropriately reprocessing ancient metagenomic data. Currently, recovering information for downloading and preparing such data is difficult when laboratory and bioinformatic metadata is heterogeneously recorded in prose-based publications.

METHODS

Through a series of community-based hackathon events, AncientMetagenomeDir was updated to provide standardised library-level metadata of existing and new ancient metagenomic samples. In tandem, the companion tool 'AMDirT' was developed to facilitate rapid data filtering and downloading of ancient metagenomic data, as well as improving automated metadata curation and validation for AncientMetagenomeDir.

RESULTS

AncientMetagenomeDir was extended to include standardised metadata of over 6000 ancient metagenomic libraries. The companion tool 'AMDirT' provides both graphical- and command-line interface based access to such metadata for users from a wide range of computational backgrounds. We also report on errors with metadata reporting that appear to commonly occur during data upload and provide suggestions on how to improve the quality of data sharing by the community.

CONCLUSIONS

Together, both standardised metadata reporting and tooling will help towards easier incorporation and reuse of public ancient metagenomic datasets into future analyses.

摘要

背景

在选择公共宏基因组测序数据集以供新的生物分析重复使用时,访问样本级别的元数据非常重要。标准、预防措施和古代宏基因组学社区(SPAAM,https://spaam-community.org)之前发布了 AncientMetagenomeDir,这是一个经过精心整理和标准化的样本元数据表格的集合,用于从古代样本中生成的宏基因组和微生物基因组数据集。然而,虽然样本级别的信息对于识别相关样本以纳入新项目非常有用,但下一代测序 (NGS) 文库构建和测序元数据对于适当处理古代宏基因组数据也至关重要。目前,当实验室和生物信息学元数据以基于散文的出版物中的异质方式记录时,恢复用于下载和准备此类数据的信息非常困难。

方法

通过一系列基于社区的黑客马拉松活动,AncientMetagenomeDir 进行了更新,以提供现有和新的古代宏基因组样本的标准化库级元数据。与此同时,开发了配套工具 'AMDirT',以方便快速筛选和下载古代宏基因组数据,以及改进 AncientMetagenomeDir 的自动化元数据整理和验证。

结果

AncientMetagenomeDir 扩展到包含超过 6000 个古代宏基因组文库的标准化元数据。配套工具 'AMDirT' 为来自广泛计算背景的用户提供了对这些元数据的图形和命令行界面访问。我们还报告了在数据上传过程中似乎经常出现的元数据报告错误,并提供了有关如何提高社区数据共享质量的建议。

结论

标准化元数据报告和工具的结合将有助于更轻松地将公共古代宏基因组数据集纳入未来的分析中进行重复使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/79bcc0def238/f1000research-12-164792-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/3c81ad4b6a3d/f1000research-12-164792-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/5bcbd486e95b/f1000research-12-164792-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/79bcc0def238/f1000research-12-164792-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/3c81ad4b6a3d/f1000research-12-164792-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/5bcbd486e95b/f1000research-12-164792-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7337/11387982/79bcc0def238/f1000research-12-164792-g0002.jpg

相似文献

1
Facilitating accessible, rapid, and appropriate processing of ancient metagenomic data with AMDirT.使用 AMDirT 促进古代宏基因组数据的可访问、快速和适当处理。
F1000Res. 2024 May 28;12:926. doi: 10.12688/f1000research.134798.2. eCollection 2023.
2
Community-curated and standardised metadata of published ancient metagenomic samples with AncientMetagenomeDir.利用 AncientMetagenomeDir 对已发表的古代宏基因组样本进行社区策划和标准化元数据处理。
Sci Data. 2021 Jan 26;8(1):31. doi: 10.1038/s41597-021-00816-y.
3
grabseqs: simple downloading of reads and metadata from multiple next-generation sequencing data repositories.grabseqs:从多个下一代测序数据存储库中简单地下载读取和元数据。
Bioinformatics. 2020 Jun 1;36(11):3607-3609. doi: 10.1093/bioinformatics/btaa167.
4
AgroSeek: a system for computational analysis of environmental metagenomic data and associated metadata.AgroSeek:一个用于环境宏基因组数据及其相关元数据的计算分析系统。
BMC Bioinformatics. 2021 Mar 10;22(1):117. doi: 10.1186/s12859-021-04035-5.
5
UltraSEQ, a Universal Bioinformatic Platform for Information-Based Clinical Metagenomics and Beyond.UltraSEQ,一个基于信息的临床宏基因组学及其他领域的通用生物信息学平台。
Microbiol Spectr. 2023 Jun 15;11(3):e0416022. doi: 10.1128/spectrum.04160-22. Epub 2023 Apr 11.
6
Library Preparation and Sequencing Platform Introduce Bias in Metagenomic-Based Characterizations of Microbiomes.文库制备和测序平台会导致基于宏基因组学的微生物组特征分析产生偏差。
Microbiol Spectr. 2022 Apr 27;10(2):e0009022. doi: 10.1128/spectrum.00090-22. Epub 2022 Mar 15.
7
HAYSTAC: A Bayesian framework for robust and rapid species identification in high-throughput sequencing data.HAYSTAC:一种用于高通量测序数据中稳健快速物种鉴定的贝叶斯框架。
PLoS Comput Biol. 2022 Sep 30;18(9):e1010493. doi: 10.1371/journal.pcbi.1010493. eCollection 2022 Sep.
8
Towards predicting the geographical origin of ancient samples with metagenomic data.利用宏基因组数据预测古代样本的地理起源。
Sci Rep. 2024 Sep 18;14(1):21794. doi: 10.1038/s41598-023-40246-x.
9
The African Human Microbiome Portal: a public web portal of curated metagenomic metadata.非洲人类微生物组门户:一个经过策展的宏基因组元数据公共网络门户。
Database (Oxford). 2024 Jan 10;2024. doi: 10.1093/database/baad092.
10
A database of animal metagenomes.动物宏基因组数据库。
Sci Data. 2022 Jun 16;9(1):312. doi: 10.1038/s41597-022-01444-w.

引用本文的文献

1
A standards perspective on genomic data reusability and reproducibility.关于基因组数据可重用性和可重复性的标准视角。
Front Bioinform. 2025 Mar 10;5:1572937. doi: 10.3389/fbinf.2025.1572937. eCollection 2025.

本文引用的文献

1
aMeta: an accurate and memory-efficient ancient metagenomic profiling workflow.aMeta:一种准确且内存高效的古代宏基因组分析工作流程。
Genome Biol. 2023 Oct 23;24(1):242. doi: 10.1186/s13059-023-03083-9.
2
nf-core/mag: a best-practice pipeline for metagenome hybrid assembly and binning.nf-core/mag:宏基因组混合组装与分箱的最佳实践流程。
NAR Genom Bioinform. 2022 Feb 2;4(1):lqac007. doi: 10.1093/nargab/lqac007. eCollection 2022 Mar.
3
Reproducible, portable, and efficient ancient genome reconstruction with nf-core/eager.利用nf-core/eager进行可重复、便携且高效的古代基因组重建。
PeerJ. 2021 Mar 16;9:e10947. doi: 10.7717/peerj.10947. eCollection 2021.
4
Community-curated and standardised metadata of published ancient metagenomic samples with AncientMetagenomeDir.利用 AncientMetagenomeDir 对已发表的古代宏基因组样本进行社区策划和标准化元数据处理。
Sci Data. 2021 Jan 26;8(1):31. doi: 10.1038/s41597-021-00816-y.
5
HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes.人源宏基因组数据库:一个经过精心整理和标准化的人源宏基因组元数据公共存储库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D743-D750. doi: 10.1093/nar/gkaa1031.
6
The European Nucleotide Archive in 2020.2020 年的欧洲核苷酸档案库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D82-D85. doi: 10.1093/nar/gkaa1028.
7
gcMeta: a Global Catalogue of Metagenomics platform to support the archiving, standardization and analysis of microbiome data.gcMeta:一个元基因组学平台的全球目录,支持微生物组数据的归档、标准化和分析。
Nucleic Acids Res. 2019 Jan 8;47(D1):D637-D648. doi: 10.1093/nar/gky1008.
8
Genomes OnLine database (GOLD) v.7: updates and new features.基因组在线数据库(GOLD)v.7:更新和新功能。
Nucleic Acids Res. 2019 Jan 8;47(D1):D649-D659. doi: 10.1093/nar/gky977.
9
Bioconda: sustainable and comprehensive software distribution for the life sciences.生物conda:面向生命科学的可持续且全面的软件发行平台。
Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7.
10
Accessible, curated metagenomic data through ExperimentHub.通过ExperimentHub获取经过整理的可访问宏基因组数据。
Nat Methods. 2017 Oct 31;14(11):1023-1024. doi: 10.1038/nmeth.4468.