
Matrix and analysis metadata standards (MAMS) to facilitate harmonization and reproducibility of single-cell data.

Authors

Wang Yichen, Sarfraz Irzam, Teh Wei Kheng, Sokolov Artem, Herb Brian R, Creasy Heather H, Virshup Isaac, Dries Ruben, Degatano Kylee, Mahurkar Anup, Schnell Daniel J, Madrigal Pedro, Hilton Jason, Gehlenborg Nils, Tickle Timothy, Campbell Joshua D

Affiliations

Department of Medicine, Boston University School of Medicine, Boston, MA, USA.

European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton, Cambridgeshire, UK.

Publication information

bioRxiv. 2023 Mar 7:2023.03.06.531314. doi: 10.1101/2023.03.06.531314.

DOI:10.1101/2023.03.06.531314
PMID:36945543
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10028847/
Abstract

A large number of genomic and imaging datasets are being produced by consortia that seek to characterize healthy and disease tissues at single-cell resolution. While much effort has been devoted to capturing information related to biospecimen information and experimental procedures, the metadata standards that describe data matrices and the analysis workflows that produced them are relatively lacking. Detailed metadata schema related to data analysis are needed to facilitate sharing and interoperability across groups and to promote data provenance for reproducibility. To address this need, we developed the Matrix and Analysis Metadata Standards (MAMS) to serve as a resource for data coordinating centers and tool developers. We first curated several simple and complex "use cases" to characterize the types of feature-observation matrices (FOMs), annotations, and analysis metadata produced in different workflows. Based on these use cases, metadata fields were defined to describe the data contained within each matrix including those related to processing, modality, and subsets. Suggested terms were created for the majority of fields to aid in harmonization of metadata terms across groups. Additional provenance metadata fields were also defined to describe the software and workflows that produced each FOM. Finally, we developed a simple list-like schema that can be used to store MAMS information and implemented in multiple formats. Overall, MAMS can be used as a guide to harmonize analysis-related metadata which will ultimately facilitate integration of datasets across tools and consortia. MAMS specifications, use cases, and examples can be found at https://github.com/single-cell-mams/mams/.
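
To make the idea of a list-like schema more concrete, the following is a minimal sketch in Python. It is not the MAMS specification: the field names (`id`, `modality`, `processing`, `obs_subset`, `feature_subset`, `parent_id`, `provenance`), their values, and the `lineage` helper are assumptions chosen to mirror the categories named in the abstract (processing, modality, subsets, and provenance). The actual fields and suggested terms are defined in the MAMS repository linked above.

```python
import json

# Hypothetical records, one per feature-observation matrix (FOM).
# Field names and values are illustrative assumptions, not the official MAMS terms.
foms = [
    {
        "id": "fom1",
        "modality": "rna",
        "processing": "counts",
        "obs_subset": "full",
        "feature_subset": "full",
        "parent_id": None,  # raw matrix: no upstream FOM
        "provenance": {"tool": "example_quantifier", "version": "1.0"},
    },
    {
        "id": "fom2",
        "modality": "rna",
        "processing": "lognormalized",
        "obs_subset": "filtered",
        "feature_subset": "full",
        "parent_id": "fom1",
        "provenance": {"tool": "example_normalizer", "version": "2.3"},
    },
    {
        "id": "fom3",
        "modality": "rna",
        "processing": "reduction",
        "obs_subset": "filtered",
        "feature_subset": "variable",
        "parent_id": "fom2",
        "provenance": {"tool": "example_pca", "version": "0.9"},
    },
]


def lineage(records, fom_id):
    """Return the chain of FOM ids from the root matrix down to fom_id."""
    by_id = {record["id"]: record for record in records}
    chain = []
    current = fom_id
    while current is not None:
        chain.append(current)
        current = by_id[current]["parent_id"]
    return list(reversed(chain))


# A plain list of dictionaries serializes directly to JSON, which is one way
# a list-like schema could be stored independently of any analysis tool.
serialized = json.dumps({"FOM": foms}, indent=2)
restored = json.loads(serialized)["FOM"]

print(lineage(restored, "fom3"))  # ['fom1', 'fom2', 'fom3']
```

Because each record is a flat set of key-value pairs plus a pointer to its parent, the same structure could in principle be attached to different container formats, and the parent pointers are enough to reconstruct which processing steps led to a given matrix.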


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eb0/10028847/24eacc2bd797/nihpp-2023.03.06.531314v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eb0/10028847/e358e384c964/nihpp-2023.03.06.531314v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eb0/10028847/8f64ce040215/nihpp-2023.03.06.531314v1-f0002.jpg
