FAIR digital objects in environmental and life sciences should comprise workflow operation design data and method information for repeatability of study setups and reproducibility of results.

Authors

Harjes Janno, Link Anton, Weibulat Tanja, Triebel Dagmar, Rambold Gerhard

Affiliations

University of Bayreuth, Universitätsstraße 30, 95440 Bayreuth, Germany.

Staatliche Naturwissenschaftliche Sammlungen Bayerns, Menzinger Straße 67, 80638 München, Germany.

Publication information

Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa059.

DOI: 10.1093/database/baaa059
PMID: 32815545
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7439577/

Abstract

Repeatability of study setups and reproducibility of research results by underlying data are major requirements in science. Until now, abstract models for describing the structural logic of studies in environmental sciences are lacking and tools for data management are insufficient. Mandatory for repeatability and reproducibility is the use of sophisticated data management solutions going beyond data file sharing. Particularly, it implies maintenance of coherent data along workflows. Design data concern elements from elementary domains of operations being transformation, measurement and transaction. Operation design elements and method information are specified for each consecutive workflow segment from field to laboratory campaigns. The strict linkage of operation design element values, operation values and objects is essential. For enabling coherence of corresponding objects along consecutive workflow segments, the assignment of unique identifiers and the specification of their relations are mandatory. The abstract model presented here addresses these aspects, and the software DiversityDescriptions (DWB-DD) facilitates the management of thusly connected digital data objects and structures. DWB-DD allows for an individual specification of operation design elements and their linking to objects. Two workflow design use cases, one for DNA barcoding and another for cultivation of fungal isolates, are given. To publish those structured data, standard schema mapping and XML-provision of digital objects are essential. Schemas useful for this mapping include the Ecological Markup Language, the Schema for Meta-omics Data of Collection Objects and the Standard for Structured Descriptive Data. Data pipelines with DWB-DD include the mapping and conversion between schemas and functions for data publishing and archiving according to the Open Archival Information System standard. 
The setting allows for repeatability of study setups, reproducibility of study results and for supporting work groups to structure and maintain their data from the beginning of a study. The theory of 'FAIR++' digital objects is introduced.
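
The linkage described in the abstract (objects carrying unique identifiers, with explicit relations between consecutive workflow segments and design values attached to each object) can be illustrated with a minimal Python sketch. This is not the DWB-DD data model or API: the class `DigitalObject`, the helper `lineage`, the field names, and all sample values (segment names, domain assignments, depth, medium, marker) are hypothetical and chosen only for illustration. Only the three operation domains (transformation, measurement, transaction) are taken from the abstract.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional
from uuid import uuid4

# The three elementary operation domains named in the abstract.
DOMAINS = {"transformation", "measurement", "transaction"}


@dataclass
class DigitalObject:
    """Hypothetical record for one object in one workflow segment."""
    label: str
    segment: str                 # workflow segment that produced the object
    domain: str                  # operation domain (illustrative assignment)
    design: Dict[str, object]    # operation design element values
    derived_from: Optional[str] = None  # identifier of the preceding object
    uid: str = field(default_factory=lambda: str(uuid4()))

    def __post_init__(self) -> None:
        if self.domain not in DOMAINS:
            raise ValueError(f"unknown operation domain: {self.domain}")


def lineage(obj: DigitalObject, registry: Dict[str, DigitalObject]) -> List[DigitalObject]:
    """Follow derived_from links back to the first object, oldest first."""
    chain = [obj]
    while chain[-1].derived_from is not None:
        chain.append(registry[chain[-1].derived_from])
    return list(reversed(chain))


# Illustrative chain from field to laboratory; all values are made up.
soil = DigitalObject("soil sample", "field sampling", "transaction",
                     {"depth_cm": 10})
isolate = DigitalObject("fungal isolate", "cultivation", "transformation",
                        {"medium": "malt extract agar"}, derived_from=soil.uid)
extract = DigitalObject("DNA extract", "DNA barcoding", "measurement",
                        {"marker": "ITS"}, derived_from=isolate.uid)

registry = {o.uid: o for o in (soil, isolate, extract)}

for o in lineage(extract, registry):
    print(o.segment, "|", o.label, "|", o.design)
```

Because every object keeps its own identifier and a pointer to its predecessor, the design values used at each step can be recovered for any downstream object, which is the kind of coherence along workflow segments that the abstract calls mandatory.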

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/584f/7439577/28dffd264bd2/baaa059f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/584f/7439577/43cafa44a2fd/baaa059f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/584f/7439577/dbcea614fddd/baaa059f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/584f/7439577/5356f084049e/baaa059f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/584f/7439577/daa8f411a047/baaa059f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/584f/7439577/6a34ad80b06e/baaa059f6.jpg