• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物医学数据集和工作流程中出处的方法与标准:范围综述方案

Approaches and Criteria for Provenance in Biomedical Data Sets and Workflows: Protocol for a Scoping Review.

作者信息

Gierend Kerstin, Krüger Frank, Waltemath Dagmar, Fünfgeld Maximilian, Ganslandt Thomas, Zeleke Atinkut Alamirrew

机构信息

Department of Biomedical Informatics at the Center for Preventive Medicine and Digital Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany.

Department of Communications Engineering, University of Rostock, Rostock, Germany.

出版信息

JMIR Res Protoc. 2021 Nov 22;10(11):e31750. doi: 10.2196/31750.

DOI:10.2196/31750
PMID:34813494
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8663663/
Abstract

BACKGROUND

Provenance supports the understanding of data genesis, and it is a key factor to ensure the trustworthiness of digital objects containing (sensitive) scientific data. Provenance information contributes to a better understanding of scientific results and fosters collaboration on existing data as well as data sharing. This encompasses defining comprehensive concepts and standards for transparency and traceability, reproducibility, validity, and quality assurance during clinical and scientific data workflows and research.

OBJECTIVE

The aim of this scoping review is to investigate existing evidence regarding approaches and criteria for provenance tracking as well as disclosing current knowledge gaps in the biomedical domain. This review covers modeling aspects as well as metadata frameworks for meaningful and usable provenance information during creation, collection, and processing of (sensitive) scientific biomedical data. This review also covers the examination of quality aspects of provenance criteria.

METHODS

This scoping review will follow the methodological framework by Arksey and O'Malley. Relevant publications will be obtained by querying PubMed and Web of Science. All papers in English language will be included, published between January 1, 2006 and March 23, 2021. Data retrieval will be accompanied by manual search for grey literature. Potential publications will then be exported into a reference management software, and duplicates will be removed. Afterwards, the obtained set of papers will be transferred into a systematic review management tool. All publications will be screened, extracted, and analyzed: title and abstract screening will be carried out by 4 independent reviewers. Majority vote is required for consent to eligibility of papers based on the defined inclusion and exclusion criteria. Full-text reading will be performed independently by 2 reviewers and in the last step, key information will be extracted on a pretested template. If agreement cannot be reached, the conflict will be resolved by a domain expert. Charted data will be analyzed by categorizing and summarizing the individual data items based on the research questions. Tabular or graphical overviews will be given, if applicable.

RESULTS

The reporting follows the extension of the Preferred Reporting Items for Systematic reviews and Meta-Analyses statements for Scoping Reviews. Electronic database searches in PubMed and Web of Science resulted in 469 matches after deduplication. As of September 2021, the scoping review is in the full-text screening stage. The data extraction using the pretested charting template will follow the full-text screening stage. We expect the scoping review report to be completed by February 2022.

CONCLUSIONS

Information about the origin of healthcare data has a major impact on the quality and the reusability of scientific results as well as follow-up activities. This protocol outlines plans for a scoping review that will provide information about current approaches, challenges, or knowledge gaps with provenance tracking in biomedical sciences.

INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): DERR1-10.2196/31750.

摘要

背景

出处有助于理解数据起源,是确保包含(敏感)科学数据的数字对象可信度的关键因素。出处信息有助于更好地理解科学成果,促进对现有数据的协作以及数据共享。这包括在临床和科学数据工作流程及研究中定义有关透明度、可追溯性、可重复性、有效性和质量保证的全面概念和标准。

目的

本范围综述的目的是调查有关出处跟踪方法和标准的现有证据,并揭示生物医学领域当前的知识空白。本综述涵盖建模方面以及在(敏感)生物医学科学数据的创建、收集和处理过程中用于生成有意义且可用的出处信息的元数据框架。本综述还包括对出处标准质量方面的考察。

方法

本范围综述将遵循阿克西和奥马利的方法框架。通过查询PubMed和科学网获取相关出版物。将纳入2006年1月1日至2021年3月23日期间发表的所有英文论文。数据检索将辅以对灰色文献的手动搜索。然后将潜在出版物导出到参考管理软件中,并去除重复项。之后,将获得的论文集转移到系统综述管理工具中。对所有出版物进行筛选、提取和分析:标题和摘要筛选将由4名独立评审员进行。根据定义的纳入和排除标准,论文入选需多数投票同意。全文阅读将由2名评审员独立进行,最后一步,将在预先测试的模板上提取关键信息。如果无法达成一致,将由领域专家解决冲突。将通过根据研究问题对各个数据项进行分类和总结来分析图表数据。如有适用,将给出表格或图形概述。

结果

报告遵循系统综述和Meta分析的首选报告项目扩展声明用于范围综述。在PubMed和科学网中进行电子数据库搜索,去重后得到469条匹配结果。截至2021年9月,范围综述处于全文筛选阶段。使用预先测试的图表模板进行数据提取将在全文筛选阶段之后进行。我们预计范围综述报告将于2022年2月完成。

结论

医疗保健数据的来源信息对科学成果的质量和可重用性以及后续活动有重大影响。本方案概述了一项范围综述的计划,该综述将提供有关生物医学科学中出处跟踪的当前方法、挑战或知识空白的信息。

国际注册报告识别号(IRRID):DERR1-10.2196/31750。

相似文献

1
Approaches and Criteria for Provenance in Biomedical Data Sets and Workflows: Protocol for a Scoping Review.生物医学数据集和工作流程中出处的方法与标准:范围综述方案
JMIR Res Protoc. 2021 Nov 22;10(11):e31750. doi: 10.2196/31750.
2
Provenance Information for Biomedical Data and Workflows: Scoping Review.生物医学数据和工作流程的出处信息:范围综述。
J Med Internet Res. 2024 Aug 23;26:e51297. doi: 10.2196/51297.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Ethics of Procuring and Using Organs or Tissue from Infants and Newborns for Transplantation, Research, or Commercial Purposes: Protocol for a Bioethics Scoping Review.从婴儿和新生儿获取器官或组织用于移植、研究或商业目的的伦理问题:生物伦理学范围审查方案
Wellcome Open Res. 2024 Dec 5;9:717. doi: 10.12688/wellcomeopenres.23235.1. eCollection 2024.
5
Initiatives, Concepts, and Implementation Practices of FAIR (Findable, Accessible, Interoperable, and Reusable) Data Principles in Health Data Stewardship Practice: Protocol for a Scoping Review.健康数据管理实践中FAIR(可查找、可访问、可互操作和可重用)数据原则的倡议、概念及实施实践:一项范围综述方案
JMIR Res Protoc. 2021 Feb 2;10(2):e22505. doi: 10.2196/22505.
6
Data Provenance in Biomedical Research: Scoping Review.生物医学研究中的数据溯源:范围综述。
J Med Internet Res. 2023 Mar 27;25:e42289. doi: 10.2196/42289.
7
Data Quality- and Utility-Compliant Anonymization of Common Data Model-Harmonized Electronic Health Record Data: Protocol for a Scoping Review.符合数据质量和效用要求的通用数据模型协调电子健康记录数据匿名化:范围审查方案
JMIR Res Protoc. 2023 Aug 11;12:e46471. doi: 10.2196/46471.
8
Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review.健康数据治理中可发现性、可访问性、互操作性和可重用性数据原则的举措、概念和实施实践:范围综述。
J Med Internet Res. 2023 Aug 28;25:e45013. doi: 10.2196/45013.
9
Challenges and Facilitation Approaches for the Participatory Design of AI-Based Clinical Decision Support Systems: Protocol for a Scoping Review.基于人工智能的临床决策支持系统参与式设计的挑战和促进方法:系统评价方案。
JMIR Res Protoc. 2024 Sep 5;13:e58185. doi: 10.2196/58185.
10
Data Visualization Support for Tumor Boards and Clinical Oncology: Protocol for a Scoping Review.肿瘤病例讨论会和临床肿瘤学的数据可视化支持:一项范围综述的方案
JMIR Res Protoc. 2024 Mar 5;13:e53627. doi: 10.2196/53627.

引用本文的文献

1
Applicability of Retrospective and Prospective Gender Scores for Clinical and Health Data: Protocol for a Scoping Review.回顾性和前瞻性性别评分在临床与健康数据中的适用性:一项范围综述方案
JMIR Res Protoc. 2025 Jan 20;14:e57669. doi: 10.2196/57669.
2
Provenance Information for Biomedical Data and Workflows: Scoping Review.生物医学数据和工作流程的出处信息:范围综述。
J Med Internet Res. 2024 Aug 23;26:e51297. doi: 10.2196/51297.
3
Data Visualization Support for Tumor Boards and Clinical Oncology: Protocol for a Scoping Review.肿瘤病例讨论会和临床肿瘤学的数据可视化支持:一项范围综述的方案
JMIR Res Protoc. 2024 Mar 5;13:e53627. doi: 10.2196/53627.
4
The Reporting Quality of Machine Learning Studies on Pediatric Diabetes Mellitus: Systematic Review.机器学习在儿科糖尿病研究中的报告质量:系统评价。
J Med Internet Res. 2024 Jan 19;26:e47430. doi: 10.2196/47430.
5
The Status of Data Management Practices Across German Medical Data Integration Centers: Mixed Methods Study.德国医学数据集成中心的数据管理实践现状:混合方法研究。
J Med Internet Res. 2023 Nov 8;25:e48809. doi: 10.2196/48809.
6
Data Quality- and Utility-Compliant Anonymization of Common Data Model-Harmonized Electronic Health Record Data: Protocol for a Scoping Review.符合数据质量和效用要求的通用数据模型协调电子健康记录数据匿名化:范围审查方案
JMIR Res Protoc. 2023 Aug 11;12:e46471. doi: 10.2196/46471.
7
Ten Topics to Get Started in Medical Informatics Research.医学信息学研究入门的十个主题
J Med Internet Res. 2023 Jul 24;25:e45948. doi: 10.2196/45948.
8
Data Provenance in Biomedical Research: Scoping Review.生物医学研究中的数据溯源:范围综述。
J Med Internet Res. 2023 Mar 27;25:e42289. doi: 10.2196/42289.

本文引用的文献

1
Reproducibility in machine learning for health research: Still a ways to go.机器学习在健康研究中的可重复性:仍有很长的路要走。
Sci Transl Med. 2021 Mar 24;13(586). doi: 10.1126/scitranslmed.abb1655.
2
Enhancing Traceability in Clinical Research Data through a Metadata Framework.通过元数据框架增强临床研究数据的可追溯性。
Methods Inf Med. 2020 May;59(2-03):75-85. doi: 10.1055/s-0040-1714393. Epub 2020 Sep 7.
3
Exchange of Clinical and Omics Data According to FAIR Principles: A Review of Open Source Solutions.根据 FAIR 原则交换临床和组学数据:开源解决方案综述。
Methods Inf Med. 2020 Jun;59(S 01):e13-e20. doi: 10.1055/s-0040-1712968. Epub 2020 Jul 3.
4
Data Provenance Standards and Recommendations for FAIR Data.FAIR数据的数据来源标准与建议
Stud Health Technol Inform. 2020 Jun 16;270:1237-1238. doi: 10.3233/SHTI200380.
5
Decentralised provenance for healthcare data.医疗数据的去中心化来源。
Int J Med Inform. 2020 Sep;141:104197. doi: 10.1016/j.ijmedinf.2020.104197. Epub 2020 Jun 8.
6
Sharing interoperable workflow provenance: A review of best practices and their practical application in CWLProv.共享可互操作的工作流溯源:最佳实践综述及其在 CWLProv 中的实际应用。
Gigascience. 2019 Nov 1;8(11). doi: 10.1093/gigascience/giz095.
7
Fine-Grained Provenance for Matching & ETL.用于匹配和ETL的细粒度溯源
Proc Int Conf Data Eng. 2019 Apr;2019:184-195. doi: 10.1109/ICDE.2019.00025. Epub 2019 Jun 6.
8
The French Health Data Hub and the German Medical Informatics Initiatives: Two National Projects to Promote Data Sharing in Healthcare.法国健康数据中心与德国医学信息学计划:促进医疗保健数据共享的两个国家项目。
Yearb Med Inform. 2019 Aug;28(1):195-202. doi: 10.1055/s-0039-1677917. Epub 2019 Aug 16.
9
Embedding data provenance into the Learning Health System to facilitate reproducible research.将数据来源嵌入学习型健康系统以促进可重复研究。
Learn Health Syst. 2016 Dec 27;1(2):e10019. doi: 10.1002/lrh2.10019. eCollection 2017 Apr.
10
PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation.PRISMA 扩展用于范围审查 (PRISMA-ScR): 清单和解释。
Ann Intern Med. 2018 Oct 2;169(7):467-473. doi: 10.7326/M18-0850. Epub 2018 Sep 4.