Embedding data provenance into the Learning Health System to facilitate reproducible research.

Author information

Curcin Vasa

Affiliations

Division of Health and Social Care Research King's College London London UK.

Department of Informatics King's College London London UK.

Publication information

Learn Health Syst. 2016 Dec 27;1(2):e10019. doi: 10.1002/lrh2.10019. eCollection 2017 Apr.

DOI:10.1002/lrh2.10019

PMID:31245557

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6516719/

Abstract

INTRODUCTION

The learning health system (LHS) community has taken up the challenge of bringing the complex relationship between clinical research and practice into this brave new world. At the heart of the LHS vision is the notion of routine capture, transformation, and dissemination of data and knowledge, with various use cases, such as clinical studies, quality improvement initiatives, and decision support, constructed on top of specific routes that the data is taking through the system. In order to stop this increased data volume and analytical complexity from obfuscating the research process, it is essential to establish trust in the system through implementing reproducibility and auditability throughout the workflow.

METHODS

Data provenance technologies can automatically capture the trace of the research task and resulting data, thereby facilitating reproducible research. While some computational domains, such as bioinformatics, have embraced the technology through provenance-enabled execution middleware, disciplines based on distributed, heterogeneous software, such as medical research, are only starting on the road to adoption, motivated by institutional pressures to improve transparency and reproducibility.
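To make "capturing the trace of the research task and resulting data" concrete, here is a minimal sketch in Python. It is not the TRANSFoRm implementation: the `ProvenanceLog` class, the `fingerprint` helper, and the toy blood-pressure data are all hypothetical. They only illustrate the general idea of recording which data items each step used and generated, so that a result can later be traced back through the steps that produced it.

```python
import hashlib
import json
from dataclasses import dataclass, field


def fingerprint(value) -> str:
    """Return a short content hash so a data item can be identified later."""
    payload = json.dumps(value, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:12]


@dataclass
class ProvenanceLog:
    """Hypothetical in-memory store of 'used' and 'generated by' relations."""
    used: list = field(default_factory=list)          # (activity, entity_id) pairs
    generated_by: dict = field(default_factory=dict)  # entity_id -> activity

    def run(self, activity: str, func, *inputs):
        """Execute func on inputs and record what it used and what it generated."""
        result = func(*inputs)
        for item in inputs:
            self.used.append((activity, fingerprint(item)))
        self.generated_by[fingerprint(result)] = activity
        return result

    def lineage(self, value) -> list:
        """Walk back from a data item through the activities that produced it."""
        chain, seen = [], set()
        frontier = [fingerprint(value)]
        while frontier:
            entity = frontier.pop()
            activity = self.generated_by.get(entity)
            if activity is None or activity in seen:
                continue  # a source item, or an activity already visited
            seen.add(activity)
            chain.append(activity)
            frontier.extend(e for a, e in self.used if a == activity)
        return chain


# Toy data, invented for illustration only.
log = ProvenanceLog()
raw = [{"age": 71, "sbp": 150}, {"age": 34, "sbp": 118}, {"age": 66, "sbp": 141}]
cohort = log.run("select_over_65", lambda rows: [r for r in rows if r["age"] > 65], raw)
mean_sbp = log.run("mean_sbp", lambda rows: sum(r["sbp"] for r in rows) / len(rows), cohort)
print(log.lineage(mean_sbp))  # activities from the result back to the raw data
```

Identifying items by content hash rather than by variable name is a design choice of this sketch: two steps that handle the same data are linked without any explicit wiring, at the cost of requiring JSON-serializable values.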

RESULTS

Guided by the experiences of the TRANSFoRm project, we present the opportunities that data provenance offers to the LHS community. We illustrate how provenance can facilitate documenting 21 CFR Part 11 compliance for Food and Drug Administration submissions and provide auditability for decisions made by decision support tools, and we discuss the transformational effect of routine provenance capture on data privacy, study reporting, and publishing medical research.

CONCLUSIONS

If the scaling up of the LHS is to succeed, we have to embed mechanisms to verify trust in the system inside our research instruments. In the research world increasingly reliant on electronic tools, provenance gives us a lingua franca to achieve traceability, which we have shown to be essential to building these mechanisms. To realize the vision of making computable provenance a feasible approach to implementing reproducibility in the LHS, we have to provide viable mechanisms for adoption. These include defining meaningful provenance models for problem domains and also introducing provenance support to existing tools in a minimally invasive manner.
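One way to picture "introducing provenance support to existing tools in a minimally invasive manner" is a wrapper that records each call without changing the wrapped code. The sketch below assumes a plain Python function as the "existing tool". The `traced` decorator, the `AUDIT_TRAIL` list, and the `flag_hypertension` rule with its threshold are hypothetical illustrations, not part of the paper or of any real decision support system.

```python
import functools
from datetime import datetime, timezone

AUDIT_TRAIL = []  # hypothetical sink; a real system would write to a provenance store


def traced(func):
    """Wrap an existing function so that each call appends an audit record."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        result = func(*args, **kwargs)
        AUDIT_TRAIL.append({
            "activity": func.__name__,
            "inputs": {"args": args, "kwargs": kwargs},
            "output": result,
            "ended_at": datetime.now(timezone.utc).isoformat(),
        })
        return result
    return wrapper


@traced
def flag_hypertension(systolic: int, threshold: int = 140) -> bool:
    """Illustrative rule only; the threshold is an assumption, not clinical guidance."""
    return systolic >= threshold


flag_hypertension(150)
flag_hypertension(120, threshold=130)

for record in AUDIT_TRAIL:
    print(record["activity"], record["inputs"], "->", record["output"])
```

The only change to the existing function is the single `@traced` line, which is what keeps the approach minimally invasive; the body of the rule stays untouched, and every decision it makes leaves a record that can be audited later.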

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d53/6516719/3516769951c8/LRH2-1-e10019-g001.jpg

Similar articles

Templates as a method for implementing data provenance in decision support systems.

J Biomed Inform. 2017 Jan;65:1-21. doi: 10.1016/j.jbi.2016.10.022. Epub 2016 Nov 14.

Traceable Research Data Sharing in a German Medical Data Integration Center With FAIR (Findability, Accessibility, Interoperability, and Reusability)-Geared Provenance Implementation: Proof-of-Concept Study.

JMIR Form Res. 2023 Dec 7;7:e50027. doi: 10.2196/50027.

Providing traceability for neuroimaging analyses.

Int J Med Inform. 2013 Sep;82(9):882-94. doi: 10.1016/j.ijmedinf.2013.05.005. Epub 2013 Jun 12.

Leaders' perspectives on learning health systems: a qualitative study.

BMC Health Serv Res. 2020 Nov 26;20(1):1087. doi: 10.1186/s12913-020-05924-w.

Application of Data Provenance in Healthcare Analytics Software: Information Visualisation of User Activities.

AMIA Jt Summits Transl Sci Proc. 2018 May 18;2017:263-272. eCollection 2018.

Provenance for distributed biomedical workflow execution.

Stud Health Technol Inform. 2012;175:91-100.

Recording provenance of workflow runs with RO-Crate.

PLoS One. 2024 Sep 10;19(9):e0309210. doi: 10.1371/journal.pone.0309210. eCollection 2024.

Sharing interoperable workflow provenance: A review of best practices and their practical application in CWLProv.

Gigascience. 2019 Nov 1;8(11). doi: 10.1093/gigascience/giz095.

Requirements and validation of a prototype learning health system for clinical diagnosis.

Learn Health Syst. 2017 May 31;1(4):e10026. doi: 10.1002/lrh2.10026. eCollection 2017 Oct.

Cited by

Bridging the Scientific Knowledge Gap and Reproducibility: A Survey of Provenance, Assertion and Evidence Ontologies.

Proc Int World Wide Web Conf. 2025 Apr-May;2025(Companion):924-928. doi: 10.1145/3701716.3715483. Epub 2025 May 23.

Provenance Information for Biomedical Data and Workflows: Scoping Review.

J Med Internet Res. 2024 Aug 23;26:e51297. doi: 10.2196/51297.

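Traceable Research Data Sharing in a German Medical Data Integration Center With FAIR (Findability, Accessibility, Interoperability, and Reusability)-Geared Provenance Implementation: Proof-of-Concept Study.

JMIR Form Res. 2023 Dec 7;7:e50027. doi: 10.2196/50027.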

Ten Topics to Get Started in Medical Informatics Research.

J Med Internet Res. 2023 Jul 24;25:e45948. doi: 10.2196/45948.

Data Provenance in Biomedical Research: Scoping Review.

J Med Internet Res. 2023 Mar 27;25:e42289. doi: 10.2196/42289.

Developing a Data Quality Standard Primer for Cardiovascular Risk Assessment from Electronic Health Record Data Using the DataGauge Process.

AMIA Annu Symp Proc. 2022 Feb 21;2021:388-397. eCollection 2021.

Approaches and Criteria for Provenance in Biomedical Data Sets and Workflows: Protocol for a Scoping Review.

JMIR Res Protoc. 2021 Nov 22;10(11):e31750. doi: 10.2196/31750.

Assessment of the impact of EHR heterogeneity for clinical research through a case study of silent brain infarction.

BMC Med Inform Decis Mak. 2020 Mar 30;20(1):60. doi: 10.1186/s12911-020-1072-9.

A framework for analysing learning health systems: Are we removing the most impactful barriers?

Learn Health Syst. 2019 Mar 21;3(4):e10189. doi: 10.1002/lrh2.10189. eCollection 2019 Oct.

Our data, our society, our health: A vision for inclusive and transparent health data science in the United Kingdom and beyond.

Learn Health Syst. 2019 Mar 25;3(3):e10191. doi: 10.1002/lrh2.10191. eCollection 2019 Jul.
