• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

公平的数据共享:通用数据元素和协调的作用。

FAIR data sharing: The roles of common data elements and harmonization.

机构信息

Elligo Health Research and Catalysis, USA.

U.S. National Cancer Institute, USA.

出版信息

J Biomed Inform. 2020 Jul;107:103421. doi: 10.1016/j.jbi.2020.103421. Epub 2020 May 12.

DOI:10.1016/j.jbi.2020.103421
PMID:32407878
Abstract

The value of robust and responsible data sharing in clinical research and healthcare is recognized by patients, patient advocacy groups, researchers, journal editors, and the healthcare industry globally. Privacy and security concerns acknowledged, the act of exchanging data (interoperability) along with its meaning (semantic interoperability) across studies and between partners has been difficult, if not elusive. For shared data to retain its value, a recommendation has been made to follow the Findable, Accessible, Interoperable, Reusable (FAIR) principles. Without applying appropriate data exchange standards with domain-relevant content standards and accessible rich metadata that uses applicable terminologies, interoperability is burdened by the need for transformation and/or mapping. These obstacles to interoperability limit the findability, accessibility and reusability of data, thus diminishing its value and making it impossible to adhere to FAIR principles. One effort to standardize data collection has been through common data elements (CDEs). CDEs are data collection units comprising one or more questions together with a set of valid values. Some CDEs contain standardized terminology concepts that define the meaning of the data, and others include links to unique terminology concept identifiers and unique identifiers for each CDE; however, usually CDEs are defined for specific projects or collaborations and lack traceable or machine readable semantics. While the name implies that these are 'common', this has not necessarily been a requirement, and many CDEs have not been commonly used. The National Institutes of Health (NIH) CDEs are, in fact, a conglomerate of CDEs developed in silos by various NIH institutes. Therefore, CDEs have not brought the anticipated benefit to the industry through widescale interoperability, nor is there widespread reuse of CDEs. Certain institutes in the NIH recommend, albeit do not enforce, institute-specific preferred CDEs; however, at the NIH level a preponderance of choice and a lack of any overarching harmonization of CDEs or consistency in linking them to controlled terminology or common identifiers create confusion for researchers in their efforts to identify the best CDEs for their protocol. The problem of comparing data among studies is exacerbated when researchers select different CDEs for the same variable or data collection field. This manuscript explores reasons for the disappointingly low adoption of CDEs and the inability of CDEs or other clinical research standards to broadly solve the interoperability and data sharing problems. Recommendations are offered for rectifying this situation to enable responsible data sharing that will help in adherence to FAIR principles and the realization of Learning Health Systems for the sake of all of us as patients.

摘要

在全球范围内,患者、患者权益倡导组织、研究人员、期刊编辑和医疗保健行业都认识到在临床研究和医疗保健中稳健负责的数据共享的价值。尽管已经认识到隐私和安全问题,但在研究之间以及合作伙伴之间交换数据(互操作性)及其含义(语义互操作性)一直很困难,如果不是难以实现的话。为了保持数据的价值,有人建议遵循可发现性、可访问性、互操作性、可重用性(FAIR)原则。如果不使用适当的数据交换标准以及与领域相关的内容标准和使用适用术语的可访问丰富元数据,互操作性就会受到需要转换和/或映射的困扰。这些互操作性障碍限制了数据的可发现性、可访问性和可重用性,从而降低了数据的价值,并使其无法遵守 FAIR 原则。通过通用数据元素 (CDE) 来标准化数据收集是一种努力。CDE 是由一个或多个问题以及一组有效值组成的数据收集单元。一些 CDE 包含定义数据含义的标准化术语概念,而另一些则包含指向唯一术语概念标识符和每个 CDE 的唯一标识符的链接;但是,通常 CDE 是为特定项目或合作定义的,并且缺乏可跟踪或机器可读的语义。虽然名称暗示这些是“通用的”,但这不一定是必需的,并且许多 CDE 并未被广泛使用。事实上,美国国立卫生研究院 (NIH) 的 CDE 是由各种 NIH 研究所各自独立开发的 CDE 的集合。因此,CDE 并没有通过广泛的互操作性为行业带来预期的好处,也没有广泛使用 CDE。NIH 的某些机构建议使用特定机构首选的 CDE,但不强制使用;但是,在 NIH 层面,选择过多且缺乏对 CDE 或将其链接到受控术语或通用标识符的任何总体协调或一致性,导致研究人员在努力确定最适合其方案的 CDE 时感到困惑。当研究人员为同一变量或数据收集字段选择不同的 CDE 时,研究之间比较数据的问题会更加严重。本文探讨了令人失望的 CDE 采用率低的原因,以及 CDE 或其他临床研究标准无法广泛解决互操作性和数据共享问题的原因。为了纠正这种情况,提出了一些建议,以实现负责任的数据共享,这将有助于遵守 FAIR 原则,并为了我们所有人作为患者实现学习健康系统。

相似文献

1
FAIR data sharing: The roles of common data elements and harmonization.公平的数据共享:通用数据元素和协调的作用。
J Biomed Inform. 2020 Jul;107:103421. doi: 10.1016/j.jbi.2020.103421. Epub 2020 May 12.
2
Creation of Standardized Common Data Elements for Diagnostic Tests in Infectious Disease Studies: Semantic and Syntactic Mapping.创建传染病研究中诊断测试的标准化通用数据元素:语义和句法映射。
J Med Internet Res. 2024 Jun 10;26:e50049. doi: 10.2196/50049.
3
Composite CDE: modeling composite relationships between common data elements for representing complex clinical data.组合 CDE:建模常见数据元素之间的组合关系,用于表示复杂的临床数据。
BMC Med Inform Decis Mak. 2020 Jul 3;20(1):147. doi: 10.1186/s12911-020-01168-0.
4
Making Metadata Machine-Readable as the First Step to Providing Findable, Accessible, Interoperable, and Reusable Population Health Data: Framework Development and Implementation Study.将元数据转化为机器可读形式作为提供可查找、可访问、可互操作和可重用的人群健康数据的第一步:框架开发与实施研究
Online J Public Health Inform. 2024 Aug 1;16:e56237. doi: 10.2196/56237.
5
NINDS Common Data Elements for Congenital Muscular Dystrophy Clinical Research: A National Institute for Neurological Disorders and Stroke Project.美国国立神经病学与卒中研究院先天性肌营养不良症临床研究通用数据元素:一个美国国立神经病学与卒中研究院项目。
J Neuromuscul Dis. 2018;5(1):75-84. doi: 10.3233/JND-170248.
6
NIH HEAL Clinical Data Elements (CDE) implementation: NIH HEAL Initiative IMPOWR network IDEA-CC.NIH HEAL 临床数据元素(CDE)实施:NIH HEAL 计划 IMPOWR 网络 IDEA-CC。
Pain Med. 2023 Jul 5;24(7):743-749. doi: 10.1093/pm/pnad018.
7
A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and Evaluation Study.一种创建可发现、可访问、可互操作和可重用健康数据的数据转换方法:软件设计、开发和评估研究。
J Med Internet Res. 2023 Mar 8;25:e42822. doi: 10.2196/42822.
8
Clinical MetaData ontology: a simple classification scheme for data elements of clinical data based on semantics.临床元数据本体:基于语义的临床数据数据元素的简单分类方案。
BMC Med Inform Decis Mak. 2019 Aug 20;19(1):166. doi: 10.1186/s12911-019-0877-x.
9
Learning important common data elements from shared study data: The All of Us program analysis.从共享研究数据中学习重要的通用数据元素:All of Us 计划分析。
PLoS One. 2023 Jul 7;18(7):e0283601. doi: 10.1371/journal.pone.0283601. eCollection 2023.
10
National Institute of Neurological Disorders and Stroke and Department of Defense Sport-Related Concussion Common Data Elements Version 1.0 Recommendations.美国国立神经病学与卒中研究院和国防部运动相关性脑震荡通用数据元素 1.0 版推荐意见。
J Neurotrauma. 2018 Dec 1;35(23):2776-2783. doi: 10.1089/neu.2018.5643. Epub 2018 Jul 23.

引用本文的文献

1
The computational model lifecycle: Opportunities and challenges for computational medicine in the healthcare ecosystem.计算模型生命周期:医疗保健生态系统中计算医学面临的机遇与挑战。
Sci Prog. 2025 Jul-Sep;108(3):368504251344145. doi: 10.1177/00368504251344145. Epub 2025 Sep 1.
2
Multimodal data curation via interoperability: use cases with the Medical Imaging and Data Resource Center.通过互操作性进行多模态数据管理:医学影像与数据资源中心的用例
Sci Data. 2025 Aug 1;12(1):1340. doi: 10.1038/s41597-025-05678-2.
3
A natural language processing approach to support biomedical data harmonization: Leveraging large language models.
一种支持生物医学数据协调的自然语言处理方法:利用大语言模型。
PLoS One. 2025 Jul 24;20(7):e0328262. doi: 10.1371/journal.pone.0328262. eCollection 2025.
4
Integrating community perspectives to improve survey completion rates in public health research by refining controversial survey elements.通过优化有争议的调查要素,整合社区观点以提高公共卫生研究中的调查完成率。
J Clin Transl Sci. 2025 Apr 28;9(1):e123. doi: 10.1017/cts.2025.80. eCollection 2025.
5
A CDE-based data structure for radiotherapeutic decision-making in breast cancer.一种基于临床决策要素的乳腺癌放射治疗决策数据结构。
BMC Med Inform Decis Mak. 2025 Jul 1;25(1):220. doi: 10.1186/s12911-025-03036-1.
6
A companion to the preclinical common data elements for rigor, reproducibility and transparency in basic and translational epilepsy research. A report of the TASK3-WG1A: Pharmacology Working Group of the ILAE/AES Joint Translational Task Force.《临床前常见数据元素助力基础及转化性癫痫研究的严谨性、可重复性和透明度》。国际抗癫痫联盟/美国癫痫学会联合转化任务组TASK3-WG1A:药理学工作组报告。
Epilepsia Open. 2025 Aug;10 Suppl 1(Suppl 1):S18-S39. doi: 10.1002/epi4.70053. Epub 2025 Jun 26.
7
A guide to developing harmonized research workflows in a team science context.团队科学背景下开发统一研究工作流程的指南。
Exp Neurol. 2025 Jun 5;392:115333. doi: 10.1016/j.expneurol.2025.115333.
8
From bites to bytes: understanding how and why individual malaria risk varies using artificial intelligence and causal inference.从叮咬到字节:利用人工智能和因果推断理解个体疟疾风险的变化方式及原因。
Front Genet. 2025 May 16;16:1599826. doi: 10.3389/fgene.2025.1599826. eCollection 2025.
9
Advancing precision oncology with AI-powered genomic analysis.通过人工智能驱动的基因组分析推动精准肿瘤学发展。
Front Pharmacol. 2025 Apr 30;16:1591696. doi: 10.3389/fphar.2025.1591696. eCollection 2025.
10
CDEMapper: enhancing National Institutes of Health common data element use with large language models.CDEMapper:利用大语言模型增强美国国立卫生研究院通用数据元素的使用
J Am Med Inform Assoc. 2025 Jul 1;32(7):1130-1139. doi: 10.1093/jamia/ocaf064.