Suppr超能文献

癌症研究的通用数据元素:关于功能与结构的评论

The Common Data Elements for cancer research: remarks on functions and structure.

作者信息

Nadkarni P M, Brandt C A

机构信息

Center for Medical Informatics, Yale University School of Medicine, PO Box 208009, New Haven, CT 06520-8009, USA.

出版信息

Methods Inf Med. 2006;45(6):594-601.

Abstract

OBJECTIVES

The National Cancer Institute (NCI) has developed the Common Data Elements (CDE) to serve as a controlled vocabulary of data descriptors for cancer research, to facilitate data interchange and inter-operability between cancer research centers. We evaluated CDE's structure to see whether it could represent the elements necessary to support its intended purpose, and whether it could prevent errors and inconsistencies from being accidentally introduced. We also performed automated checks for certain types of content errors that provided a rough measure of curation quality.

METHODS

Evaluation was performed on CDE content downloaded via the NCI's CDE Browser, and transformed into relational database form. Evaluation was performed under three categories: 1) compatibility with the ISO/IEC 11179 metadata model, on which CDE structure is based, 2) features necessary for controlled vocabulary support, and 3) support for a stated NCI goal, set up of data collection forms for cancer research.

RESULTS

Various limitations were identified both with respect to content (inconsistency, insufficient definition of elements, redundancy) as well as structure--particularly the need for term and relationship support, as well as the need for metadata supporting the explicit representation of electronic forms that utilize sets of common data elements.

CONCLUSIONS

While there are numerous positive aspects to the CDE effort, there is considerable opportunity for improvement. Our recommendations include review of existing content by diverse experts in the cancer community; integration with the NCI thesaurus to take advantage of the latter's links to nationally used controlled vocabularies, and various schema enhancements required for electronic form support.

摘要

目标

美国国家癌症研究所(NCI)已开发出通用数据元素(CDE),作为癌症研究数据描述符的受控词汇表,以促进癌症研究中心之间的数据交换和互操作性。我们评估了CDE的结构,以确定它是否能够代表支持其预期目的所需的元素,以及它是否能够防止错误和不一致性被意外引入。我们还对某些类型的内容错误进行了自动检查,这些检查提供了对管理质量的粗略衡量。

方法

对通过NCI的CDE浏览器下载并转换为关系数据库形式的CDE内容进行评估。评估分为三类:1)与CDE结构所基于的ISO/IEC 11179元数据模型的兼容性,2)受控词汇表支持所需的功能,3)对NCI既定目标的支持,即建立癌症研究数据收集表单。

结果

在内容(不一致性、元素定义不足、冗余)以及结构方面都发现了各种限制——特别是对术语和关系支持的需求,以及对支持利用通用数据元素集的电子表单的显式表示的元数据的需求。

结论

虽然CDE的工作有许多积极方面,但仍有很大的改进空间。我们的建议包括由癌症领域的不同专家审查现有内容;与NCI叙词表集成,以利用后者与全国使用的受控词汇表的链接,以及电子表单支持所需的各种模式增强。

相似文献

2
Quality evaluation of value sets from cancer study common data elements using the UMLS semantic groups.
J Am Med Inform Assoc. 2012 Jun;19(e1):e129-36. doi: 10.1136/amiajnl-2011-000739. Epub 2012 Apr 17.
4
Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience.
J Am Med Inform Assoc. 2011 Jul-Aug;18(4):376-86. doi: 10.1136/amiajnl-2010-000061. Epub 2011 May 19.
5
Sharing behavioral data through a grid infrastructure using data standards.
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):642-9. doi: 10.1136/amiajnl-2013-001763. Epub 2013 Sep 27.
6
caCORE: a common infrastructure for cancer informatics.
Bioinformatics. 2003 Dec 12;19(18):2404-12. doi: 10.1093/bioinformatics/btg335.
7
The NCI All Ireland Cancer Conference.
Oncologist. 1999;4(4):275-277.
8
The ISO/IEC 11179 norm for metadata registries: does it cover healthcare standards in empirical research?
J Biomed Inform. 2013 Apr;46(2):318-27. doi: 10.1016/j.jbi.2012.11.008. Epub 2012 Dec 14.
9
[caCORE: core architecture of bioinformation on cancer research in America].
Beijing Da Xue Xue Bao Yi Xue Ban. 2006 Apr 18;38(2):218-21.
10

引用本文的文献

5
Consensus core clinical data elements for meningiomas (v2021.1).
Neuro Oncol. 2022 May 4;24(5):683-693. doi: 10.1093/neuonc/noab259.
6
Pragmatic MDR: a metadata repository with bottom-up standardization of medical metadata through reuse.
BMC Med Inform Decis Mak. 2021 May 17;21(1):160. doi: 10.1186/s12911-021-01524-8.
8
Modeling cancer clinical trials using HL7 FHIR to support downstream applications: A case study with colorectal cancer data.
Int J Med Inform. 2021 Jan;145:104308. doi: 10.1016/j.ijmedinf.2020.104308. Epub 2020 Oct 22.
9
Composite CDE: modeling composite relationships between common data elements for representing complex clinical data.
BMC Med Inform Decis Mak. 2020 Jul 3;20(1):147. doi: 10.1186/s12911-020-01168-0.
10
Unleashing the value of Common Data Elements through the CEDAR Workbench.
AMIA Annu Symp Proc. 2020 Mar 4;2019:681-690. eCollection 2019.

本文引用的文献

1
Terminology access methods leveraging LDAP resources.
Stud Health Technol Inform. 2004;107(Pt 1):545-9.
3
Screening for depression in head and neck cancer.
Psychooncology. 2004 Apr;13(4):269-80. doi: 10.1002/pon.734.
4
The Open Terminology Services (OTS) project.
AMIA Annu Symp Proc. 2003;2003:1011.
5
7
Towards a broad-coverage biomedical ontology based on description logics.
Pac Symp Biocomput. 2003:577-88. doi: 10.1142/9789812776303_0054.
8
Extending the LOINC conceptual schema to support standardized assessment instruments.
J Am Med Inform Assoc. 2002 Nov-Dec;9(6):586-99. doi: 10.1197/jamia.m1033.
10
Sociodemographic factors and quality of life as prognostic indicators in head and neck cancer.
Eur J Cancer. 2001 Feb;37(3):332-9. doi: 10.1016/s0959-8049(00)00385-3.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验