Clinical code set engineering for reusing EHR data for research: A review.

Authors

Williams Richard, Kontopantelis Evangelos, Buchan Iain, Peek Niels

Affiliations

MRC Health eResearch Centre, University of Manchester, Manchester, UK; NIHR Greater Manchester Primary Care Patient Safety Translational Research Centre, University of Manchester, Manchester, UK.

MRC Health eResearch Centre, University of Manchester, Manchester, UK; NIHR School for Primary Care Research, University of Manchester, Manchester, UK.

Publication details

J Biomed Inform. 2017 Jun;70:1-13. doi: 10.1016/j.jbi.2017.04.010. Epub 2017 Apr 22.

DOI:10.1016/j.jbi.2017.04.010
PMID:28442434
Abstract

INTRODUCTION

The construction of reliable, reusable clinical code sets is essential when re-using Electronic Health Record (EHR) data for research. Yet code set definitions are rarely transparent and their sharing is almost non-existent. There is a lack of methodological standards for the management (construction, sharing, revision and reuse) of clinical code sets which needs to be addressed to ensure the reliability and credibility of studies which use code sets.

OBJECTIVE

To review methodological literature on the management of sets of clinical codes used in research on clinical databases and to provide a list of best practice recommendations for future studies and software tools.

METHODS

We performed an exhaustive search for methodological papers about clinical code set engineering for re-using EHR data in research. This was supplemented with papers identified by snowball sampling. In addition, a list of e-phenotyping systems was constructed by merging references from several systematic reviews on this topic, and the processes adopted by those systems for code set management were reviewed.

RESULTS

Thirty methodological papers were reviewed. Common approaches included: creating an initial list of synonyms for the condition of interest (n=20); making use of the hierarchical nature of coding terminologies during searching (n=23); reviewing sets with clinician input (n=20); and reusing and updating an existing code set (n=20). Several open source software tools (n=3) were discovered.
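
To make the common approaches above concrete, the following is a minimal sketch, not the method of any reviewed paper or tool. It assumes a toy terminology in which each code has a description and a single parent; all codes, descriptions, and function names (`search_by_synonyms`, `expand_descendants`, `build_draft_code_set`) are invented for illustration. It searches descriptions using a synonym list, uses the hierarchy to pull in descendant codes that the text search misses, and applies exclusions of the kind a clinician reviewer might supply.

```python
from collections import deque

# Toy terminology: code -> (description, parent code).
# Every code and description here is invented for illustration only.
TERMINOLOGY = {
    "D1": ("Diabetes mellitus", None),
    "D1.1": ("Type 1 diabetes mellitus", "D1"),
    "D1.2": ("Type 2 diabetes mellitus", "D1"),
    "D1.2.1": ("T2DM with nephropathy", "D1.2"),  # no synonym in the text
    "E5": ("Diabetes insipidus", None),           # matches text, different condition
    "H1": ("Hypertension", None),
}


def search_by_synonyms(terminology, synonyms):
    """Return codes whose description contains any synonym (case-insensitive)."""
    lowered = [s.lower() for s in synonyms]
    return {
        code
        for code, (description, _parent) in terminology.items()
        if any(s in description.lower() for s in lowered)
    }


def expand_descendants(terminology, seeds):
    """Return the seed codes plus all of their descendants in the hierarchy."""
    children = {}
    for code, (_description, parent) in terminology.items():
        if parent is not None:
            children.setdefault(parent, []).append(code)

    found = set(seeds)
    queue = deque(seeds)
    while queue:
        current = queue.popleft()
        for child in children.get(current, []):
            if child not in found:
                found.add(child)
                queue.append(child)
    return found


def build_draft_code_set(terminology, synonyms, excluded=()):
    """Synonym search, then hierarchy expansion, then reviewer exclusions."""
    matched = search_by_synonyms(terminology, synonyms)
    expanded = expand_descendants(terminology, matched)
    return sorted(expanded - set(excluded))


if __name__ == "__main__":
    # "E5" stands in for a code rejected during clinician review.
    draft = build_draft_code_set(TERMINOLOGY, ["diabetes", "diabetic"], excluded=["E5"])
    print(draft)
```

In this toy data the text search alone misses "D1.2.1" and wrongly picks up "E5", which is why the sketch combines hierarchy expansion with a review step. Real terminologies can give a code several parents, so a production tool would need a richer hierarchy model than the single-parent mapping assumed here.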

DISCUSSION

There is a need for software tools that enable users to easily and quickly create, revise, extend, review and share code sets and we provide a list of recommendations for their design and implementation.

CONCLUSION

Research re-using EHR data could be improved through the further development, more widespread use and routine reporting of the methods by which clinical codes were selected.
