• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

英国表型组学平台用于开发和验证电子健康记录表型:CALIBER。

UK phenomics platform for developing and validating electronic health record phenotypes: CALIBER.

机构信息

Institute of Health Informatics, University College London, London,United Kingdom.

Health Data Research UK, London, United Kingdom.

出版信息

J Am Med Inform Assoc. 2019 Dec 1;26(12):1545-1559. doi: 10.1093/jamia/ocz105.

DOI:10.1093/jamia/ocz105
PMID:31329239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6857510/
Abstract

OBJECTIVE

Electronic health records (EHRs) are a rich source of information on human diseases, but the information is variably structured, fragmented, curated using different coding systems, and collected for purposes other than medical research. We describe an approach for developing, validating, and sharing reproducible phenotypes from national structured EHR in the United Kingdom with applications for translational research.

MATERIALS AND METHODS

We implemented a rule-based phenotyping framework, with up to 6 approaches of validation. We applied our framework to a sample of 15 million individuals in a national EHR data source (population-based primary care, all ages) linked to hospitalization and death records in England. Data comprised continuous measurements (for example, blood pressure; medication information; coded diagnoses, symptoms, procedures, and referrals), recorded using 5 controlled clinical terminologies: (1) read (primary care, subset of SNOMED-CT [Systematized Nomenclature of Medicine Clinical Terms]), (2) International Classification of Diseases-Ninth Revision and Tenth Revision (secondary care diagnoses and cause of mortality), (3) Office of Population Censuses and Surveys Classification of Surgical Operations and Procedures, Fourth Revision (hospital surgical procedures), and (4) DM+D prescription codes.

RESULTS

Using the CALIBER phenotyping framework, we created algorithms for 51 diseases, syndromes, biomarkers, and lifestyle risk factors and provide up to 6 validation approaches. The EHR phenotypes are curated in the open-access CALIBER Portal (https://www.caliberresearch.org/portal) and have been used by 40 national and international research groups in 60 peer-reviewed publications.

CONCLUSIONS

We describe a UK EHR phenomics approach within the CALIBER EHR data platform with initial evidence of validity and use, as an important step toward international use of UK EHR data for health research.

摘要

目的

电子健康记录(EHR)是人类疾病信息的丰富来源,但信息结构多样、碎片化,使用不同的编码系统进行整理,并出于医疗研究以外的目的而收集。我们描述了一种从英国国家结构化 EHR 中开发、验证和共享可重复表型的方法,该方法可应用于转化研究。

材料和方法

我们实施了基于规则的表型框架,最多有 6 种验证方法。我们将我们的框架应用于英格兰住院和死亡记录链接的全国性电子健康记录数据源(基于人群的初级保健,所有年龄段)中的 1500 万个体样本。数据包括连续测量值(例如血压、药物信息、编码诊断、症状、程序和转诊),使用 5 种受控临床术语进行记录:(1)读取(初级保健,SNOMED-CT[系统医学术语临床术语]的子集),(2)国际疾病分类第九版和第十版(二级保健诊断和死亡率原因),(3)人口普查和调查办公室手术和程序分类第四版(医院手术程序),以及(4)DM+D 处方代码。

结果

使用 CALIBER 表型框架,我们为 51 种疾病、综合征、生物标志物和生活方式风险因素创建了算法,并提供了多达 6 种验证方法。EHR 表型在开放访问的 CALIBER 门户(https://www.caliberresearch.org/portal)中进行管理,并已被 40 个国家和国际研究小组在 60 篇同行评议出版物中使用。

结论

我们在 CALIBER EHR 数据平台内描述了一种英国 EHR 表型学方法,并初步证明了其有效性和使用,这是朝着国际使用英国 EHR 数据进行健康研究迈出的重要一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/2a79a7fa2c51/ocz105f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/11a7b9fbd0a0/ocz105f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/92bd6892abbc/ocz105f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/b759d23a23df/ocz105f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/2a79a7fa2c51/ocz105f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/11a7b9fbd0a0/ocz105f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/92bd6892abbc/ocz105f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/b759d23a23df/ocz105f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e058/6857510/2a79a7fa2c51/ocz105f4.jpg

相似文献

1
UK phenomics platform for developing and validating electronic health record phenotypes: CALIBER.英国表型组学平台用于开发和验证电子健康记录表型:CALIBER。
J Am Med Inform Assoc. 2019 Dec 1;26(12):1545-1559. doi: 10.1093/jamia/ocz105.
2
Phenotyping UK Electronic Health Records from 15 Million Individuals for Precision Medicine: The CALIBER Resource.对1500万英国个体的电子健康记录进行精准医学表型分析:CALIBER资源。
Stud Health Technol Inform. 2019 Jul 4;262:220-223. doi: 10.3233/SHTI190058.
3
A method for cohort selection of cardiovascular disease records from an electronic health record system.一种从电子健康记录系统中选择心血管疾病记录队列的方法。
Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30.
4
SNOMED CT Concept Hierarchies for Sharing Definitions of Clinical Conditions Using Electronic Health Record Data.使用电子健康记录数据共享临床病症定义的SNOMED CT概念层次结构。
Appl Clin Inform. 2018 Jul;9(3):667-682. doi: 10.1055/s-0038-1668090. Epub 2018 Aug 29.
5
6
Natural language processing for disease phenotyping in UK primary care records for research: a pilot study in myocardial infarction and death.用于研究的英国初级医疗记录中疾病表型的自然语言处理:心肌梗死和死亡的一项试点研究
J Biomed Semantics. 2019 Nov 12;10(Suppl 1):20. doi: 10.1186/s13326-019-0214-4.
7
Genetic validation of bipolar disorder identified by automated phenotyping using electronic health records.利用电子健康记录进行自动化表型分析鉴定的双相情感障碍的遗传学验证。
Transl Psychiatry. 2018 Apr 18;8(1):86. doi: 10.1038/s41398-018-0133-7.
8
Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals.评估电子健康记录数据源及识别高血压个体的算法方法。
J Am Med Inform Assoc. 2017 Jan;24(1):162-171. doi: 10.1093/jamia/ocw071. Epub 2016 Aug 7.
9
Next generation phenotyping using the unified medical language system.利用统一医学语言系统进行下一代表型分析。
JMIR Med Inform. 2014 Mar 18;2(1):e5. doi: 10.2196/medinform.3172.
10
Introduction of Systematized Nomenclature of Medicine-Clinical Terms Coding Into an Electronic Health Record and Evaluation of its Impact: Qualitative and Quantitative Study.医学临床术语编码系统引入电子健康记录及其影响评估:定性与定量研究
JMIR Med Inform. 2021 Nov 23;9(11):e29532. doi: 10.2196/29532.

引用本文的文献

1
Artificial Intelligence in Hypertrophic Cardiomyopathy: Advances, Challenges, and Future Directions for Personalized Risk Prediction and Management.肥厚型心肌病中的人工智能:个性化风险预测与管理的进展、挑战及未来方向
Cureus. 2025 Jul 14;17(7):e87907. doi: 10.7759/cureus.87907. eCollection 2025 Jul.
2
A computational framework for defining and validating reproducible phenotyping algorithms of 313 diseases in the UK Biobank.一种用于定义和验证英国生物银行中313种疾病的可重复表型分析算法的计算框架。
Sci Rep. 2025 Jul 9;15(1):24607. doi: 10.1038/s41598-025-05838-9.
3
Robustly measuring multimorbidity using disparate linked datasets.

本文引用的文献

1
A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service.308 种身心状况的时间图谱,源自英国国民保健署 400 万人的数据。
Lancet Digit Health. 2019 May 20;1(2):e63-e77. doi: 10.1016/S2589-7500(19)30012-3. eCollection 2019 Jun.
2
Advances in Electronic Phenotyping: From Rule-Based Definitions to Machine Learning Models.电子表型分析的进展:从基于规则的定义到机器学习模型
Annu Rev Biomed Data Sci. 2018 Jul;1:53-68. doi: 10.1146/annurev-biodatasci-080917-013315. Epub 2018 May 23.
3
Machine-learning based identification of undiagnosed dementia in primary care: a feasibility study.
使用不同的关联数据集稳健地测量多种疾病并存情况。
Commun Med (Lond). 2025 Jul 8;5(1):283. doi: 10.1038/s43856-025-00995-4.
4
Open science and phenotyping in UK administrative health, education and social care data: the ECHILD phenotype code list repository.英国行政健康、教育和社会护理数据中的开放科学与表型分析:ECHILD表型代码列表存储库
Int J Popul Data Sci. 2025 May 13;10(2):2943. doi: 10.23889/ijpds.v10i2.2943. eCollection 2025.
5
Risk Factors and Prognosis of New-Onset Atrial Fibrillation in Sepsis: A Nationwide Electronic Health Record Study.脓毒症中新发心房颤动的危险因素及预后:一项全国性电子健康记录研究
JACC Adv. 2025 Mar 27;4(4):101681. doi: 10.1016/j.jacadv.2025.101681.
6
Clinical Research Informatics: a Decade-in-Review.临床研究信息学:十年回顾
Yearb Med Inform. 2024 Aug;33(1):127-142. doi: 10.1055/s-0044-1800732. Epub 2025 Apr 8.
7
Identification of heart failure subtypes using transformer-based deep learning modelling: a population-based study of 379,108 individuals.使用基于Transformer的深度学习模型识别心力衰竭亚型:一项基于379,108人的人群研究。
EBioMedicine. 2025 Apr;114:105657. doi: 10.1016/j.ebiom.2025.105657. Epub 2025 Mar 19.
8
Atrial fibrillation development in the heart failure population from nationwide British linked electronic health records.来自英国全国性关联电子健康记录的心力衰竭人群中心房颤动的发生情况。
ESC Heart Fail. 2025 Mar 12. doi: 10.1002/ehf2.15264.
9
Missed opportunities to manage complex comorbidity of heart failure, type 2 diabetes mellitus and chronic kidney disease: a retrospective cohort study.心力衰竭、2型糖尿病和慢性肾脏病复杂合并症管理的错失机会:一项回顾性队列研究
Heart. 2025 Mar 7. doi: 10.1136/heartjnl-2024-325046.
10
CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.队列诊断:使用人群水平特征在观察性数据源网络中进行表型评估。
PLoS One. 2025 Jan 16;20(1):e0310634. doi: 10.1371/journal.pone.0310634. eCollection 2025.
基于机器学习的基层医疗中未确诊痴呆症的识别:一项可行性研究。
BJGP Open. 2018 Jun 13;2(2):bjgpopen18X101589. doi: 10.3399/bjgpopen18X101589. eCollection 2018 Jul.
4
Clinically recorded heart rate and incidence of 12 coronary, cardiac, cerebrovascular and peripheral arterial diseases in 233,970 men and women: A linked electronic health record study.在 233970 名男性和女性中,临床记录的心率和 12 种冠心病、心脏病、脑血管病和外周动脉疾病的发病率:一项电子健康记录关联研究。
Eur J Prev Cardiol. 2018 Sep;25(14):1485-1495. doi: 10.1177/2047487318785228. Epub 2018 Jul 2.
5
Do GPs accurately record date of death? A UK observational analysis.全科医生是否准确记录死亡日期?一项英国观察性分析。
BMJ Support Palliat Care. 2020 Sep;10(3):e24. doi: 10.1136/bmjspcare-2018-001514. Epub 2018 Jun 27.
6
CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.CogStack-在大型国民保健制度基金会信托医院中部署集成信息检索和提取服务的经验。
BMC Med Inform Decis Mak. 2018 Jun 25;18(1):47. doi: 10.1186/s12911-018-0623-9.
7
Evaluation of Semantic Web Technologies for Storing Computable Definitions of Electronic Health Records Phenotyping Algorithms.用于存储电子健康记录表型算法可计算定义的语义网技术评估。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1352-1361. eCollection 2017.
8
Cardiovascular disease risk prediction equations in 400 000 primary care patients in New Zealand: a derivation and validation study.新西兰 40 万例初级保健患者的心血管疾病风险预测方程:一项推导和验证研究。
Lancet. 2018 May 12;391(10133):1897-1907. doi: 10.1016/S0140-6736(18)30664-0. Epub 2018 May 4.
9
The 100 000 Genomes Project: bringing whole genome sequencing to the NHS.“十万基因组计划”:将全基因组测序引入英国国家医疗服务体系。
BMJ. 2018 Apr 24;361:k1687. doi: 10.1136/bmj.k1687.
10
An electronic health records cohort study on heart failure following myocardial infarction in England: incidence and predictors.一项关于英格兰心肌梗死后心力衰竭的电子健康记录队列研究:发病率及预测因素
BMJ Open. 2018 Mar 3;8(3):e018331. doi: 10.1136/bmjopen-2017-018331.