• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

创建下一代表型库:英国健康数据研究表型库

Creating a next-generation phenotype library: the health data research UK Phenotype Library.

作者信息

Thayer Daniel S, Mumtaz Shahzad, Elmessary Muhammad A, Scanlon Ieuan, Zinnurov Artur, Coldea Alex-Ioan, Scanlon Jack, Chapman Martin, Curcin Vasa, John Ann, DelPozo-Banos Marcos, Davies Hannah, Karwath Andreas, Gkoutos Georgios V, Fitzpatrick Natalie K, Quint Jennifer K, Varma Susheel, Milner Chris, Oliveira Carla, Parkinson Helen, Denaxas Spiros, Hemingway Harry, Jefferson Emily

机构信息

SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom.

Health Informatics Centre, School of Medicine, University of Dundee, Dundee, DD1 9SY, United Kingdom.

出版信息

JAMIA Open. 2024 Jun 17;7(2):ooae049. doi: 10.1093/jamiaopen/ooae049. eCollection 2024 Jul.

DOI:10.1093/jamiaopen/ooae049
PMID:38895652
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11182945/
Abstract

OBJECTIVE

To enable reproducible research at scale by creating a platform that enables health data users to find, access, curate, and re-use electronic health record phenotyping algorithms.

MATERIALS AND METHODS

We undertook a structured approach to identifying requirements for a phenotype algorithm platform by engaging with key stakeholders. User experience analysis was used to inform the design, which we implemented as a web application featuring a novel metadata standard for defining phenotyping algorithms, access via Application Programming Interface (API), support for computable data flows, and version control. The application has creation and editing functionality, enabling researchers to submit phenotypes directly.

RESULTS

We created and launched the Phenotype Library in October 2021. The platform currently hosts 1049 phenotype definitions defined against 40 health data sources and >200K terms across 16 medical ontologies. We present several case studies demonstrating its utility for supporting and enabling research: the library hosts curated phenotype collections for the BREATHE respiratory health research hub and the Adolescent Mental Health Data Platform, and it is supporting the development of an informatics tool to generate clinical evidence for clinical guideline development groups.

DISCUSSION

This platform makes an impact by being open to all health data users and accepting all appropriate content, as well as implementing key features that have not been widely available, including managing structured metadata, access via an API, and support for computable phenotypes.

CONCLUSIONS

We have created the first openly available, programmatically accessible resource enabling the global health research community to store and manage phenotyping algorithms. Removing barriers to describing, sharing, and computing phenotypes will help unleash the potential benefit of health data for patients and the public.

摘要

目的

通过创建一个平台,使健康数据用户能够查找、访问、整理和重新使用电子健康记录表型分析算法,从而实现大规模的可重复研究。

材料与方法

我们采用结构化方法,通过与关键利益相关者合作来确定表型算法平台的需求。用户体验分析为设计提供了参考,我们将其实现为一个网络应用程序,该程序具有用于定义表型分析算法的新颖元数据标准、通过应用程序编程接口(API)进行访问、对可计算数据流的支持以及版本控制。该应用程序具有创建和编辑功能,使研究人员能够直接提交表型。

结果

我们于2021年10月创建并推出了表型库。该平台目前托管着针对40个健康数据源定义的1049个表型定义,以及跨越16个医学本体的超过20万个术语。我们展示了几个案例研究,证明了其在支持和推动研究方面的效用:该库托管了为BREATHE呼吸健康研究中心和青少年心理健康数据平台精心策划的表型集合,并且正在支持开发一种信息学工具,以为临床指南制定小组生成临床证据。

讨论

该平台通过向所有健康数据用户开放并接受所有合适的内容,以及实施尚未广泛可用的关键功能(包括管理结构化元数据、通过API访问和支持可计算表型)而产生影响。

结论

我们创建了首个公开可用、可通过编程访问的资源,使全球健康研究界能够存储和管理表型分析算法。消除描述、共享和计算表型的障碍将有助于释放健康数据对患者和公众的潜在益处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2713/11182945/5e005f0901da/ooae049f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2713/11182945/5e005f0901da/ooae049f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2713/11182945/5e005f0901da/ooae049f1.jpg

相似文献

1
Creating a next-generation phenotype library: the health data research UK Phenotype Library.创建下一代表型库:英国健康数据研究表型库
JAMIA Open. 2024 Jun 17;7(2):ooae049. doi: 10.1093/jamiaopen/ooae049. eCollection 2024 Jul.
2
Centralized Interactive Phenomics Resource: an integrated online phenomics knowledgebase for health data users.集中式交互表型资源:为健康数据用户提供集成的在线表型知识库。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1126-1134. doi: 10.1093/jamia/ocae042.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Concept libraries for automatic electronic health record based phenotyping: A review.基于自动电子健康记录的表型概念库:综述。
Int J Popul Data Sci. 2021 Jun 16;6(1):1362. doi: 10.23889/ijpds.v5i1.1362.
5
Feasibility of Using EN 13606 Clinical Archetypes for Defining Computable Phenotypes.使用EN 13606临床原型定义可计算表型的可行性。
Stud Health Technol Inform. 2020 Jun 16;270:228-232. doi: 10.3233/SHTI200156.
6
Evaluation of Semantic Web Technologies for Storing Computable Definitions of Electronic Health Records Phenotyping Algorithms.用于存储电子健康记录表型算法可计算定义的语义网技术评估。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1352-1361. eCollection 2017.
7
Sharing and Reusing Computable Phenotype Definitions.共享和重用可计算表型定义。
medRxiv. 2023 Sep 18:2023.09.17.23295681. doi: 10.1101/2023.09.17.23295681.
8
Clinical Annotation Research Kit (CLARK): Computable Phenotyping Using Machine Learning.临床注释研究工具包(CLARK):使用机器学习进行可计算表型分析。
JMIR Med Inform. 2020 Jan 24;8(1):e16042. doi: 10.2196/16042.
9
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
10
Remote Digital Psychiatry for Mobile Mental Health Assessment and Therapy: MindLogger Platform Development Study.远程数字精神病学用于移动心理健康评估和治疗:MindLogger 平台开发研究。
J Med Internet Res. 2021 Nov 11;23(11):e22369. doi: 10.2196/22369.

引用本文的文献

1
Risk Factors and Prognosis of New-Onset Atrial Fibrillation in Sepsis: A Nationwide Electronic Health Record Study.脓毒症中新发心房颤动的危险因素及预后:一项全国性电子健康记录研究
JACC Adv. 2025 Mar 27;4(4):101681. doi: 10.1016/j.jacadv.2025.101681.
2
The impacts of Liverpool Citizen's Advice on Prescription (CAP) on mental health outcomes- an Instrumental Variable (IV) approach.利物浦公民咨询局对处方(CAP)对心理健康结果的影响——一种工具变量(IV)方法。
SSM Popul Health. 2025 Mar 22;30:101785. doi: 10.1016/j.ssmph.2025.101785. eCollection 2025 Jun.
3
Methods for identifying health status from routinely collected health data: An overview.

本文引用的文献

1
Determining prescriptions in electronic healthcare record data: methods for development of standardized, reproducible drug codelists.在电子健康记录数据中确定处方:标准化、可重复的药品代码列表的开发方法。
JAMIA Open. 2023 Aug 29;6(3):ooad078. doi: 10.1093/jamiaopen/ooad078. eCollection 2023 Oct.
2
Age, sex, and socioeconomic differences in multimorbidity measured in four ways: UK primary care cross-sectional analysis.四种方法测量的英国初级保健横断面分析中的多病症的年龄、性别和社会经济差异。
Br J Gen Pract. 2023 Mar 30;73(729):e249-e256. doi: 10.3399/BJGP.2022.0405. Print 2023 Apr.
3
Framework of the Centralized Interactive Phenomics Resource (CIPHER) standard for electronic health data-based phenomics knowledgebase.
从常规收集的健康数据中识别健康状况的方法:概述。
Integr Med Res. 2025 Mar;14(1):101100. doi: 10.1016/j.imr.2024.101100. Epub 2024 Nov 15.
4
Refining the CHA2DS2VASc risk stratification scheme: shall we drop the sex category criterion?CHA2DS2VASc 风险分层方案的精细化:是否应摒弃性别分类标准?
Europace. 2024 Nov 1;26(11). doi: 10.1093/europace/euae280.
集中交互表型资源(CIPHER)标准框架,用于基于电子健康数据的表型知识库。
J Am Med Inform Assoc. 2023 Apr 19;30(5):958-964. doi: 10.1093/jamia/ocad030.
4
MixEHR-Guided: A guided multi-modal topic modeling approach for large-scale automatic phenotyping using the electronic health record.混合 EHR 引导:一种使用电子健康记录进行大规模自动表型分析的引导式多模态主题建模方法。
J Biomed Inform. 2022 Oct;134:104190. doi: 10.1016/j.jbi.2022.104190. Epub 2022 Sep 1.
5
CODE-EHR best-practice framework for the use of structured electronic health-care records in clinical research.用于临床研究中使用结构化电子健康记录的 CODE-EHR 最佳实践框架。
Lancet Digit Health. 2022 Oct;4(10):e757-e764. doi: 10.1016/S2589-7500(22)00151-0. Epub 2022 Aug 29.
6
Heterogeneity in phenotype, disease progression and drug response in type 2 diabetes.2 型糖尿病表型、疾病进展和药物反应的异质性。
Nat Med. 2022 May;28(5):982-988. doi: 10.1038/s41591-022-01790-7. Epub 2022 May 9.
7
Concept Libraries for Repeatable and Reusable Research: Qualitative Study Exploring the Needs of Users.用于可重复和可复用研究的概念库:探索用户需求的定性研究
JMIR Hum Factors. 2022 Mar 15;9(1):e31021. doi: 10.2196/31021.
8
Clinical and operational insights from data-driven care pathway mapping: a systematic review.基于数据驱动的护理路径图的临床和操作见解:系统评价。
BMC Med Inform Decis Mak. 2022 Feb 17;22(1):43. doi: 10.1186/s12911-022-01756-2.
9
Desiderata for the development of next-generation electronic health record phenotype libraries.下一代电子健康记录表型库发展的要点。
Gigascience. 2021 Sep 11;10(9). doi: 10.1093/gigascience/giab059.
10
Phenoflow: A Microservice Architecture for Portable Workflow-based Phenotype Definitions. Phenoflow:一种用于便携式基于工作流的表型定义的微服务架构。
AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:142-151. eCollection 2021.