• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过电子健康记录系统识别罕见病患者有助于评估临床结果。

Systematic identification of rare disease patients in electronic health records enables evaluation of clinical outcomes.

作者信息

Yadaw Arjun S, Sid Eric, Sidky Hythem, Zeng Chenjie, Zhu Qian, Mathé Ewy A

机构信息

Division of Preclinical Innovation, National Center for Advancing Translational Sciences (NCATS), NIH, Rockville, MD, USA.

Division of Rare Diseases Research Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA.

出版信息

medRxiv. 2025 May 6:2025.05.02.25325348. doi: 10.1101/2025.05.02.25325348.

DOI:10.1101/2025.05.02.25325348
PMID:40385408
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12083593/
Abstract

BACKGROUND

Identifying rare disease (RD) patients in electronic health records (EHR) is challenging, as more than 10,000 rare diseases are not typically captured by clinical coding systems. This limits the assessment of clinical outcomes for RD patients. This study introduces a semiautomated approach to map RDs to appropriate codes, that is applicable across various EHR systems. By improving RD patient identification, this method facilitates the analysis of clinical outcomes and disease severity in the RD population. We exemplify this by utilizing large EHR datasets such as those in the National COVID Cohort Collaborative (N3C) with over 21 million patients.

METHODS

We developed a semiautomated workflow to enumerate RD-specific SNOMED-CT and ICD-10 codes, starting with 12,003 GARD IDs mapped to ORPHANET. This process linked RDs to SNOMED-CT and ICD-10 codes, applying exclusion criteria based on group of disorders. We created an extensive list of SNOMED-CT codes with descendants from the OHDSI atlas and performed phenotype filtering, removing irrelevant codes. The final list included 12,081 SNOMED-CT codes and 357 ICD-10 codes for further analysis, enabling the identification and mapping of rare diseases in EHR.

RESULTS

Our semiautomated workflow identified 357 RD-specific ICD-10 codes and 12,081 SNOMED-CT codes representing 6,342 RDs which are categorized into 30 Orphanet linearization classes. We exemplify the utility of these codes by performing a preliminary univariate analysis of COVID-19 outcomes in a large cohort of 4,835,718 COVID-19 positive individuals in N3C, of which 404,735 (8.37%) were identified as having preexisting RD. The mortality and hospitalization risk ratios for rare RD classes ranged from 0.23 - 5.28 and 0.93 - 3.13, respectively (p-values <0.001).

CONCLUSIONS

Our systematic and automated workflow enables rapid identification of rare disease patients across diverse EHR systems. We demonstrate its utility by evaluating COVID-19 severity outcomes by rare disease classes in the N3C cohort. These findings support the need for targeted preventive healthcare interventions and highlight the potential for future research on long COVID, COVID-19 reinfection, and other outcomes in the rare disease population.

摘要

背景

在电子健康记录(EHR)中识别罕见病(RD)患者具有挑战性,因为超过10000种罕见病通常未被临床编码系统收录。这限制了对RD患者临床结局的评估。本研究引入了一种半自动方法,将罕见病映射到适当的编码,该方法适用于各种EHR系统。通过改进RD患者识别,此方法有助于分析RD人群的临床结局和疾病严重程度。我们通过利用大型EHR数据集(如国家新冠队列协作组(N3C)中超过2100万患者的数据集)来举例说明这一点。

方法

我们开发了一种半自动工作流程,以枚举特定于罕见病的SNOMED-CT和ICD-10编码,从映射到《孤儿病数据库》的12003个全球遗传疾病注册中心(GARD)ID开始。此过程将罕见病与SNOMED-CT和ICD-10编码相链接,并根据疾病组应用排除标准。我们创建了一个包含来自观察医疗效果合作组织(OHDSI)图谱后代的SNOMED-CT编码的广泛列表,并进行了表型筛选,去除不相关的编码。最终列表包括12081个SNOMED-CT编码和357个ICD-10编码以供进一步分析,从而能够在EHR中识别和映射罕见病。

结果

我们的半自动工作流程识别出357个特定于罕见病的ICD-10编码和12081个SNOMED-CT编码,代表6342种罕见病,这些罕见病被分类为30个《孤儿病数据库》线性化类别。我们通过对N3C中4835718名新冠阳性个体的大型队列进行新冠结局的初步单变量分析来举例说明这些编码的效用,其中404735人(8.37%)被确定患有既往罕见病。罕见病类别的死亡率和住院风险比分别为0.23 - 5.28和0.93 - 3.13(p值<0.001)。

结论

我们的系统且自动化的工作流程能够在不同的EHR系统中快速识别罕见病患者。我们通过在N3C队列中按罕见病类别评估新冠严重程度结局来证明其效用。这些发现支持了针对性预防保健干预的必要性,并突出了未来对罕见病患者群体中长新冠、新冠再感染及其他结局进行研究的潜力。

相似文献

1
Systematic identification of rare disease patients in electronic health records enables evaluation of clinical outcomes.通过电子健康记录系统识别罕见病患者有助于评估临床结果。
medRxiv. 2025 May 6:2025.05.02.25325348. doi: 10.1101/2025.05.02.25325348.
2
Impact of Preexisting Rare Diseases on COVID-19 Severity, Reinfection, and Long COVID, and the Modifying Effects of Vaccination and Antiviral Therapy: A Retrospective Study from the N3C Data Enclave.既往罕见病对新冠病毒疾病严重程度、再感染及新冠长期症状的影响,以及疫苗接种和抗病毒治疗的调节作用:一项来自国家COVID-19合作数据库(N3C数据专区)的回顾性研究
medRxiv. 2025 Jul 10:2025.07.09.25331138. doi: 10.1101/2025.07.09.25331138.
3
Impact of residual disease as a prognostic factor for survival in women with advanced epithelial ovarian cancer after primary surgery.原发性手术后晚期上皮性卵巢癌患者残留病灶对生存预后的影响。
Cochrane Database Syst Rev. 2022 Sep 26;9(9):CD015048. doi: 10.1002/14651858.CD015048.pub2.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Validation of administrative health data for the identification of endometriosis diagnosis.用于识别子宫内膜异位症诊断的行政健康数据验证
Hum Reprod. 2025 Feb 1;40(2):289-295. doi: 10.1093/humrep/deae281.
6
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
7
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
8
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
9
Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.用于识别下肢溃疡患者外周动脉疾病的自动化设备:证据综合和成本效益分析。
Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.
10
Surfactant therapy guided by tests for lung maturity in preterm infants at risk of respiratory distress syndrome.表面活性物质治疗在有呼吸窘迫综合征风险的早产儿中根据肺成熟度检测指导。
Cochrane Database Syst Rev. 2023 Oct 26;10(10):CD013158. doi: 10.1002/14651858.CD013158.pub2.

本文引用的文献

1
Prevalence and demographics of 331 rare diseases and associated COVID-19-related mortality among 58 million individuals: a nationwide retrospective observational study.5800万人中331种罕见疾病的患病率、人口统计学特征及相关的COVID-19相关死亡率:一项全国性回顾性观察研究。
Lancet Digit Health. 2025 Feb;7(2):e145-e156. doi: 10.1016/S2589-7500(24)00253-X.
2
Utilization of EHRs for clinical trials: a systematic review.利用电子健康记录进行临床试验:系统评价。
BMC Med Res Methodol. 2024 Mar 18;24(1):70. doi: 10.1186/s12874-024-02177-7.
3
Cluster analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases.
电子健康记录数据的聚类分析和可视化,以识别未确诊的罕见遗传疾病患者。
Sci Rep. 2024 Mar 1;14(1):5056. doi: 10.1038/s41598-024-55424-8.
4
The landscape for rare diseases in 2024.2024年罕见病的概况。
Lancet Glob Health. 2024 Mar;12(3):e341. doi: 10.1016/S2214-109X(24)00056-1.
5
Development of a rare disease algorithm to identify persons at risk of Gaucher disease using electronic health records in the United States.开发一种罕见病算法,以利用美国电子健康记录识别戈谢病高危人群。
Orphanet J Rare Dis. 2023 Sep 9;18(1):280. doi: 10.1186/s13023-023-02868-2.
6
Targeting shared molecular etiologies to accelerate drug development for rare diseases.针对共同的分子病因学来加速罕见病药物研发。
EMBO Mol Med. 2023 Jul 10;15(7):e17159. doi: 10.15252/emmm.202217159. Epub 2023 Jun 27.
7
An artificial intelligence-based approach for identifying rare disease patients using retrospective electronic health records applied for Pompe disease.一种基于人工智能的方法,利用回顾性电子健康记录识别庞贝病罕见病患者。
Front Neurol. 2023 Apr 21;14:1108222. doi: 10.3389/fneur.2023.1108222. eCollection 2023.
8
Disentangling the Cost of Orphan Drugs Marketed in the United States.厘清美国上市的罕见病药物成本
Healthcare (Basel). 2023 Feb 13;11(4):558. doi: 10.3390/healthcare11040558.
9
Hospital mortality in patients with rare diseases during pandemics: lessons learnt from the COVID-19 and SARS pandemics.大流行期间罕见病患者的医院死亡率:COVID-19 和 SARS 大流行中的经验教训。
Orphanet J Rare Dis. 2021 Aug 12;16(1):361. doi: 10.1186/s13023-021-01994-z.
10
Clinical Characterization and Prediction of Clinical Severity of SARS-CoV-2 Infection Among US Adults Using Data From the US National COVID Cohort Collaborative.利用美国国家 COVID 队列协作的数据,对美国成年人中 SARS-CoV-2 感染的临床特征和临床严重程度进行临床描述和预测。
JAMA Netw Open. 2021 Jul 1;4(7):e2116901. doi: 10.1001/jamanetworkopen.2021.16901.