• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过在人类表型本体论和疾病编码之间进行映射来连接罕见病和常见疾病词汇表。

Linking rare and common disease vocabularies by mapping between the human phenotype ontology and phecodes.

作者信息

McArthur Evonne, Bastarache Lisa, Capra John A

机构信息

Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, USA.

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA.

出版信息

JAMIA Open. 2023 Feb 28;6(1):ooad007. doi: 10.1093/jamiaopen/ooad007. eCollection 2023 Apr.

DOI:10.1093/jamiaopen/ooad007
PMID:36875690
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9976874/
Abstract

Enabling discovery across the spectrum of rare and common diseases requires the integration of biological knowledge with clinical data; however, differences in terminologies present a major barrier. For example, the Human Phenotype Ontology (HPO) is the primary vocabulary for describing features of rare diseases, while most clinical encounters use International Classification of Diseases (ICD) billing codes. ICD codes are further organized into clinically meaningful phenotypes via phecodes. Despite their prevalence, no robust phenome-wide disease mapping between HPO and phecodes/ICD exists. Here, we synthesize evidence using diverse sources and methods-including text matching, the National Library of Medicine's Unified Medical Language System (UMLS), Wikipedia, SORTA, and PheMap-to define a mapping between phecodes and HPO terms via 38 950 links. We evaluate the precision and recall for each domain of evidence, both individually and jointly. This flexibility permits users to tailor the HPO-phecode links for diverse applications along the spectrum of monogenic to polygenic diseases.

摘要

实现对罕见病和常见疾病全谱的发现需要将生物学知识与临床数据相结合;然而,术语差异是一个主要障碍。例如,人类表型本体(HPO)是描述罕见病特征的主要词汇表,而大多数临床诊疗使用国际疾病分类(ICD)计费代码。ICD代码通过phecode进一步组织成具有临床意义的表型。尽管它们很普遍,但在HPO与phecode/ICD之间不存在强大的全表型疾病映射。在这里,我们使用多种来源和方法——包括文本匹配、美国国立医学图书馆的统一医学语言系统(UMLS)、维基百科、SORTA和PheMap——综合证据,通过38950个链接定义phecode与HPO术语之间的映射。我们分别和联合评估每个证据领域的精确率和召回率。这种灵活性允许用户针对从单基因疾病到多基因疾病全谱的各种应用定制HPO-phecode链接。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cf7/9976874/5be13b07694e/ooad007f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cf7/9976874/86869d746be5/ooad007f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cf7/9976874/5be13b07694e/ooad007f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cf7/9976874/86869d746be5/ooad007f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cf7/9976874/5be13b07694e/ooad007f2.jpg

相似文献

1
Linking rare and common disease vocabularies by mapping between the human phenotype ontology and phecodes.通过在人类表型本体论和疾病编码之间进行映射来连接罕见病和常见疾病词汇表。
JAMIA Open. 2023 Feb 28;6(1):ooad007. doi: 10.1093/jamiaopen/ooad007. eCollection 2023 Apr.
2
Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record.评估电子健康记录中全表型关联研究的疾病编码、临床分类软件和国际疾病分类第九版临床修订本编码。
PLoS One. 2017 Jul 7;12(7):e0175508. doi: 10.1371/journal.pone.0175508. eCollection 2017.
3
Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation.将ICD - 10和ICD - 10 - CM编码映射到疾病编码:工作流程开发与初步评估
JMIR Med Inform. 2019 Nov 29;7(4):e14325. doi: 10.2196/14325.
4
Improving the phenotype risk score as a scalable approach to identifying patients with Mendelian disease.提高表型风险评分作为一种可扩展的方法来识别孟德尔疾病患者。
J Am Med Inform Assoc. 2019 Dec 1;26(12):1437-1447. doi: 10.1093/jamia/ocz179.
5
Using Phecodes for Research with the Electronic Health Record: From PheWAS to PheRS.使用 pheCodes 进行电子健康记录研究:从 pheWAS 到 pheRS。
Annu Rev Biomed Data Sci. 2021 Jul 20;4:1-19. doi: 10.1146/annurev-biodatasci-122320-112352.
6
SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data.SORTA:一种用于生物医学表型数据的基于本体的重新编码和技术注释的系统。
Database (Oxford). 2015 Sep 18;2015. doi: 10.1093/database/bav089. Print 2015.
7
Developing and evaluating pediatric phecodes (Peds-Phecodes) for high-throughput phenotyping using electronic health records.开发和评估基于电子健康记录的高通量表型分析的儿科 phecode(Peds-Phecodes)。
J Am Med Inform Assoc. 2024 Jan 18;31(2):386-395. doi: 10.1093/jamia/ocad233.
8
Data-driven method to enhance craniofacial and oral phenotype vocabularies.基于数据驱动的方法来增强颅面和口腔表型词汇。
J Am Dent Assoc. 2019 Nov;150(11):933-939.e2. doi: 10.1016/j.adaj.2019.05.029.
9
Mapping the gene ontology into the unified medical language system.将基因本体映射到统一医学语言系统中。
Comp Funct Genomics. 2004;5(4):354-61. doi: 10.1002/cfg.407.
10
Developing and Evaluating Pediatric Phecodes (Peds-Phecodes) for High-Throughput Phenotyping Using Electronic Health Records.利用电子健康记录开发和评估用于高通量表型分析的儿科疾病编码(儿科Phecodes)
medRxiv. 2023 Aug 24:2023.08.22.23294435. doi: 10.1101/2023.08.22.23294435.

引用本文的文献

1
Exclusion-based exome sequencing in critically ill adults 18-40 years old has a 24% diagnostic rate and finds racial disparities in access to genetic testing.对18至40岁的危重症成年患者进行基于排除法的外显子组测序,诊断率为24%,并发现了基因检测可及性方面的种族差异。
Am J Hum Genet. 2025 Aug 7;112(8):1792-1804. doi: 10.1016/j.ajhg.2025.06.007. Epub 2025 Jul 7.
2
Variants in BSN, encoding the presynaptic protein Bassoon, result in a distinct neurodevelopmental disorder with a broad phenotypic range.编码突触前蛋白巴松管的BSN基因变异会导致一种具有广泛表型范围的独特神经发育障碍。
Am J Hum Genet. 2025 May 16. doi: 10.1016/j.ajhg.2025.04.011.
3

本文引用的文献

1
GWAS Central: an expanding resource for finding and visualising genotype and phenotype data from genome-wide association studies.GWAS 中心:一个不断扩展的资源,用于发现和可视化全基因组关联研究中的基因型和表型数据。
Nucleic Acids Res. 2023 Jan 6;51(D1):D986-D993. doi: 10.1093/nar/gkac1017.
2
The phers R package: using phenotype risk scores based on electronic health records to study Mendelian disease and rare genetic variants.phers R 包:使用基于电子健康记录的表型风险评分来研究孟德尔疾病和罕见遗传变异。
Bioinformatics. 2022 Oct 31;38(21):4972-4974. doi: 10.1093/bioinformatics/btac619.
3
Using Phecodes for Research with the Electronic Health Record: From PheWAS to PheRS.
Variants in , encoding the presynaptic protein Bassoon, result in a novel neurodevelopmental disorder with a broad phenotypic range.
编码突触前蛋白巴松管的基因变异会导致一种具有广泛表型范围的新型神经发育障碍。
medRxiv. 2025 Feb 12:2025.02.10.25321755. doi: 10.1101/2025.02.10.25321755.
4
Implications of mappings between International Classification of Diseases clinical diagnosis codes and Human Phenotype Ontology terms.国际疾病分类临床诊断编码与人类表型本体术语之间映射的意义。
JAMIA Open. 2024 Nov 18;7(4):ooae118. doi: 10.1093/jamiaopen/ooae118. eCollection 2024 Dec.
5
Expanding drug targets for 112 chronic diseases using a machine learning-assisted genetic priority score.利用机器学习辅助的遗传优先级评分,拓展 112 种慢性疾病的药物靶点。
Nat Commun. 2024 Oct 15;15(1):8891. doi: 10.1038/s41467-024-53333-y.
6
Reduced Penetrance is Common Among Predicted Loss-of-Function Variants and is Likely Driven by Residual Allelic Activity.预测的功能丧失变异中,降低的外显率很常见,且可能由残余等位基因活性驱动。
medRxiv. 2025 Mar 20:2024.09.23.24314008. doi: 10.1101/2024.09.23.24314008.
7
Universal Exome Sequencing in Critically Ill Adults: A Diagnostic Yield of 25% and Race-Based Disparities in Access to Genetic Testing.危重症成年患者的全外显子组测序:诊断率达25%及基因检测获取方面基于种族的差异
medRxiv. 2024 Apr 1:2024.03.11.24304088. doi: 10.1101/2024.03.11.24304088.
8
A deep learning transformer model predicts high rates of undiagnosed rare disease in large electronic health systems.一种深度学习变压器模型预测大型电子健康系统中未诊断罕见病的高发生率。
medRxiv. 2023 Dec 24:2023.12.21.23300393. doi: 10.1101/2023.12.21.23300393.
9
Next-generation phenotyping: introducing phecodeX for enhanced discovery research in medical phenomics.下一代表型分析:引入 phecodeX 以增强医学表型组学中的发现研究。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad655.
使用 pheCodes 进行电子健康记录研究:从 pheWAS 到 pheRS。
Annu Rev Biomed Data Sci. 2021 Jul 20;4:1-19. doi: 10.1146/annurev-biodatasci-122320-112352.
4
Linking common human diseases to their phenotypes; development of a resource for human phenomics.将常见人类疾病与其表型相联系;开发人类表型组学资源。
J Biomed Semantics. 2021 Aug 23;12(1):17. doi: 10.1186/s13326-021-00249-x.
5
The Human Phenotype Ontology in 2021.2021 年人类表型本体论。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1207-D1217. doi: 10.1093/nar/gkaa1043.
6
PheMap: a multi-resource knowledge base for high-throughput phenotyping within electronic health records.PheMap:一个用于电子健康记录中高通量表型分析的多资源知识库。
J Am Med Inform Assoc. 2020 Nov 1;27(11):1675-1687. doi: 10.1093/jamia/ocaa104.
7
A longitudinal footprint of genetic epilepsies using automated electronic medical record interpretation.利用自动化电子病历解读技术对遗传性癫痫进行纵向研究。
Genet Med. 2020 Dec;22(12):2060-2070. doi: 10.1038/s41436-020-0923-1. Epub 2020 Aug 10.
8
Improving the phenotype risk score as a scalable approach to identifying patients with Mendelian disease.提高表型风险评分作为一种可扩展的方法来识别孟德尔疾病患者。
J Am Med Inform Assoc. 2019 Dec 1;26(12):1437-1447. doi: 10.1093/jamia/ocz179.
9
Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation.将ICD - 10和ICD - 10 - CM编码映射到疾病编码:工作流程开发与初步评估
JMIR Med Inform. 2019 Nov 29;7(4):e14325. doi: 10.2196/14325.
10
Encoding Clinical Data with the Human Phenotype Ontology for Computational Differential Diagnostics.使用人类表型本体对临床数据进行编码以进行计算性鉴别诊断。
Curr Protoc Hum Genet. 2019 Sep;103(1):e92. doi: 10.1002/cphg.92.