• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高保真表型分析:丰富性与无偏性

High-fidelity phenotyping: richness and freedom from bias.

作者信息

Hripcsak George, Albers David J

机构信息

Department of Biomedical Informatics, Columbia University Medical Center, New York, NY, USA.

出版信息

J Am Med Inform Assoc. 2018 Mar 1;25(3):289-294. doi: 10.1093/jamia/ocx110.

DOI:10.1093/jamia/ocx110
PMID:29040596
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7282504/
Abstract

Electronic health record phenotyping is the use of raw electronic health record data to assert characterizations about patients. Researchers have been doing it since the beginning of biomedical informatics, under different names. Phenotyping will benefit from an increasing focus on fidelity, both in the sense of increasing richness, such as measured levels, degree or severity, timing, probability, or conceptual relationships, and in the sense of reducing bias. Research agendas should shift from merely improving binary assignment to studying and improving richer representations. The field is actively researching new temporal directions and abstract representations, including deep learning. The field would benefit from research in nonlinear dynamics, in combining mechanistic models with empirical data, including data assimilation, and in topology. The health care process produces substantial bias, and studying that bias explicitly rather than treating it as merely another source of noise would facilitate addressing it.

摘要

电子健康记录表型分析是利用原始电子健康记录数据来确定患者的特征。自生物医学信息学诞生之初,研究人员就一直在进行这项工作,只是名称不同。表型分析将受益于对保真度的日益关注,这既体现在增加丰富度方面,如测量水平、程度或严重性、时间、概率或概念关系,也体现在减少偏差方面。研究议程应从仅仅改进二元赋值转向研究和改进更丰富的表示形式。该领域正在积极研究新的时间方向和抽象表示形式,包括深度学习。该领域将受益于非线性动力学、将机制模型与经验数据相结合(包括数据同化)以及拓扑学方面的研究。医疗保健过程会产生大量偏差,明确研究这种偏差而非仅仅将其视为另一种噪声来源将有助于解决该问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae0c/7282504/c912fb4ad642/ocx110f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae0c/7282504/e6ee462531a1/ocx110f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae0c/7282504/c912fb4ad642/ocx110f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae0c/7282504/e6ee462531a1/ocx110f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae0c/7282504/c912fb4ad642/ocx110f2.jpg

相似文献

1
High-fidelity phenotyping: richness and freedom from bias.高保真表型分析:丰富性与无偏性
J Am Med Inform Assoc. 2018 Mar 1;25(3):289-294. doi: 10.1093/jamia/ocx110.
2
Deep Phenotyping of Chinese Electronic Health Records by Recognizing Linguistic Patterns of Phenotypic Narratives With a Sequence Motif Discovery Tool: Algorithm Development and Validation.利用序列基序发现工具识别表型叙述的语言模式对中国电子健康记录进行深度表型分析:算法开发与验证
J Med Internet Res. 2022 Jun 3;24(6):e37213. doi: 10.2196/37213.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Facilitating biomedical researchers' interrogation of electronic health record data: Ideas from outside of biomedical informatics.促进生物医学研究人员对电子健康记录数据的查询:来自生物医学信息学领域之外的观点。
J Biomed Inform. 2016 Apr;60:376-84. doi: 10.1016/j.jbi.2016.03.004. Epub 2016 Mar 10.
5
Possible Sources of Bias in Primary Care Electronic Health Record Data Use and Reuse.基层医疗电子健康记录数据使用与再利用中可能存在的偏差来源。
J Med Internet Res. 2018 May 29;20(5):e185. doi: 10.2196/jmir.9134.
6
High-throughput phenotyping with temporal sequences.高通量表型分析与时间序列。
J Am Med Inform Assoc. 2021 Mar 18;28(4):772-781. doi: 10.1093/jamia/ocaa288.
7
Weakly Semi-supervised phenotyping using Electronic Health records.基于电子健康记录的弱监督表型研究
J Biomed Inform. 2022 Oct;134:104175. doi: 10.1016/j.jbi.2022.104175. Epub 2022 Sep 5.
8
Comparison of Automated Sepsis Identification Methods and Electronic Health Record-based Sepsis Phenotyping: Improving Case Identification Accuracy by Accounting for Confounding Comorbid Conditions.自动脓毒症识别方法与基于电子健康记录的脓毒症表型分析比较:通过考虑混杂合并症提高病例识别准确性
Crit Care Explor. 2019 Oct 30;1(10):e0053. doi: 10.1097/CCE.0000000000000053. eCollection 2019 Oct.
9
Identify and mitigate bias in electronic phenotyping: A comprehensive study from computational perspective.
J Biomed Inform. 2024 Aug;156:104671. doi: 10.1016/j.jbi.2024.104671. Epub 2024 Jun 12.
10
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

引用本文的文献

1
A probabilistic approach for building disease phenotypes across electronic health records.一种基于电子健康记录构建疾病表型的概率方法。
BioData Min. 2025 Jun 11;18(1):39. doi: 10.1186/s13040-025-00454-9.
2
Variation in Phenotype Definitions in Observational Clinical Research: A Review of Three Conditions.观察性临床研究中表型定义的差异:三种疾病的综述
AMIA Annu Symp Proc. 2025 May 22;2024:1021-1029. eCollection 2024.
3
An automation framework for clinical codelist development validated with UK data from patients with multiple long-term conditions.

本文引用的文献

1
Survival Analysis with Electronic Health Record Data: Experiments with Chronic Kidney Disease.利用电子健康记录数据进行生存分析:慢性肾脏病实验
Stat Anal Data Min. 2014 Oct;7(5):385-403. doi: 10.1002/sam.11236. Epub 2014 Aug 19.
2
Personalized glucose forecasting for type 2 diabetes using data assimilation.使用数据同化技术对2型糖尿病进行个性化血糖预测。
PLoS Comput Biol. 2017 Apr 27;13(4):e1005232. doi: 10.1371/journal.pcbi.1005232. eCollection 2017 Apr.
3
Comparing lagged linear correlation, lagged regression, Granger causality, and vector autoregression for uncovering associations in EHR data.
一个用于临床代码列表开发的自动化框架,已通过来自患有多种长期疾病患者的英国数据进行验证。
BMC Med Res Methodol. 2025 May 24;25(1):138. doi: 10.1186/s12874-025-02541-1.
4
CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.队列诊断:使用人群水平特征在观察性数据源网络中进行表型评估。
PLoS One. 2025 Jan 16;20(1):e0310634. doi: 10.1371/journal.pone.0310634. eCollection 2025.
5
The necessity of validity diagnostics when drawing causal inferences from observational data: lessons from a multi-database evaluation of the risk of non-infectious uveitis among patients exposed to Remicade.从观察性数据得出因果推断时进行有效性诊断的必要性:来自一项针对接受类克治疗的患者发生非感染性葡萄膜炎风险的多数据库评估的经验教训。
BMC Med Res Methodol. 2024 Dec 27;24(1):322. doi: 10.1186/s12874-024-02428-7.
6
Value sets and the problem of redundancy in value set repositories.值集与值集存储库中的冗余问题。
PLoS One. 2024 Dec 9;19(12):e0312289. doi: 10.1371/journal.pone.0312289. eCollection 2024.
7
US public health surveillance, reimagined.重新构想的美国公共卫生监测。
Learn Health Syst. 2024 Aug 14;8(4):e10445. doi: 10.1002/lrh2.10445. eCollection 2024 Oct.
8
Derivation and validation of generalized sepsis-induced acute respiratory failure phenotypes among critically ill patients: a retrospective study.危重症患者中广义脓毒症诱导的急性呼吸衰竭表型的推导与验证:一项回顾性研究。
Crit Care. 2024 Oct 1;28(1):321. doi: 10.1186/s13054-024-05061-4.
9
Secondary use of patient data within decentralized studies using the example of rare diseases in Germany: A data scientist's exploration of process and lessons learned.以德国罕见病为例,探讨分散式研究中患者数据的二次利用:一位数据科学家对流程及经验教训的探索。
Digit Health. 2024 Aug 10;10:20552076241265219. doi: 10.1177/20552076241265219. eCollection 2024 Jan-Dec.
10
Derivation and Validation of Generalized Sepsis-induced Acute Respiratory Failure Phenotypes Among Critically Ill Patients: A Retrospective Study.重症患者中广义脓毒症诱导的急性呼吸衰竭表型的推导与验证:一项回顾性研究。
Res Sq. 2024 Apr 30:rs.3.rs-4307475. doi: 10.21203/rs.3.rs-4307475/v1.
比较滞后线性相关、滞后回归、格兰杰因果关系和向量自回归以揭示电子健康记录(EHR)数据中的关联。
AMIA Annu Symp Proc. 2017 Feb 10;2016:779-788. eCollection 2016.
4
Inference of Ancestral Recombination Graphs through Topological Data Analysis.通过拓扑数据分析推断祖先重组图
PLoS Comput Biol. 2016 Aug 17;12(8):e1005071. doi: 10.1371/journal.pcbi.1005071. eCollection 2016 Aug.
5
Predicting Negative Emotions Based on Mobile Phone Usage Patterns: An Exploratory Study.基于手机使用模式预测负面情绪:一项探索性研究。
JMIR Res Protoc. 2016 Aug 10;5(3):e160. doi: 10.2196/resprot.5551.
6
Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records.深度患者:一种从电子健康记录中预测患者未来的无监督表示。
Sci Rep. 2016 May 17;6:26094. doi: 10.1038/srep26094.
7
Learning statistical models of phenotypes using noisy labeled training data.使用带有噪声标签的训练数据学习表型的统计模型。
J Am Med Inform Assoc. 2016 Nov;23(6):1166-1173. doi: 10.1093/jamia/ocw028. Epub 2016 May 12.
8
Electronic medical record phenotyping using the anchor and learn framework.使用锚定与学习框架进行电子病历表型分析。
J Am Med Inform Assoc. 2016 Jul;23(4):731-40. doi: 10.1093/jamia/ocw011. Epub 2016 Apr 23.
9
PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability.PheKB:一个用于创建可移植电子表型算法的目录和工作流程。
J Am Med Inform Assoc. 2016 Nov;23(6):1046-1052. doi: 10.1093/jamia/ocv202. Epub 2016 Mar 28.
10
Tensor factorization toward precision medicine.面向精准医学的张量分解
Brief Bioinform. 2017 May 1;18(3):511-514. doi: 10.1093/bib/bbw026.