一种用于从电子健康记录中自动提取严重程度的分类方法的开发与验证。

Development and validation of a classification approach for extracting severity automatically from electronic health records.

作者信息

Boland Mary Regina, Tatonetti Nicholas P, Hripcsak George

机构信息

Department of Biomedical Informatics, Columbia University, New York, NY USA ; Observational Health Data Sciences and Informatics (OHDSI), Columbia University, 622 West 168th Street, PH-20, New York, NY USA ; Department of Systems Biology, Columbia University, New York, NY USA ; Department of Medicine, Columbia University, New York, NY USA.

出版信息

J Biomed Semantics. 2015 Apr 6;6:14. doi: 10.1186/s13326-015-0010-8. eCollection 2015.

DOI:10.1186/s13326-015-0010-8

PMID:25848530

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4386082/

Abstract

BACKGROUND

Electronic Health Records (EHRs) contain a wealth of information useful for studying clinical phenotype-genotype relationships. Severity is important for distinguishing among phenotypes; however other severity indices classify patient-level severity (e.g., mild vs. acute dermatitis) rather than phenotype-level severity (e.g., acne vs. myocardial infarction). Phenotype-level severity is independent of the individual patient's state and is relative to other phenotypes. Further, phenotype-level severity does not change based on the individual patient. For example, acne is mild at the phenotype-level and relative to other phenotypes. Therefore, a given patient may have a severe form of acne (this is the patient-level severity), but this does not effect its overall designation as a mild phenotype at the phenotype-level.

METHODS

We present a method for classifying severity at the phenotype-level that uses the Systemized Nomenclature of Medicine - Clinical Terms. Our method is called the Classification Approach for Extracting Severity Automatically from Electronic Health Records (CAESAR). CAESAR combines multiple severity measures - number of comorbidities, medications, procedures, cost, treatment time, and a proportional index term. CAESAR employs a random forest algorithm and these severity measures to discriminate between severe and mild phenotypes.

RESULTS

Using a random forest algorithm and these severity measures as input, CAESAR differentiates between severe and mild phenotypes (sensitivity = 91.67, specificity = 77.78) when compared to a manually evaluated reference standard (k = 0.716).

CONCLUSIONS

CAESAR enables researchers to measure phenotype severity from EHRs to identify phenotypes that are important for comparative effectiveness research.

摘要

背景

电子健康记录（EHRs）包含大量有助于研究临床表型 - 基因型关系的信息。严重程度对于区分表型很重要；然而，其他严重程度指数是对患者层面的严重程度进行分类（例如，轻度与急性皮炎），而非表型层面的严重程度（例如，痤疮与心肌梗死）。表型层面的严重程度独立于个体患者的状态，并且相对于其他表型而言。此外，表型层面的严重程度不会因个体患者而改变。例如，痤疮在表型层面是轻度的，并且相对于其他表型也是如此。因此，给定患者可能患有严重形式的痤疮（这是患者层面的严重程度），但这并不影响其在表型层面整体被指定为轻度表型。

方法

我们提出一种在表型层面进行严重程度分类的方法，该方法使用医学系统命名法 - 临床术语。我们的方法称为从电子健康记录中自动提取严重程度的分类方法（CAESAR）。CAESAR结合了多种严重程度度量指标——合并症数量、用药情况、手术操作、费用、治疗时间以及一个比例指数项。CAESAR采用随机森林算法以及这些严重程度度量指标来区分严重和轻度表型。

结果

将随机森林算法和这些严重程度度量指标作为输入，与人工评估的参考标准相比（κ = 0.716），CAESAR能够区分严重和轻度表型（敏感性 = 91.67，特异性 = 77.78）。

结论

CAESAR使研究人员能够从电子健康记录中测量表型严重程度，以识别对比较效果研究重要的表型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c64/4386082/c6339b8e0bd6/13326_2015_10_Fig1_HTML.jpg

相似文献

Development and validation of a classification approach for extracting severity automatically from electronic health records.一种用于从电子健康记录中自动提取严重程度的分类方法的开发与验证。

J Biomed Semantics. 2015 Apr 6;6:14. doi: 10.1186/s13326-015-0010-8. eCollection 2015.

Improving condition severity classification with an efficient active learning based framework.使用基于高效主动学习的框架改进病情严重程度分类。

J Biomed Inform. 2016 Jun;61:44-54. doi: 10.1016/j.jbi.2016.03.016. Epub 2016 Mar 22.

Development and validation of a claims-based prediction model for COPD severity.基于索赔数据的 COPD 严重程度预测模型的建立与验证。

Respir Med. 2013 Oct;107(10):1568-77. doi: 10.1016/j.rmed.2013.05.012. Epub 2013 Jun 25.

Semi-supervised learning of the electronic health record for phenotype stratification.用于表型分层的电子健康记录的半监督学习

J Biomed Inform. 2016 Dec;64:168-178. doi: 10.1016/j.jbi.2016.10.007. Epub 2016 Oct 12.

Enhancing electronic health record measurement of depression severity and suicide ideation: a Distributed Ambulatory Research in Therapeutics Network (DARTNet) study.增强电子健康记录对抑郁严重程度和自杀意念的测量：分布式门诊研究治疗网络（DARTNet）研究。

J Am Board Fam Med. 2012 Sep-Oct;25(5):582-93. doi: 10.3122/jabfm.2012.05.110053.

Understanding COPD: A vision on phenotypes, comorbidities and treatment approach.了解慢性阻塞性肺疾病：关于表型、合并症及治疗方法的展望

Rev Port Pneumol (2006). 2016 Mar-Apr;22(2):101-11. doi: 10.1016/j.rppnen.2015.12.001. Epub 2016 Jan 27.

The readiness of SNOMED problem list concepts for meaningful use of electronic health records.SNOMED 问题列表概念对电子健康记录的有效使用的准备情况。

Artif Intell Med. 2013 Jun;58(2):73-80. doi: 10.1016/j.artmed.2013.03.008. Epub 2013 Apr 18.

Clinical redesign using all patient refined diagnosis related groups.使用所有患者细化诊断相关组进行临床重新设计。

Pediatrics. 2004 Oct;114(4):965-9. doi: 10.1542/peds.2004-0650.

Validating an ontology-based algorithm to identify patients with type 2 diabetes mellitus in electronic health records.验证一种基于本体的算法，以在电子健康记录中识别2型糖尿病患者。

Int J Med Inform. 2014 Oct;83(10):768-78. doi: 10.1016/j.ijmedinf.2014.06.002. Epub 2014 Jun 20.

Supporting information retrieval from electronic health records: A report of University of Michigan's nine-year experience in developing and using the Electronic Medical Record Search Engine (EMERSE).支持从电子健康记录中检索信息：密歇根大学开发和使用电子病历搜索引擎（EMERSE）九年经验报告。

J Biomed Inform. 2015 Jun;55:290-300. doi: 10.1016/j.jbi.2015.05.003. Epub 2015 May 13.

引用本文的文献

Developing a Standardization Algorithm for Categorical Laboratory Tests for Clinical Big Data Research: Retrospective Study.开发用于临床大数据研究的分类实验室检查标准化算法：回顾性研究

JMIR Med Inform. 2019 Aug 29;7(3):e14083. doi: 10.2196/14083.

Inter-labeler and intra-labeler variability of condition severity classification models using active and passive learning methods.采用主动学习和被动学习方法的条件严重程度分类模型的标签间和标签内变异性。

Artif Intell Med. 2017 Sep;81:12-32. doi: 10.1016/j.artmed.2017.03.003. Epub 2017 Apr 27.

Automatic health record review to help prioritize gravely ill Social Security disability applicants.自动健康记录审查，以帮助确定重症社会保障残疾申请人的优先顺序。

J Am Med Inform Assoc. 2017 Jul 1;24(4):709-716. doi: 10.1093/jamia/ocw159.

Clinical Genomics: Challenges and Opportunities.临床基因组学：挑战与机遇。

Crit Rev Eukaryot Gene Expr. 2016;26(2):97-113. doi: 10.1615/CritRevEukaryotGeneExpr.2016015724.

PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability.PheKB：一个用于创建可移植电子表型算法的目录和工作流程。

J Am Med Inform Assoc. 2016 Nov;23(6):1046-1052. doi: 10.1093/jamia/ocv202. Epub 2016 Mar 28.

Improving condition severity classification with an efficient active learning based framework.使用基于高效主动学习的框架改进病情严重程度分类。

J Biomed Inform. 2016 Jun;61:44-54. doi: 10.1016/j.jbi.2016.03.016. Epub 2016 Mar 22.

Special issue on bio-ontologies and phenotypes.关于生物本体和表型的特刊。

J Biomed Semantics. 2015 Dec 17;6:40. doi: 10.1186/s13326-015-0040-2. eCollection 2015.

The digital revolution in phenotyping.表型分析中的数字革命。

Brief Bioinform. 2016 Sep;17(5):819-30. doi: 10.1093/bib/bbv083. Epub 2015 Sep 29.

本文引用的文献

Medication-wide association studies.药物广泛关联研究。

CPT Pharmacometrics Syst Pharmacol. 2013 Sep 18;2(9):e76. doi: 10.1038/psp.2013.52.

Mining the ultimate phenome repository.挖掘终极表型组库。

Nat Biotechnol. 2013 Dec;31(12):1095-7. doi: 10.1038/nbt.2757.

Temporal properties of diagnosis code time series in aggregate.总体诊断代码时间序列的时间特性。

IEEE J Biomed Health Inform. 2013 Mar;17(2):477-83. doi: 10.1109/JBHI.2013.2244610.

Discovering body site and severity modifiers in clinical texts.发现临床文本中的身体部位和严重程度修饰语。

J Am Med Inform Assoc. 2014 May-Jun;21(3):448-54. doi: 10.1136/amiajnl-2013-001766. Epub 2013 Oct 3.

Defining a comprehensive verotype using electronic health records for personalized medicine.利用电子健康记录为个性化医疗定义全面的综合基因型。

J Am Med Inform Assoc. 2013 Dec;20(e2):e232-8. doi: 10.1136/amiajnl-2013-001932. Epub 2013 Sep 3.

Correlating electronic health record concepts with healthcare process events.将电子健康记录概念与医疗保健流程事件相关联。

J Am Med Inform Assoc. 2013 Dec;20(e2):e311-8. doi: 10.1136/amiajnl-2013-001922. Epub 2013 Aug 23.

Don't take your EHR to heaven, donate it to science: legal and research policies for EHR post mortem.切勿将电子健康记录带入天堂，将其捐赠给科学：电子健康记录死后的法律和研究政策。

J Am Med Inform Assoc. 2014 Jan-Feb;21(1):8-12. doi: 10.1136/amiajnl-2013-002061. Epub 2013 Aug 21.

Next-generation phenotyping of electronic health records.电子健康记录的下一代表型分析。

J Am Med Inform Assoc. 2013 Jan 1;20(1):117-21. doi: 10.1136/amiajnl-2012-001145. Epub 2012 Sep 6.

Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research.电子健康记录数据质量评估的方法和维度：为临床研究提供可重用性。

J Am Med Inform Assoc. 2013 Jan 1;20(1):144-51. doi: 10.1136/amiajnl-2011-000681. Epub 2012 Jun 25.

Berkson's bias, selection bias, and missing data.伯克森偏倚、选择偏倚和数据缺失。

Epidemiology. 2012 Jan;23(1):159-64. doi: 10.1097/EDE.0b013e31823b6296.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于从电子健康记录中自动提取严重程度的分类方法的开发与验证。

Development and validation of a classification approach for extracting severity automatically from electronic health records.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献