在稀疏数据中识别肥胖及合并症。

Recognizing obesity and comorbidities in sparse data.

作者信息

Uzuner Ozlem

机构信息

University at Albany, SUNY, Albany, NY, USA.

出版信息

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):561-70. doi: 10.1197/jamia.M3115. Epub 2009 Apr 23.

DOI:10.1197/jamia.M3115

PMID:19390096

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2705260/

Abstract

In order to survey, facilitate, and evaluate studies of medical language processing on clinical narratives, i2b2 (Informatics for Integrating Biology to the Bedside) organized its second challenge and workshop. This challenge focused on automatically extracting information on obesity and fifteen of its most common comorbidities from patient discharge summaries. For each patient, obesity and any of the comorbidities could be Present, Absent, or Questionable (i.e., possible) in the patient, or Unmentioned in the discharge summary of the patient. i2b2 provided data for, and invited the development of, automated systems that can classify obesity and its comorbidities into these four classes based on individual discharge summaries. This article refers to obesity and comorbidities as diseases. It refers to the categories Present, Absent, Questionable, and Unmentioned as classes. The task of classifying obesity and its comorbidities is called the Obesity Challenge. The data released by i2b2 was annotated for textual judgments reflecting the explicitly reported information on diseases, and intuitive judgments reflecting medical professionals' reading of the information presented in discharge summaries. There were very few examples of some disease classes in the data. The Obesity Challenge paid particular attention to the performance of systems on these less well-represented classes. A total of 30 teams participated in the Obesity Challenge. Each team was allowed to submit two sets of up to three system runs for evaluation, resulting in a total of 136 submissions. The submissions represented a combination of rule-based and machine learning approaches. Evaluation of system runs shows that the best predictions of textual judgments come from systems that filter the potentially noisy portions of the narratives, project dictionaries of disease names onto the remaining text, apply negation extraction, and process the text through rules. Information on disease-related concepts, such as symptoms and medications, and general medical knowledge help systems infer intuitive judgments on the diseases.

摘要

为了调查、促进和评估关于临床叙述的医学语言处理研究，i2b2（整合生物学与床边信息学）组织了第二届挑战赛和研讨会。本次挑战赛的重点是从患者出院小结中自动提取肥胖及其十五种最常见合并症的信息。对于每位患者，肥胖和任何合并症在患者身上可能为存在、不存在、可疑（即有可能），或者在患者的出院小结中未提及。i2b2提供了数据，并邀请开发能够根据个体出院小结将肥胖及其合并症分类为这四类的自动化系统。本文将肥胖及其合并症称为疾病，将存在、不存在、可疑和未提及这几个类别称为类。将肥胖及其合并症进行分类的任务称为肥胖症挑战赛。i2b2发布的数据经过注释，用于反映关于疾病的明确报告信息的文本判断，以及反映医学专业人员对出院小结中呈现信息的解读的直观判断。数据中某些疾病类别的示例非常少。肥胖症挑战赛特别关注系统在这些代表性较差的类别上的表现。共有30个团队参加了肥胖症挑战赛。每个团队最多可提交两组、每组三个系统运行结果进行评估，总共提交了136份结果。这些提交结果代表了基于规则和机器学习方法的结合。系统运行结果的评估表明，文本判断的最佳预测来自那些过滤叙述中潜在噪声部分、将疾病名称词典应用于剩余文本、进行否定提取并通过规则处理文本的系统。关于疾病相关概念（如症状和药物）的信息以及一般医学知识有助于系统推断对疾病的直观判断。

相似文献

Recognizing obesity and comorbidities in sparse data.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):561-70. doi: 10.1197/jamia.M3115. Epub 2009 Apr 23.

A text mining approach to the prediction of disease status from clinical discharge summaries.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):596-600. doi: 10.1197/jamia.M3096. Epub 2009 Apr 23.

A rule-based approach for identifying obesity and its comorbidities in medical discharge summaries.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):576-9. doi: 10.1197/jamia.M3086. Epub 2009 Apr 23.

A system for classifying disease comorbidity status from medical discharge summaries using automated hotspot and negated concept detection.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):590-5. doi: 10.1197/jamia.M3095. Epub 2009 Apr 23.

Description of a rule-based system for the i2b2 challenge in natural language processing for clinical data.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):571-5. doi: 10.1197/jamia.M3083. Epub 2009 Apr 23.

Semi-automated construction of decision rules to predict morbidities from clinical texts.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):601-5. doi: 10.1197/jamia.M3097. Epub 2009 Apr 23.

Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14.

Identifying patient smoking status from medical discharge records.

J Am Med Inform Assoc. 2008 Jan-Feb;15(1):14-24. doi: 10.1197/jamia.M2408. Epub 2007 Oct 18.

Second i2b2 workshop on natural language processing challenges for clinical records.

AMIA Annu Symp Proc. 2008 Nov 6:1252-3.

Semantic classification of diseases in discharge summaries using a context-aware rule-based classifier.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):580-4. doi: 10.1197/jamia.M3087. Epub 2009 Apr 23.

引用本文的文献

Keyword-optimized template insertion for clinical note classification via prompt-based learning.

BMC Med Inform Decis Mak. 2025 Jul 3;25(1):247. doi: 10.1186/s12911-025-03071-y.

A comparative analysis of machine learning models and human expertise for nursing intervention classification.

JAMIA Open. 2025 Jun 27;8(3):ooaf057. doi: 10.1093/jamiaopen/ooaf057. eCollection 2025 Jun.

Secondary Use of Clinical Problem List Descriptions for Bi-Encoder Based ICD-10 Classification.

AMIA Annu Symp Proc. 2025 May 22;2024:620-627. eCollection 2024.

Question Answering for Electronic Health Records: Scoping Review of Datasets and Models.

J Med Internet Res. 2024 Oct 30;26:e53636. doi: 10.2196/53636.

Knowledge graph construction for heart failure using large language models with prompt engineering.

Front Comput Neurosci. 2024 Jul 2;18:1389475. doi: 10.3389/fncom.2024.1389475. eCollection 2024.

Impact of possible errors in natural language processing-derived data on downstream epidemiologic analysis.

JAMIA Open. 2023 Dec 27;6(4):ooad111. doi: 10.1093/jamiaopen/ooad111. eCollection 2023 Dec.

Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review.

JMIR Med Inform. 2023 Dec 15;11:e42477. doi: 10.2196/42477.

Heart disease risk factors detection from electronic health records using advanced NLP and deep learning techniques.

Sci Rep. 2023 May 3;13(1):7173. doi: 10.1038/s41598-023-34294-6.

Automated Detection of Substance-Use Status and Related Information from Clinical Text.

Sensors (Basel). 2022 Dec 8;22(24):9609. doi: 10.3390/s22249609.

A scoping review of publicly available language tasks in clinical natural language processing.

J Am Med Inform Assoc. 2022 Sep 12;29(10):1797-1806. doi: 10.1093/jamia/ocac127.

本文引用的文献

Description of a rule-based system for the i2b2 challenge in natural language processing for clinical data.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):571-5. doi: 10.1197/jamia.M3083. Epub 2009 Apr 23.

A rule-based approach for identifying obesity and its comorbidities in medical discharge summaries.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):576-9. doi: 10.1197/jamia.M3086. Epub 2009 Apr 23.

Semantic classification of diseases in discharge summaries using a context-aware rule-based classifier.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):580-4. doi: 10.1197/jamia.M3087. Epub 2009 Apr 23.

Natural language processing framework to assess clinical conditions.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):585-9. doi: 10.1197/jamia.M3091. Epub 2009 Apr 23.

A system for classifying disease comorbidity status from medical discharge summaries using automated hotspot and negated concept detection.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):590-5. doi: 10.1197/jamia.M3095. Epub 2009 Apr 23.

A text mining approach to the prediction of disease status from clinical discharge summaries.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):596-600. doi: 10.1197/jamia.M3096. Epub 2009 Apr 23.

Semi-automated construction of decision rules to predict morbidities from clinical texts.

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):601-5. doi: 10.1197/jamia.M3097. Epub 2009 Apr 23.

Second i2b2 workshop on natural language processing challenges for clinical records.

AMIA Annu Symp Proc. 2008 Nov 6:1252-3.

Identifying patient smoking status from medical discharge records.

J Am Med Inform Assoc. 2008 Jan-Feb;15(1):14-24. doi: 10.1197/jamia.M2408. Epub 2007 Oct 18.

The spread of obesity in a large social network over 32 years.

N Engl J Med. 2007 Jul 26;357(4):370-9. doi: 10.1056/NEJMsa066082. Epub 2007 Jul 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在稀疏数据中识别肥胖及合并症。

Recognizing obesity and comorbidities in sparse data.

作者信息

Uzuner Ozlem

机构信息

University at Albany, SUNY, Albany, NY, USA.

出版信息

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):561-70. doi: 10.1197/jamia.M3115. Epub 2009 Apr 23.

DOI:10.1197/jamia.M3115

PMID:19390096

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2705260/

Abstract

摘要

在稀疏数据中识别肥胖及合并症。

Recognizing obesity and comorbidities in sparse data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

在稀疏数据中识别肥胖及合并症。

Recognizing obesity and comorbidities in sparse data.

作者信息

机构信息

出版信息