Influenza detection from emergency department reports using natural language processing and Bayesian network classifiers.

Author affiliations

Real-time Outbreak and Disease Surveillance Laboratory (RODS), Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA; Intelligent Systems Program, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.

Real-time Outbreak and Disease Surveillance Laboratory (RODS), Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.

Publication information

J Am Med Inform Assoc. 2014 Sep-Oct;21(5):815-23. doi: 10.1136/amiajnl-2013-001934. Epub 2014 Jan 9.

DOI:10.1136/amiajnl-2013-001934
PMID:24406261
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4147621/
Abstract

OBJECTIVES

To evaluate factors affecting performance of influenza detection, including accuracy of natural language processing (NLP), discriminative ability of Bayesian network (BN) classifiers, and feature selection.

METHODS

We derived a testing dataset of 124 influenza patients and 87 non-influenza (shigellosis) patients. To assess NLP finding-extraction performance, we measured the overall accuracy, recall, and precision of Topaz and MedLEE parsers for 31 influenza-related findings against a reference standard established by three physician reviewers. To elucidate the relative contribution of NLP and BN classifier to classification performance, we compared the discriminative ability of nine combinations of finding-extraction methods (expert, Topaz, and MedLEE) and classifiers (one human-parameterized BN and two machine-parameterized BNs). To assess the effects of feature selection, we conducted secondary analyses of discriminative ability using the most influential findings defined by their likelihood ratios.
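The finding-extraction comparison described above can be illustrated with a minimal sketch. It assumes, as a simplification not stated in the abstract, that each finding is recorded as a binary present/absent value per report; the function name `extraction_metrics` and the toy labels are hypothetical.

```python
def extraction_metrics(reference, extracted):
    """Score extracted finding values against a reference standard.

    Both arguments are equal-length lists of booleans
    (True = finding present). Binary values are an assumption
    made for this sketch.
    """
    if len(reference) != len(extracted):
        raise ValueError("reference and extracted must have equal length")
    if not reference:
        raise ValueError("inputs must not be empty")

    # Tally the four cells of the confusion matrix.
    tp = sum(1 for r, e in zip(reference, extracted) if r and e)
    fp = sum(1 for r, e in zip(reference, extracted) if not r and e)
    fn = sum(1 for r, e in zip(reference, extracted) if r and not e)
    tn = sum(1 for r, e in zip(reference, extracted) if not r and not e)

    accuracy = (tp + tn) / len(reference)
    # Recall and precision are undefined when their denominators are zero.
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    precision = tp / (tp + fp) if (tp + fp) else float("nan")
    return {"accuracy": accuracy, "recall": recall, "precision": precision}


# Hypothetical toy labels, not data from the study.
reference = [True, True, False, False, True]
extracted = [True, False, False, True, True]
metrics = extraction_metrics(reference, extracted)
```

In practice the same tallies would be pooled over all reports and all 31 findings for each parser before the two parsers are compared.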

RESULTS

The overall accuracy of Topaz was significantly better than that of MedLEE (with post-processing) (0.78 vs 0.71, p<0.0001). Classifiers using human-annotated findings were superior to classifiers using Topaz/MedLEE-extracted findings (average area under the receiver operating characteristic curve (AUROC): 0.75 vs 0.68, p=0.0113), and machine-parameterized classifiers were superior to the human-parameterized classifier (average AUROC: 0.73 vs 0.66, p=0.0059). The classifiers using the 17 'most influential' findings were more accurate than classifiers using all 31 subject-matter expert-identified findings (average AUROC: 0.76 vs 0.70, p<0.05).
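For readers unfamiliar with AUROC, the sketch below computes it directly from its rank interpretation: the probability that a randomly chosen influenza case receives a higher classifier score than a randomly chosen non-influenza case, with ties counted as one half. The function name `auroc` and the scores are hypothetical, and this is not presented as the procedure the authors used.

```python
def auroc(scores_pos, scores_neg):
    """Area under the ROC curve via pairwise comparison.

    scores_pos: classifier scores for true cases (e.g. influenza).
    scores_neg: classifier scores for non-cases.
    Runs in O(len(scores_pos) * len(scores_neg)), which is fine
    for datasets of a few hundred patients.
    """
    if not scores_pos or not scores_neg:
        raise ValueError("both score lists must be non-empty")
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0   # case ranked above non-case
            elif p == n:
                wins += 0.5   # a tie counts as half a win
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical posterior probabilities, not data from the study.
flu_scores = [0.9, 0.8, 0.4]
other_scores = [0.7, 0.3, 0.4]
area = auroc(flu_scores, other_scores)
```

An AUROC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which gives a scale for reading the averages reported above.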

CONCLUSIONS

Using a three-component evaluation method we demonstrated how one could elucidate the relative contributions of components under an integrated framework. To improve classification performance, this study encourages researchers to improve NLP accuracy, use a machine-parameterized classifier, and apply feature selection methods.
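The abstract does not state the exact criterion used to pick the "most influential" findings, so the following is only a sketch of one plausible likelihood-ratio filter. The smoothing constant, the threshold of 2.0, the function names, and the counts are all assumptions for illustration; only the group sizes (124 and 87) come from the abstract.

```python
import math


def positive_likelihood_ratio(present_flu, n_flu, present_other, n_other):
    """LR+ = P(finding present | flu) / P(finding present | not flu).

    Adds 0.5 to each count (an assumed smoothing choice) so the
    ratio stays finite when a finding never occurs in one group.
    """
    p_given_flu = (present_flu + 0.5) / (n_flu + 1.0)
    p_given_other = (present_other + 0.5) / (n_other + 1.0)
    return p_given_flu / p_given_other


def select_influential(counts, n_flu, n_other, threshold=2.0):
    """Keep findings whose LR+ is far from 1 in either direction.

    counts maps finding name -> (present in flu, present in non-flu).
    Returns (name, LR+) pairs sorted by |log LR+|, strongest first.
    """
    selected = []
    for name, (present_flu, present_other) in counts.items():
        lr = positive_likelihood_ratio(present_flu, n_flu, present_other, n_other)
        if lr >= threshold or lr <= 1.0 / threshold:
            selected.append((name, lr))
    selected.sort(key=lambda item: abs(math.log(item[1])), reverse=True)
    return selected


# Hypothetical counts, not data from the study.
toy_counts = {"cough": (100, 10), "diarrhea": (20, 70), "fatigue": (40, 30)}
influential = select_influential(toy_counts, n_flu=124, n_other=87)
```

With these toy counts, a finding that is about equally common in both groups is dropped, while findings that strongly favor either diagnosis are kept.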

Similar articles

1
Influenza detection from emergency department reports using natural language processing and Bayesian network classifiers.
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):815-23. doi: 10.1136/amiajnl-2013-001934. Epub 2014 Jan 9.
2
A study of the transferability of influenza case detection systems between two large healthcare systems.
PLoS One. 2017 Apr 5;12(4):e0174970. doi: 10.1371/journal.pone.0174970. eCollection 2017.
3
Comparison of machine learning classifiers for influenza detection from emergency department free-text reports.
J Biomed Inform. 2015 Dec;58:60-69. doi: 10.1016/j.jbi.2015.08.019. Epub 2015 Sep 16.
4
The effects of natural language processing on cross-institutional portability of influenza case detection for disease surveillance.
Appl Clin Inform. 2017 May 31;8(2):560-580. doi: 10.4338/ACI-2016-12-RA-0211.
5
Automated outcome classification of emergency department computed tomography imaging reports.
Acad Emerg Med. 2013 Aug;20(8):848-54. doi: 10.1111/acem.12174.
6
Use of semantic features to classify patient smoking status.
AMIA Annu Symp Proc. 2008 Nov 6;2008:450-4.
7
Recognition of medication information from discharge summaries using ensembles of classifiers.
BMC Med Inform Decis Mak. 2012 May 7;12:36. doi: 10.1186/1472-6947-12-36.
8
Word2Vec inversion and traditional text classifiers for phenotyping lupus.
BMC Med Inform Decis Mak. 2017 Aug 22;17(1):126. doi: 10.1186/s12911-017-0518-1.
9
Classification of clinically useful sentences in clinical evidence resources.
J Biomed Inform. 2016 Apr;60:14-22. doi: 10.1016/j.jbi.2016.01.003. Epub 2016 Jan 13.
10
Ensembles of natural language processing systems for portable phenotyping solutions.
J Biomed Inform. 2019 Dec;100:103318. doi: 10.1016/j.jbi.2019.103318. Epub 2019 Oct 23.

Cited by

1
A Cross-Sectional Study on Whether Comprehensively Gathering Information From Medical Records Is Useful for the Collection of Operational Characteristics.
Cureus. 2024 Jun 4;16(6):e61641. doi: 10.7759/cureus.61641. eCollection 2024 Jun.
2
Natural Language Processing for Clinical Laboratory Data Repository Systems: Implementation and Evaluation for Respiratory Viruses.
JMIR AI. 2023 Jun 6;2:e44835. doi: 10.2196/44835.
3
Using natural language processing in emergency medicine health service research: A systematic review and meta-analysis.
Acad Emerg Med. 2024 Jul;31(7):696-706. doi: 10.1111/acem.14937. Epub 2024 May 16.
4
Automated, machine learning-based alerts increase epilepsy surgery referrals: A randomized controlled trial.
Epilepsia. 2023 Jul;64(7):1791-1799. doi: 10.1111/epi.17629. Epub 2023 May 27.
5
Implementation of Machine Learning Pipelines for Clinical Practice: Development and Validation Study.
JMIR Med Inform. 2022 Dec 16;10(12):e37833. doi: 10.2196/37833.
6
Examining the Use of an Artificial Intelligence Model to Diagnose Influenza: Development and Validation Study.
J Med Internet Res. 2022 Dec 23;24(12):e38751. doi: 10.2196/38751.
7
Disaster and Pandemic Management Using Machine Learning: A Survey.
IEEE Internet Things J. 2020 Dec 15;8(21):16047-16071. doi: 10.1109/JIOT.2020.3044966. eCollection 2021 Nov 1.
8
Detection and Prevention of Virus Infection.
Adv Exp Med Biol. 2022;1368:21-52. doi: 10.1007/978-981-16-8969-7_2.
9
A scholarly network of AI research with an information science focus: Global North and Global South perspectives.
PLoS One. 2022 Apr 15;17(4):e0266565. doi: 10.1371/journal.pone.0266565. eCollection 2022.
10
Machine Learning and Syncope Management in the ED: The Future Is Coming.
Medicina (Kaunas). 2021 Apr 6;57(4):351. doi: 10.3390/medicina57040351.

References

1
Probabilistic, Decision-theoretic Disease Surveillance and Control.
Online J Public Health Inform. 2011;3(3). doi: 10.5210/ojphi.v3i3.3798. Epub 2011 Dec 22.
2
Probabilistic case detection for disease surveillance using data in electronic medical records.
Online J Public Health Inform. 2011;3(3). doi: 10.5210/ojphi.v3i3.3793. Epub 2011 Dec 22.
3
Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.
J Am Med Inform Assoc. 2013 Jun;20(e1):e147-54. doi: 10.1136/amiajnl-2012-000896. Epub 2013 Mar 26.
4
Modeling and executing electronic health records driven phenotyping algorithms using the NQF Quality Data Model and JBoss® Drools Engine.
AMIA Annu Symp Proc. 2012;2012:532-41. Epub 2012 Nov 3.
5
Importance of multi-modal approaches to effectively identify cataract cases from electronic health records.
J Am Med Inform Assoc. 2012 Mar-Apr;19(2):225-34. doi: 10.1136/amiajnl-2011-000456.
6
Comparison of natural language processing biosurveillance methods for identifying influenza from encounter notes.
Ann Intern Med. 2012 Jan 3;156(1 Pt 1):11-8. doi: 10.7326/0003-4819-156-1-201201030-00003.
7
Analyzing the heterogeneity and complexity of Electronic Health Record oriented phenotyping algorithms.
AMIA Annu Symp Proc. 2011;2011:274-83. Epub 2011 Oct 22.
8
The SHARPn project on secondary use of Electronic Medical Record data: progress, plans, and possibilities.
AMIA Annu Symp Proc. 2011;2011:248-56. Epub 2011 Oct 22.
9
pROC: an open-source package for R and S+ to analyze and compare ROC curves.
BMC Bioinformatics. 2011 Mar 17;12:77. doi: 10.1186/1471-2105-12-77.
10
An efficient Bayesian method for predicting clinical outcomes from genome-wide data.
AMIA Annu Symp Proc. 2010 Nov 13;2010:127-31.