• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用具有症状相关性感知的朴素贝叶斯分类器增强基于本体的诊断推理。

Enhancing ontology-driven diagnostic reasoning with a symptom-dependency-aware Naïve Bayes classifier.

机构信息

School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, People's Republic of China.

Alibaba Group, Bellevue, WA, USA.

出版信息

BMC Bioinformatics. 2019 Jun 13;20(1):330. doi: 10.1186/s12859-019-2924-0.

DOI:10.1186/s12859-019-2924-0
PMID:31196129
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6567606/
Abstract

BACKGROUND

Ontology has attracted substantial attention from both academia and industry. Handling uncertainty reasoning is important in researching ontology. For example, when a patient is suffering from cirrhosis, the appearance of abdominal vein varices is four times more likely than the presence of bitter taste. Such medical knowledge is crucial for decision-making in various medical applications but is missing from existing medical ontologies. In this paper, we aim to discover medical knowledge probabilities from electronic medical record (EMR) texts to enrich ontologies. First, we build an ontology by identifying meaningful entity mentions from EMRs. Then, we propose a symptom-dependency-aware naïve Bayes classifier (SDNB) that is based on the assumption that there is a level of dependency among symptoms. To ensure the accuracy of the diagnostic classification, we incorporate the probability of a disease into the ontology via innovative approaches.

RESULTS

We conduct a series of experiments to evaluate whether the proposed method can discover meaningful and accurate probabilities for medical knowledge. Based on over 30,000 deidentified medical records, we explore 336 abdominal diseases and 81 related symptoms. Among these 336 gastrointestinal diseases, the probabilities of 31 diseases are obtained via our method. These 31 probabilities of diseases and 189 conditional probabilities between diseases and the symptoms are added into the generated ontology.

CONCLUSION

In this paper, we propose a medical knowledge probability discovery method that is based on the analysis and extraction of EMR text data for enriching a medical ontology with probability information. The experimental results demonstrate that the proposed method can effectively identify accurate medical knowledge probability information from EMR data. In addition, the proposed method can efficiently and accurately calculate the probability of a patient suffering from a specified disease, thereby demonstrating the advantage of combining an ontology and a symptom-dependency-aware naïve Bayes classifier.

摘要

背景

本体在学术界和工业界都受到了广泛关注。处理不确定性推理在研究本体时很重要。例如,当患者患有肝硬化时,出现腹部静脉曲胀的可能性是口苦的四倍。这种医学知识对于各种医疗应用的决策至关重要,但现有的医学本体中却没有。在本文中,我们旨在从电子病历(EMR)文本中发现医学知识概率,以丰富本体。首先,我们通过从 EMR 中识别有意义的实体提及来构建本体。然后,我们提出了一种基于症状依赖假设的朴素贝叶斯分类器(SDNB)。为了确保诊断分类的准确性,我们通过创新方法将疾病的概率纳入本体。

结果

我们进行了一系列实验,以评估所提出的方法是否能够发现有意义和准确的医学知识概率。基于超过 30000 份去识别医疗记录,我们探索了 336 种腹部疾病和 81 种相关症状。在这 336 种胃肠道疾病中,有 31 种疾病的概率是通过我们的方法获得的。这些 31 种疾病的概率和 189 种疾病与症状之间的条件概率被添加到生成的本体中。

结论

在本文中,我们提出了一种基于 EMR 文本数据分析和提取的医学知识概率发现方法,用于为医学本体丰富概率信息。实验结果表明,所提出的方法可以有效地从 EMR 数据中识别准确的医学知识概率信息。此外,所提出的方法可以高效准确地计算患者患指定疾病的概率,从而证明了本体和症状依赖的朴素贝叶斯分类器结合的优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/a39e864c9634/12859_2019_2924_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/b3b47663ce52/12859_2019_2924_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/bb15e8f0c4ed/12859_2019_2924_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/2a3bd8c8c2cb/12859_2019_2924_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/cf53c680b4df/12859_2019_2924_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/8c251bc4233a/12859_2019_2924_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/a39e864c9634/12859_2019_2924_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/b3b47663ce52/12859_2019_2924_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/bb15e8f0c4ed/12859_2019_2924_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/2a3bd8c8c2cb/12859_2019_2924_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/cf53c680b4df/12859_2019_2924_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/8c251bc4233a/12859_2019_2924_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b9/6567606/a39e864c9634/12859_2019_2924_Fig6_HTML.jpg

相似文献

1
Enhancing ontology-driven diagnostic reasoning with a symptom-dependency-aware Naïve Bayes classifier.利用具有症状相关性感知的朴素贝叶斯分类器增强基于本体的诊断推理。
BMC Bioinformatics. 2019 Jun 13;20(1):330. doi: 10.1186/s12859-019-2924-0.
2
CBN: Constructing a clinical Bayesian network based on data from the electronic medical record.基于电子病历数据构建临床贝叶斯网络。
J Biomed Inform. 2018 Dec;88:1-10. doi: 10.1016/j.jbi.2018.10.007. Epub 2018 Nov 3.
3
Developing Bayesian networks from a dependency-layered ontology: A proof-of-concept in radiation oncology.从依赖分层本体中开发贝叶斯网络:放射肿瘤学中的概念验证。
Med Phys. 2017 Aug;44(8):4350-4359. doi: 10.1002/mp.12340. Epub 2017 Jun 30.
4
Mucopolysaccharidosis type II detection by Naïve Bayes Classifier: An example of patient classification for a rare disease using electronic medical records from the Canadian Primary Care Sentinel Surveillance Network.基于朴素贝叶斯分类器的黏多糖贮积症 II 型检测:利用加拿大初级保健监测网络的电子病历对罕见病患者进行分类的范例。
PLoS One. 2018 Dec 19;13(12):e0209018. doi: 10.1371/journal.pone.0209018. eCollection 2018.
5
Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.基于链接生物医学本体的疾病-药物领域自动化本体生成框架。
Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.
6
A hierarchical Naïve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays.一种用于处理分类问题中样本异质性的分层朴素贝叶斯模型:在组织微阵列中的应用。
BMC Bioinformatics. 2006 Nov 24;7:514. doi: 10.1186/1471-2105-7-514.
7
[Short Text Classification of EMR Based on Entities and Dependency Parser].基于实体和依存句法分析器的电子病历短文本分类
Zhongguo Yi Liao Qi Xie Za Zhi. 2016;40(4):245-9.
8
An Approach for Learning Expressive Ontologies in Medical Domain.医学领域中可表达本体的学习方法。
J Med Syst. 2015 Aug;39(8):75. doi: 10.1007/s10916-015-0261-z. Epub 2015 Jun 16.
9
Bayesian-knowledge driven ontologies: A framework for fusion of semantic knowledge under uncertainty and incompleteness.贝叶斯知识驱动的本体论:一种在不确定性和不完整性下融合语义知识的框架。
PLoS One. 2024 Mar 27;19(3):e0296864. doi: 10.1371/journal.pone.0296864. eCollection 2024.
10
A Random Forest classifier-based approach in the detection of abnormalities in the retina.基于随机森林分类器的视网膜异常检测方法。
Med Biol Eng Comput. 2019 Jan;57(1):193-203. doi: 10.1007/s11517-018-1878-0. Epub 2018 Aug 4.

引用本文的文献

1
Quantitative Analysis of Diagnostic Reasoning Using Initial Electronic Medical Records.使用初始电子病历对诊断推理进行定量分析。
Diagnostics (Basel). 2025 Jun 18;15(12):1561. doi: 10.3390/diagnostics15121561.
2
Cognitive Computing-Based CDSS in Medical Practice.医学实践中基于认知计算的临床决策支持系统
Health Data Sci. 2021 Jul 22;2021:9819851. doi: 10.34133/2021/9819851. eCollection 2021.
3
Risk assessment for commercial crime and legal protection based on modularity-optimized LM and SCA-optimized CNN.基于模块化优化的长短时记忆网络(LM)和二阶相关分析(SCA)优化的卷积神经网络(CNN)的商业犯罪风险评估与法律保护

本文引用的文献

1
Owlready: Ontology-oriented programming in Python with automatic classification and high level constructs for biomedical ontologies.Owlready:用于生物医学本体的面向本体的Python编程,具备自动分类和高级构造。
Artif Intell Med. 2017 Jul;80:11-28. doi: 10.1016/j.artmed.2017.07.002. Epub 2017 Aug 14.
2
Bidirectional RNN for Medical Event Detection in Electronic Health Records.用于电子健康记录中医疗事件检测的双向循环神经网络
Proc Conf. 2016 Jun;2016:473-482. doi: 10.18653/v1/n16-1056.
3
Automated Classification of Consumer Health Information Needs in Patient Portal Messages.
PeerJ Comput Sci. 2023 Nov 16;9:e1683. doi: 10.7717/peerj-cs.1683. eCollection 2023.
4
Artificial Intelligence (AI) and Internet of Medical Things (IoMT) Assisted Biomedical Systems for Intelligent Healthcare.人工智能 (AI) 和医疗物联网 (IoMT) 辅助的生物医学系统用于智能医疗保健。
Biosensors (Basel). 2022 Jul 25;12(8):562. doi: 10.3390/bios12080562.
5
DeePred-BBB: A Blood Brain Barrier Permeability Prediction Model With Improved Accuracy.DeePred-BBB:一种具有更高准确率的血脑屏障通透性预测模型。
Front Neurosci. 2022 May 3;16:858126. doi: 10.3389/fnins.2022.858126. eCollection 2022.
6
Managing Boundary Uncertainty in Diagnosing the Patients of Rural Area Using Fuzzy and Rough Set.运用模糊集和粗糙集处理农村地区患者诊断中的边界不确定性
J Healthc Inform Res. 2022 Jan 3;6(1):1-47. doi: 10.1007/s41666-021-00109-4. eCollection 2022 Mar.
7
Predictors of COVID-19 Hospital Treatment Outcome.新冠病毒病住院治疗结果的预测因素
Int J Gen Med. 2021 Dec 22;14:10247-10256. doi: 10.2147/IJGM.S334544. eCollection 2021.
8
Predictive Modeling of Outcomes After Traumatic and Nontraumatic Spinal Cord Injury Using Machine Learning: Review of Current Progress and Future Directions.使用机器学习对创伤性和非创伤性脊髓损伤后结局进行预测建模:当前进展与未来方向综述
Neurospine. 2019 Dec;16(4):678-685. doi: 10.14245/ns.1938390.195. Epub 2019 Dec 31.
患者门户消息中消费者健康信息需求的自动分类
AMIA Annu Symp Proc. 2015 Nov 5;2015:1861-70. eCollection 2015.
4
Development of an online, publicly accessible naive Bayesian decision support tool for mammographic mass lesions based on the American College of Radiology (ACR) BI-RADS lexicon.基于美国放射学会(ACR)乳腺影像报告和数据系统(BI-RADS)词典,开发一种在线的、可公开访问的用于乳腺钼靶肿块病变的朴素贝叶斯决策支持工具。
Eur Radiol. 2015 Jun;25(6):1768-75. doi: 10.1007/s00330-014-3570-6. Epub 2015 Jan 11.
5
Accuracy of a computer-based diagnostic program for ambulatory patients with knee pain.针对膝关节疼痛门诊患者的计算机辅助诊断程序的准确性
Am J Sports Med. 2014 Oct;42(10):2371-6. doi: 10.1177/0363546514541654. Epub 2014 Jul 29.
6
Human symptoms-disease network.人类症状-疾病网络。
Nat Commun. 2014 Jun 26;5:4212. doi: 10.1038/ncomms5212.
7
Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification.从临床文本中进行全面的时间信息检测:医学事件、时间和 TLINK 识别。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):836-42. doi: 10.1136/amiajnl-2013-001622. Epub 2013 Apr 4.
8
Towards comprehensive syntactic and semantic annotations of the clinical narrative.朝着临床叙述的全面句法和语义标注努力。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):922-30. doi: 10.1136/amiajnl-2012-001317. Epub 2013 Jan 25.
9
Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification.基于知识的生物医学词义消歧:评估及在临床文档分类中的应用。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):882-6. doi: 10.1136/amiajnl-2012-001350. Epub 2012 Oct 16.
10
Next-generation phenotyping of electronic health records.电子健康记录的下一代表型分析。
J Am Med Inform Assoc. 2013 Jan 1;20(1):117-21. doi: 10.1136/amiajnl-2012-001145. Epub 2012 Sep 6.