• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用句法模式和大型语言模型识别症状病因。

Identifying symptom etiologies using syntactic patterns and large language models.

机构信息

Allen Institute for AI, Seattle, USA.

Faculty of Biomedical Engineering, Technion, Haifa, Israel.

出版信息

Sci Rep. 2024 Jul 13;14(1):16190. doi: 10.1038/s41598-024-65645-6.

DOI:10.1038/s41598-024-65645-6
PMID:39003296
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11246441/
Abstract

Differential diagnosis is a crucial aspect of medical practice, as it guides clinicians to accurate diagnoses and effective treatment plans. Traditional resources, such as medical books and services like UpToDate, are constrained by manual curation, potentially missing out on novel or less common findings. This paper introduces and analyzes two novel methods to mine etiologies from scientific literature. The first method employs a traditional Natural Language Processing (NLP) approach based on syntactic patterns. By using a novel application of human-guided pattern bootstrapping patterns are derived quickly, and symptom etiologies are extracted with significant coverage. The second method utilizes generative models, specifically GPT-4, coupled with a fact verification pipeline, marking a pioneering application of generative techniques in etiology extraction. Analyzing this second method shows that while it is highly precise, it offers lesser coverage compared to the syntactic approach. Importantly, combining both methodologies yields synergistic outcomes, enhancing the depth and reliability of etiology mining.

摘要

鉴别诊断是医学实践中的一个关键环节,因为它指导临床医生做出准确的诊断和有效的治疗计划。传统资源,如医学书籍和 UpToDate 等服务,受到人工编辑的限制,可能会错过新颖或不太常见的发现。本文介绍并分析了两种从科学文献中挖掘病因的新方法。第一种方法采用基于句法模式的传统自然语言处理(NLP)方法。通过新颖的人工引导模式自举应用,可以快速得出模式,并以显著的覆盖率提取症状病因。第二种方法利用生成模型,特别是 GPT-4,并结合事实验证管道,这标志着生成技术在病因提取中的开创性应用。分析第二种方法表明,虽然它具有很高的准确性,但与句法方法相比,它的覆盖范围较小。重要的是,结合这两种方法可以产生协同效果,增强病因挖掘的深度和可靠性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/a05c1f5622d8/41598_2024_65645_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/935d741c75ed/41598_2024_65645_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/b44ae37533ac/41598_2024_65645_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/4371bf9fed4d/41598_2024_65645_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/be734e4636c6/41598_2024_65645_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/0247f5a94b53/41598_2024_65645_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/7e696d018e10/41598_2024_65645_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/a05c1f5622d8/41598_2024_65645_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/935d741c75ed/41598_2024_65645_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/b44ae37533ac/41598_2024_65645_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/4371bf9fed4d/41598_2024_65645_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/be734e4636c6/41598_2024_65645_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/0247f5a94b53/41598_2024_65645_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/7e696d018e10/41598_2024_65645_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65cb/11246441/a05c1f5622d8/41598_2024_65645_Fig7_HTML.jpg

相似文献

1
Identifying symptom etiologies using syntactic patterns and large language models.使用句法模式和大型语言模型识别症状病因。
Sci Rep. 2024 Jul 13;14(1):16190. doi: 10.1038/s41598-024-65645-6.
2
Social media mining for birth defects research: A rule-based, bootstrapping approach to collecting data for rare health-related events on Twitter.社交媒体挖掘在出生缺陷研究中的应用:一种基于规则和自举的方法,用于在 Twitter 上收集罕见健康相关事件的数据。
J Biomed Inform. 2018 Nov;87:68-78. doi: 10.1016/j.jbi.2018.10.001. Epub 2018 Oct 4.
3
Induced lexico-syntactic patterns improve information extraction from online medical forums.诱导词汇句法模式可提高从在线医疗论坛中提取信息的能力。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):902-9. doi: 10.1136/amiajnl-2014-002669. Epub 2014 Jun 26.
4
Location-enhanced syntactic knowledge for biomedical relation extraction.基于位置增强的生物医学关系抽取句法知识。
J Biomed Inform. 2024 Aug;156:104676. doi: 10.1016/j.jbi.2024.104676. Epub 2024 Jun 12.
5
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.电子健康记录中自由文本叙述的症状的自然语言处理:系统评价。
J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173.
6
Clinical trial cohort selection based on multi-level rule-based natural language processing system.基于多层次规则的自然语言处理系统的临床试验队列选择。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1218-1226. doi: 10.1093/jamia/ocz109.
7
Enhancing the coverage of SemRep using a relation classification approach.利用关系分类方法增强 SemRep 的覆盖范围。
J Biomed Inform. 2024 Jul;155:104658. doi: 10.1016/j.jbi.2024.104658. Epub 2024 May 21.
8
Natural Language Processing Technologies in Radiology Research and Clinical Applications.放射学研究与临床应用中的自然语言处理技术
Radiographics. 2016 Jan-Feb;36(1):176-91. doi: 10.1148/rg.2016150080.
9
Using data-driven sublanguage pattern mining to induce knowledge models: application in medical image reports knowledge representation.利用数据驱动的子语言模式挖掘来诱导知识模型:在医学图像报告知识表示中的应用。
BMC Med Inform Decis Mak. 2018 Jul 6;18(1):61. doi: 10.1186/s12911-018-0645-3.
10
Semantic biomedical resource discovery: a Natural Language Processing framework.语义生物医学资源发现:一种自然语言处理框架。
BMC Med Inform Decis Mak. 2015 Sep 30;15:77. doi: 10.1186/s12911-015-0200-4.

引用本文的文献

1
Characterisation of serious mental illness trajectories through transdiagnostic clinical features.通过跨诊断临床特征对严重精神疾病轨迹进行特征描述。
Br J Psychiatry. 2025 Jun 23:1-8. doi: 10.1192/bjp.2025.107.
2
Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis.医学诊断中的大语言模型:基于文献计量分析的综述
J Med Internet Res. 2025 Jun 9;27:e72062. doi: 10.2196/72062.

本文引用的文献

1
A Deep Language Model for Symptom Extraction From Clinical Text and its Application to Extract COVID-19 Symptoms From Social Media.一种从临床文本中提取症状的深度语言模型及其在从社交媒体中提取 COVID-19 症状的应用。
IEEE J Biomed Health Inform. 2022 Apr;26(4):1737-1748. doi: 10.1109/JBHI.2021.3123192. Epub 2022 Apr 14.
2
Recent advances in biomedical literature mining.生物医学文献挖掘的最新进展。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa057.
3
DISNET: a framework for extracting phenotypic disease information from public sources.
DISNET:一个从公共资源中提取疾病表型信息的框架。
PeerJ. 2020 Feb 17;8:e8580. doi: 10.7717/peerj.8580. eCollection 2020.
4
Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.从非结构化临床记录中提取症状的任务定义、标注数据集和监督自然语言处理模型。
J Biomed Inform. 2020 Feb;102:103354. doi: 10.1016/j.jbi.2019.103354. Epub 2019 Dec 12.
5
DiseaSE: A biomedical text analytics system for disease symptom extraction and characterization.疾病:用于疾病症状提取和特征描述的生物医学文本分析系统。
J Biomed Inform. 2019 Dec;100:103324. doi: 10.1016/j.jbi.2019.103324. Epub 2019 Oct 31.
6
Disease networks and their contribution to disease understanding: A review of their evolution, techniques and data sources.疾病网络及其对疾病认识的贡献:对其演化、技术和数据源的综述。
J Biomed Inform. 2019 Jun;94:103206. doi: 10.1016/j.jbi.2019.103206. Epub 2019 May 8.
7
Mining Disease-Symptom Relation from Massive Biomedical Literature and Its Application in Severe Disease Diagnosis.从海量生物医学文献中挖掘疾病-症状关系及其在重症疾病诊断中的应用
AMIA Annu Symp Proc. 2018 Dec 5;2018:1118-1126. eCollection 2018.
8
Semisupervised Learning Based Disease-Symptom and Symptom-Therapeutic Substance Relation Extraction from Biomedical Literature.基于半监督学习的生物医学文献中疾病-症状及症状-治疗物质关系提取
Biomed Res Int. 2016;2016:3594937. doi: 10.1155/2016/3594937. Epub 2016 Oct 16.
9
BioCreative V CDR task corpus: a resource for chemical disease relation extraction.生物创意V化学疾病关系提取任务语料库:化学疾病关系提取的资源。
Database (Oxford). 2016 May 9;2016. doi: 10.1093/database/baw068. Print 2016.
10
Community challenges in biomedical text mining over 10 years: success, failure and the future.十年来生物医学文本挖掘中的社区挑战:成功、失败与未来。
Brief Bioinform. 2016 Jan;17(1):132-44. doi: 10.1093/bib/bbv024. Epub 2015 May 1.