• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Towards automated phenotype definition extraction using large language models.迈向使用大语言模型进行自动化表型定义提取
Genomics Inform. 2024 Oct 31;22(1):21. doi: 10.1186/s44342-024-00023-2.
2
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
3
A hybrid framework with large language models for rare disease phenotyping.基于大语言模型的罕见病表型分析混合框架。
BMC Med Inform Decis Mak. 2024 Oct 8;24(1):289. doi: 10.1186/s12911-024-02698-7.
4
Large language models facilitate the generation of electronic health record phenotyping algorithms.大语言模型有助于电子健康记录表型算法的生成。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1994-2001. doi: 10.1093/jamia/ocae072.
5
Large Language Models, scientific knowledge and factuality: A framework to streamline human expert evaluation.大语言模型、科学知识与真实性:简化人类专家评估的框架。
J Biomed Inform. 2024 Oct;158:104724. doi: 10.1016/j.jbi.2024.104724. Epub 2024 Sep 12.
6
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同行用户对解释非专业患者实验室检测结果的答案质量比较:评估研究。
J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.
7
Data Set and Benchmark (MedGPTEval) to Evaluate Responses From Large Language Models in Medicine: Evaluation Development and Validation.用于评估医学领域大语言模型回复的数据集和基准(MedGPTEval):评估开发与验证
JMIR Med Inform. 2024 Jun 28;12:e57674. doi: 10.2196/57674.
8
Identifying and Extracting Rare Diseases and Their Phenotypes with Large Language Models.使用大语言模型识别和提取罕见疾病及其表型
J Healthc Inform Res. 2024 Jan 5;8(2):438-461. doi: 10.1007/s41666-023-00155-0. eCollection 2024 Jun.
9
Programming techniques for improving rule readability for rule-based information extraction natural language processing pipelines of unstructured and semi-structured medical texts.用于改进基于规则的信息抽取自然语言处理管道的规则可读性的编程技术,这些管道处理非结构化和半结构化的医学文本。
Health Informatics J. 2023 Apr-Jun;29(2):14604582231164696. doi: 10.1177/14604582231164696.
10
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models.利用生成式人工智能辅助学习罕见且复杂的诊断:对流行的大型语言模型的定性研究。
JMIR Med Educ. 2024 Feb 13;10:e51391. doi: 10.2196/51391.

引用本文的文献

1
Lightweight Language Models are Prone to Reasoning Errors for Complex Computational Phenotyping Tasks.轻量级语言模型在复杂的计算表型分析任务中容易出现推理错误。
ArXiv. 2025 Jul 30:arXiv:2507.23146v1.
2
SHREC: A Framework for Advancing Next-Generation Computational Phenotyping with Large Language Models.SHREC:一个利用大语言模型推进下一代计算表型分析的框架。
ArXiv. 2025 Jul 17:arXiv:2506.16359v3.

本文引用的文献

1
Enhancing phenotype recognition in clinical notes using large language models: PhenoBCBERT and PhenoGPT.使用大语言模型增强临床笔记中的表型识别:PhenoBCBERT和PhenoGPT。
Patterns (N Y). 2023 Dec 5;5(1):100887. doi: 10.1016/j.patter.2023.100887. eCollection 2024 Jan 12.
2
Ontologizing health systems data at scale: making translational discovery a reality.大规模实现卫生系统数据本体化:让转化性发现成为现实。
NPJ Digit Med. 2023 May 19;6(1):89. doi: 10.1038/s41746-023-00830-x.
3
Characterizing variability of electronic health record-driven phenotype definitions.电子健康记录驱动的表型定义的变异性特征描述。
J Am Med Inform Assoc. 2023 Feb 16;30(3):427-437. doi: 10.1093/jamia/ocac235.
4
PhenoTagger: a hybrid method for phenotype concept recognition using human phenotype ontology.PhenoTagger:一种使用人类表型本体进行表型概念识别的混合方法。
Bioinformatics. 2021 Jul 27;37(13):1884-1890. doi: 10.1093/bioinformatics/btab019.
5
Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability.深度表型分析:拥抱复杂性和时间性——迈向可扩展性、便携性和互操作性。
J Biomed Inform. 2020 May;105:103433. doi: 10.1016/j.jbi.2020.103433. Epub 2020 Apr 23.
6
Considerations for Improving the Portability of Electronic Health Record-Based Phenotype Algorithms.提高基于电子健康记录的表型算法便携性的考量因素。
AMIA Annu Symp Proc. 2020 Mar 4;2019:755-764. eCollection 2019.
7
Combining deep learning with token selection for patient phenotyping from electronic health records.从电子健康记录中进行患者表型分析的深度学习与标记选择相结合。
Sci Rep. 2020 Jan 29;10(1):1432. doi: 10.1038/s41598-020-58178-1.
8
Toward clinical digital phenotyping: a timely opportunity to consider purpose, quality, and safety.迈向临床数字表型分析:一个考虑目的、质量和安全性的适时机遇。
NPJ Digit Med. 2019 Sep 6;2:88. doi: 10.1038/s41746-019-0166-1. eCollection 2019.
9
PheValuator: Development and evaluation of a phenotype algorithm evaluator.PheValuator:表型算法评估器的开发与评估。
J Biomed Inform. 2019 Sep;97:103258. doi: 10.1016/j.jbi.2019.103258. Epub 2019 Jul 29.
10
Advances in Electronic Phenotyping: From Rule-Based Definitions to Machine Learning Models.电子表型分析的进展:从基于规则的定义到机器学习模型
Annu Rev Biomed Data Sci. 2018 Jul;1:53-68. doi: 10.1146/annurev-biodatasci-080917-013315. Epub 2018 May 23.

迈向使用大语言模型进行自动化表型定义提取

Towards automated phenotype definition extraction using large language models.

作者信息

Tekumalla Ramya, Banda Juan M

机构信息

Mercer University, Atlanta, GA, USA.

Stanford Health Care, Stanford, CA, USA.

出版信息

Genomics Inform. 2024 Oct 31;22(1):21. doi: 10.1186/s44342-024-00023-2.

DOI:10.1186/s44342-024-00023-2
PMID:39482749
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11529293/
Abstract

Electronic phenotyping involves a detailed analysis of both structured and unstructured data, employing rule-based methods, machine learning, natural language processing, and hybrid approaches. Currently, the development of accurate phenotype definitions demands extensive literature reviews and clinical experts, rendering the process time-consuming and inherently unscalable. Large language models offer a promising avenue for automating phenotype definition extraction but come with significant drawbacks, including reliability issues, the tendency to generate non-factual data ("hallucinations"), misleading results, and potential harm. To address these challenges, our study embarked on two key objectives: (1) defining a standard evaluation set to ensure large language models outputs are both useful and reliable and (2) evaluating various prompting approaches to extract phenotype definitions from large language models, assessing them with our established evaluation task. Our findings reveal promising results that still require human evaluation and validation for this task. However, enhanced phenotype extraction is possible, reducing the amount of time spent in literature review and evaluation.

摘要

电子表型分析涉及对结构化和非结构化数据进行详细分析,采用基于规则的方法、机器学习、自然语言处理以及混合方法。目前,准确的表型定义的开发需要广泛的文献综述和临床专家参与,这使得该过程既耗时又本质上难以扩展。大语言模型为自动提取表型定义提供了一条有前景的途径,但也存在重大缺陷,包括可靠性问题、生成非事实数据(“幻觉”)的倾向、误导性结果以及潜在危害。为应对这些挑战,我们的研究着手实现两个关键目标:(1)定义一个标准评估集,以确保大语言模型的输出既有用又可靠;(2)评估各种提示方法,以便从大语言模型中提取表型定义,并通过我们既定的评估任务对其进行评估。我们的研究结果显示出有前景的成果,但对于这项任务仍需要人工评估和验证。不过,增强的表型提取是可行的,这减少了在文献综述和评估中花费的时间。