• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估生成式人工智能在公开可用的结构化基因数据集中识别癌症亚型的能力。

Evaluating Generative AI's Ability to Identify Cancer Subtypes in Publicly Available Structured Genetic Datasets.

作者信息

Hillis Ethan, Bhattarai Kriti, Abrams Zachary

机构信息

Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110, USA.

Department of Computer Science, Washington University in St. Louis, St. Louis, MO 63130, USA.

出版信息

J Pers Med. 2024 Sep 25;14(10):1022. doi: 10.3390/jpm14101022.

DOI:10.3390/jpm14101022
PMID:39452530
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11508243/
Abstract

BACKGROUND

Genetic data play a crucial role in diagnosing and treating various diseases, reflecting a growing imperative to integrate these data into clinical care. However, significant barriers such as the structure of electronic health records (EHRs), insurance costs for genetic testing, and the interpretability of genetic results impede this integration.

METHODS

This paper explores solutions to these challenges by combining recent technological advances with informatics and data science, focusing on the diagnostic potential of artificial intelligence (AI) in cancer research. AI has historically been applied in medical research with limited success, but recent developments have led to the emergence of large language models (LLMs). These transformer-based generative AI models, trained on vast datasets, offer significant potential for genetic and genomic analyses. However, their effectiveness is constrained by their training on predominantly human-written text rather than comprehensive, structured genetic datasets.

RESULTS

This study reevaluates the capabilities of LLMs, specifically GPT models, in performing supervised prediction tasks using structured gene expression data. By comparing GPT models with traditional machine learning approaches, we assess their effectiveness in predicting cancer subtypes, demonstrating the potential of AI models to analyze real-world genetic data for generating real-world evidence.

摘要

背景

基因数据在各种疾病的诊断和治疗中起着至关重要的作用,这反映出将这些数据整合到临床护理中的紧迫性日益增加。然而,诸如电子健康记录(EHR)的结构、基因检测的保险成本以及基因检测结果的可解释性等重大障碍阻碍了这种整合。

方法

本文通过将近期的技术进步与信息学和数据科学相结合,探索应对这些挑战的解决方案,重点关注人工智能(AI)在癌症研究中的诊断潜力。人工智能在历史上应用于医学研究的成果有限,但最近的发展导致了大语言模型(LLM)的出现。这些基于Transformer的生成式人工智能模型在大量数据集上进行训练,为基因和基因组分析提供了巨大潜力。然而,它们的有效性受到其主要在人类编写的文本而非全面的结构化基因数据集上训练的限制。

结果

本研究重新评估了大语言模型,特别是GPT模型,在使用结构化基因表达数据执行监督预测任务方面的能力。通过将GPT模型与传统机器学习方法进行比较,我们评估了它们在预测癌症亚型方面的有效性,证明了人工智能模型分析真实世界基因数据以生成真实世界证据的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/29e84fdd1e6c/jpm-14-01022-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/ab4658037731/jpm-14-01022-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/78f0682a482c/jpm-14-01022-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/29e84fdd1e6c/jpm-14-01022-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/ab4658037731/jpm-14-01022-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/78f0682a482c/jpm-14-01022-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7db/11508243/29e84fdd1e6c/jpm-14-01022-g003.jpg

相似文献

1
Evaluating Generative AI's Ability to Identify Cancer Subtypes in Publicly Available Structured Genetic Datasets.评估生成式人工智能在公开可用的结构化基因数据集中识别癌症亚型的能力。
J Pers Med. 2024 Sep 25;14(10):1022. doi: 10.3390/jpm14101022.
2
Evaluating the accuracy of a state-of-the-art large language model for prediction of admissions from the emergency room.评估最先进的大型语言模型在预测急诊入院方面的准确性。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1921-1928. doi: 10.1093/jamia/ocae103.
3
Generative AI in healthcare: an implementation science informed translational path on application, integration and governance.生成式人工智能在医疗保健领域的应用、整合和治理:基于实施科学的转化途径。
Implement Sci. 2024 Mar 15;19(1):27. doi: 10.1186/s13012-024-01357-9.
4
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models.利用生成式人工智能辅助学习罕见且复杂的诊断:对流行的大型语言模型的定性研究。
JMIR Med Educ. 2024 Feb 13;10:e51391. doi: 10.2196/51391.
5
Evaluating the Capabilities of Generative AI Tools in Understanding Medical Papers: Qualitative Study.评估生成式人工智能工具理解医学论文的能力:定性研究
JMIR Med Inform. 2024 Sep 4;12:e59258. doi: 10.2196/59258.
6
The role of large language models in medical image processing: a narrative review.大语言模型在医学图像处理中的作用:一项叙述性综述。
Quant Imaging Med Surg. 2024 Jan 3;14(1):1108-1121. doi: 10.21037/qims-23-892. Epub 2023 Nov 23.
7
Artificial Intelligence in Dental Education: Opportunities and Challenges of Large Language Models and Multimodal Foundation Models.人工智能在牙科教育中的应用:大型语言模型和多模态基础模型的机遇与挑战。
JMIR Med Educ. 2024 Sep 27;10:e52346. doi: 10.2196/52346.
8
Assessing the role of GPT-4 in thyroid ultrasound diagnosis and treatment recommendations: enhancing interpretability with a chain of thought approach.评估GPT-4在甲状腺超声诊断及治疗建议中的作用:采用思维链方法提高可解释性
Quant Imaging Med Surg. 2024 Feb 1;14(2):1602-1615. doi: 10.21037/qims-23-1180. Epub 2024 Jan 11.
9
The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review.大型语言模型在变革急诊医学中的作用:范围综述
JMIR Med Inform. 2024 May 10;12:e53787. doi: 10.2196/53787.
10
Large Language Models Versus Expert Clinicians in Crisis Prediction Among Telemental Health Patients: Comparative Study.大语言模型与专家临床医生在远程心理健康患者危机预测中的比较研究。
JMIR Ment Health. 2024 Aug 2;11:e58129. doi: 10.2196/58129.

本文引用的文献

1
A comprehensive evaluation of large language models in mining gene relations and pathway knowledge.大型语言模型在挖掘基因关系和通路知识方面的综合评估。
Quant Biol. 2024 Dec;12(4):360-374. doi: 10.1002/qub2.57. Epub 2024 Jun 21.
2
Large language model based framework for automated extraction of genetic interactions from unstructured data.基于大型语言模型的框架,用于从非结构化数据中自动提取遗传相互作用。
PLoS One. 2024 May 21;19(5):e0303231. doi: 10.1371/journal.pone.0303231. eCollection 2024.
3
Harnessing large language models (LLMs) for candidate gene prioritization and selection.
利用大型语言模型(LLMs)进行候选基因优先级排序和选择。
J Transl Med. 2023 Oct 16;21(1):728. doi: 10.1186/s12967-023-04576-8.
4
Artificial intelligence and machine learning in precision and genomic medicine.人工智能和机器学习在精准医学和基因组医学中的应用。
Med Oncol. 2022 Jun 15;39(8):120. doi: 10.1007/s12032-022-01711-1.
5
A gene expression-based single sample predictor of lung adenocarcinoma molecular subtype and prognosis.基于基因表达的肺腺癌分子亚型和预后的单一样本预测器。
Int J Cancer. 2021 Jan 1;148(1):238-251. doi: 10.1002/ijc.33242. Epub 2020 Aug 12.
6
Decision tree-based classifiers for lung cancer diagnosis and subtyping using TCGA miRNA expression data.基于决策树的分类器用于利用TCGA miRNA表达数据进行肺癌诊断和亚型分类。
Oncol Lett. 2019 Aug;18(2):2125-2131. doi: 10.3892/ol.2019.10462. Epub 2019 Jun 10.
7
Challenges and strategies for implementing genomic services in diverse settings: experiences from the Implementing GeNomics In pracTicE (IGNITE) network.在不同环境中实施基因组服务的挑战与策略:来自“实践中实施基因组学(IGNITE)”网络的经验
BMC Med Genomics. 2017 May 22;10(1):35. doi: 10.1186/s12920-017-0273-2.
8
Molecular genetic testing and the future of clinical genomics.分子遗传学检测与临床基因组学的未来。
Nat Rev Genet. 2013 Jun;14(6):415-26. doi: 10.1038/nrg3493.
9
The coming of age of artificial intelligence in medicine.人工智能在医学领域的成熟发展。
Artif Intell Med. 2009 May;46(1):5-17. doi: 10.1016/j.artmed.2008.07.017. Epub 2008 Sep 13.