• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因智能体:使用领域数据库进行基因集分析的自我验证语言智能体。

GeneAgent: self-verification language agent for gene-set analysis using domain databases.

作者信息

Wang Zhizheng, Jin Qiao, Wei Chih-Hsuan, Tian Shubo, Lai Po-Ting, Zhu Qingqing, Day Chi-Ping, Ross Christina, Leaman Robert, Lu Zhiyong

机构信息

Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.

出版信息

Nat Methods. 2025 Jul 28. doi: 10.1038/s41592-025-02748-6.

DOI:10.1038/s41592-025-02748-6
PMID:40721871
Abstract

Gene-set analysis seeks to identify the biological mechanisms underlying groups of genes with shared functions. Large language models (LLMs) have recently shown promise in generating functional descriptions for input gene sets but may produce factually incorrect statements, commonly referred to as hallucinations in LLMs. Here we present GeneAgent, an LLM-based AI agent for gene-set analysis that reduces hallucinations by autonomously interacting with biological databases to verify its own output. Evaluation of 1,106 gene sets collected from different sources demonstrates that GeneAgent is consistently more accurate than GPT-4 by a significant margin. We further applied GeneAgent to seven novel gene sets derived from mouse B2905 melanoma cell lines. Expert review confirmed that GeneAgent produces more relevant and comprehensive functional descriptions than GPT-4, providing valuable insights into gene functions and expediting knowledge discovery.

摘要

基因集分析旨在识别具有共享功能的基因群体背后的生物学机制。大型语言模型(LLMs)最近在为输入基因集生成功能描述方面显示出了前景,但可能会产生事实上错误的陈述,这在大型语言模型中通常被称为幻觉。在此,我们展示了GeneAgent,一种基于大型语言模型的用于基因集分析的人工智能代理,它通过与生物数据库自主交互来验证自身输出,从而减少幻觉。对从不同来源收集的1106个基因集的评估表明,GeneAgent始终比GPT-4准确得多。我们进一步将GeneAgent应用于从小鼠B2905黑色素瘤细胞系衍生的七个新基因集。专家评审证实,GeneAgent比GPT-4产生更相关和全面的功能描述,为基因功能提供了有价值的见解,并加快了知识发现。

相似文献

1
GeneAgent: self-verification language agent for gene-set analysis using domain databases.基因智能体:使用领域数据库进行基因集分析的自我验证语言智能体。
Nat Methods. 2025 Jul 28. doi: 10.1038/s41592-025-02748-6.
2
GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases.基因智能体:使用领域数据库进行基因集知识发现的自我验证语言智能体
ArXiv. 2024 May 25:arXiv:2405.16205v1.
3
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
4
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
5
Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.通过在出院小结中添加重点内容提高大语言模型的总结准确性:比较评估
JMIR Med Inform. 2025 Jul 24;13:e66476. doi: 10.2196/66476.
6
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
7
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
8
Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study.使用标准化多项选择题评估大型语言模型在精神病学中的准确性和可靠性:横断面研究
J Med Internet Res. 2025 May 20;27:e69910. doi: 10.2196/69910.
9
Artificial intelligence-simplified information to advance reproductive genetic literacy and health equity.人工智能简化信息以促进生殖遗传知识普及和健康公平。
Hum Reprod. 2025 Jul 22. doi: 10.1093/humrep/deaf135.
10
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.

引用本文的文献

1
Knowledge-guided Contextual Gene Set Analysis Using Large Language Models.使用大语言模型进行知识引导的上下文基因集分析
ArXiv. 2025 Jun 4:arXiv:2506.04303v1.
2
Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models.利用大语言模型对基因型数据进行知识驱动的特征选择与工程设计
ArXiv. 2025 Apr 16:arXiv:2410.01795v2.

本文引用的文献

1
Stochastic modeling of single-cell gene expression adaptation reveals non-genomic contribution to evolution of tumor subclones.单细胞基因表达适应性的随机建模揭示了肿瘤亚克隆进化的非基因组贡献。
Cell Syst. 2025 Jan 15;16(1):101156. doi: 10.1016/j.cels.2024.11.013. Epub 2024 Dec 18.
2
Evaluation of large language models for discovery of gene set function.用于发现基因集功能的大语言模型评估
Nat Methods. 2025 Jan;22(1):82-91. doi: 10.1038/s41592-024-02525-x. Epub 2024 Nov 28.
3
Integration of background knowledge for automatic detection of inconsistencies in gene ontology annotation.
背景知识的整合用于自动检测基因本体论注释中的不一致性。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i390-i400. doi: 10.1093/bioinformatics/btae246.
4
GeneGPT: augmenting large language models with domain tools for improved access to biomedical information.GeneGPT:利用领域工具增强大型语言模型,以改善对生物医学信息的访问。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae075.
5
Opportunities and challenges for ChatGPT and large language models in biomedicine and health.ChatGPT 和大型语言模型在生物医学和健康领域的机遇与挑战。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad493.
6
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2024 Jan 5;52(D1):D33-D43. doi: 10.1093/nar/gkad1044.
7
MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval.MedCPT:利用大规模 PubMed 检索日志进行零样本生物医学信息检索的对比预训练 Transformer。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad651.
8
A Domain-Specific Next-Generation Large Language Model (LLM) or ChatGPT is Required for Biomedical Engineering and Research.生物医学工程与研究需要特定领域的下一代大语言模型(LLM)或ChatGPT。
Ann Biomed Eng. 2024 Mar;52(3):451-454. doi: 10.1007/s10439-023-03306-x. Epub 2023 Jul 10.
9
g:Profiler-interoperable web service for functional enrichment analysis and gene identifier mapping (2023 update).用于功能富集分析和基因标识符映射的可互操作网络服务(2023 更新)。
Nucleic Acids Res. 2023 Jul 5;51(W1):W207-W212. doi: 10.1093/nar/gkad347.
10
CORUM: the comprehensive resource of mammalian protein complexes-2022.CORUM:哺乳动物蛋白质复合物综合资源-2022 年版。
Nucleic Acids Res. 2023 Jan 6;51(D1):D539-D545. doi: 10.1093/nar/gkac1015.