• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人增强型大型语言模型驱动的谷胱甘肽过氧化物酶 4 选择作为循环红细胞血液转录生物标志物的候选物。

Human-augmented large language model-driven selection of glutathione peroxidase 4 as a candidate blood transcriptional biomarker for circulating erythroid cells.

机构信息

The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.

Williams College, Williamstown, MA, USA.

出版信息

Sci Rep. 2024 Oct 5;14(1):23225. doi: 10.1038/s41598-024-73916-5.

DOI:10.1038/s41598-024-73916-5
PMID:39369090
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11455862/
Abstract

The identification of optimal candidate genes from large-scale blood transcriptomic data is crucial for developing targeted assays to monitor immune responses. Here, we introduce a novel, optimized large language model (LLM)-based approach for prioritizing candidate biomarkers from blood transcriptional modules. Focusing on module M14.51 from the BloodGen3 repertoire, we implemented a multi-step LLM-driven workflow. Initial high-throughput screening used GPT-4, Claude 3, and Claude 3.5 Sonnet to score and rank the module's constituent genes across six criteria. Top candidates then underwent high-resolution scoring using Consensus GPT, with concurrent manual fact-checking and, when needed, iterative refinement of the scores based on user feedback. Qualitative assessment of literature-based narratives and analysis of reference transcriptome data further refined the selection process. This novel multi-tiered approach consistently identified Glutathione Peroxidase 4 (GPX4) as the top candidate gene for module M14.51. GPX4's role in oxidative stress regulation, its potential as a future drug target, and its expression pattern across diverse cell types supported its selection. The incorporation of reference transcriptome data further validated GPX4 as the most suitable candidate for this module. This study presents an advanced LLM-driven workflow with a novel optimized scoring strategy for candidate gene prioritization, incorporating human-in-the-loop augmentation. The approach identified GPX4 as a key gene in the erythroid cell-associated module M14.51, suggesting its potential utility for biomarker discovery and targeted assay development. By combining AI-driven literature analysis with iterative human expert validation, this method leverages the strengths of both artificial and human intelligence, potentially contributing to the development of biologically relevant and clinically informative targeted assays. Further validation studies are needed to confirm the broader applicability of this human-augmented AI approach.

摘要

从大规模血液转录组数据中鉴定出最佳候选基因对于开发靶向检测来监测免疫反应至关重要。在这里,我们介绍了一种新颖的、优化的基于大型语言模型(LLM)的方法,用于从血液转录模块中优先考虑候选生物标志物。我们专注于 BloodGen3 库中的模块 M14.51,实施了一个多步骤的 LLM 驱动的工作流程。最初的高通量筛选使用 GPT-4、Claude 3 和 Claude 3.5 Sonnet 对模块的组成基因进行评分和排名,共使用了六个标准。然后,使用 Consensus GPT 对顶级候选基因进行高分辨率评分,同时进行手动事实检查,并且在需要时根据用户反馈迭代地调整评分。基于文献的叙述的定性评估和参考转录组数据的分析进一步完善了选择过程。这种新颖的多层次方法一致地将谷胱甘肽过氧化物酶 4(GPX4)鉴定为模块 M14.51 的顶级候选基因。GPX4 在氧化应激调节中的作用、作为未来药物靶点的潜力以及在各种细胞类型中的表达模式支持了它的选择。参考转录组数据的纳入进一步验证了 GPX4 是该模块最适合的候选基因。本研究提出了一种先进的基于 LLM 的工作流程,具有新颖的优化评分策略,用于候选基因优先级排序,并结合了人工增强。该方法确定了 GPX4 是与红细胞相关的模块 M14.51 中的关键基因,表明其在生物标志物发现和靶向检测开发方面具有潜在的应用价值。通过将人工智能驱动的文献分析与迭代的人工专家验证相结合,该方法利用了人工智能和人类智能的优势,可能有助于开发具有生物学相关性和临床信息的靶向检测。需要进一步的验证研究来确认这种人工增强的人工智能方法的更广泛适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/0887fe89669a/41598_2024_73916_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/e3c7739907d5/41598_2024_73916_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/1d1aec4212a1/41598_2024_73916_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/14a2eeadb395/41598_2024_73916_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/0887fe89669a/41598_2024_73916_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/e3c7739907d5/41598_2024_73916_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/1d1aec4212a1/41598_2024_73916_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/14a2eeadb395/41598_2024_73916_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d3c/11455862/0887fe89669a/41598_2024_73916_Fig4_HTML.jpg

相似文献

1
Human-augmented large language model-driven selection of glutathione peroxidase 4 as a candidate blood transcriptional biomarker for circulating erythroid cells.人增强型大型语言模型驱动的谷胱甘肽过氧化物酶 4 选择作为循环红细胞血液转录生物标志物的候选物。
Sci Rep. 2024 Oct 5;14(1):23225. doi: 10.1038/s41598-024-73916-5.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Quality improvement strategies for diabetes care: Effects on outcomes for adults living with diabetes.糖尿病护理质量改进策略:对成年糖尿病患者结局的影响。
Cochrane Database Syst Rev. 2023 May 31;5(5):CD014513. doi: 10.1002/14651858.CD014513.
4
AI-based Hepatic Steatosis Detection and Integrated Hepatic Assessment from Cardiac CT Attenuation Scans Enhances All-cause Mortality Risk Stratification: A Multi-center Study.基于人工智能的心脏CT衰减扫描检测肝脂肪变性及综合肝脏评估可增强全因死亡风险分层:一项多中心研究
medRxiv. 2025 Jun 11:2025.06.09.25329157. doi: 10.1101/2025.06.09.25329157.
5
Development of a GPT-4-Powered Virtual Simulated Patient and Communication Training Platform for Medical Students to Practice Discussing Abnormal Mammogram Results With Patients: Multiphase Study.开发一个由GPT-4驱动的虚拟模拟患者和沟通训练平台,供医学生练习与患者讨论异常乳房X光检查结果:多阶段研究。
JMIR Form Res. 2025 Apr 17;9:e65670. doi: 10.2196/65670.
6
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
7
Precision medicine for mood disorders: objective assessment, risk prediction, pharmacogenomics, and repurposed drugs.心境障碍的精准医学:客观评估、风险预测、药物基因组学和再利用药物。
Mol Psychiatry. 2021 Jul;26(7):2776-2804. doi: 10.1038/s41380-021-01061-w. Epub 2021 Apr 8.
8
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
9
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.
10
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.

本文引用的文献

1
ST3 beta-galactoside alpha-2,3-sialyltransferase 1 (ST3Gal1) synthesis of Siglec ligands mediates anti-tumour immunity in prostate cancer.ST3 半乳糖苷 α-2,3-唾液酸转移酶 1(ST3Gal1)合成 Siglec 配体介导前列腺癌中的抗肿瘤免疫。
Commun Biol. 2024 Mar 6;7(1):276. doi: 10.1038/s42003-024-05924-0.
2
Design of a targeted blood transcriptional panel for monitoring immunological changes accompanying pregnancy.用于监测伴随妊娠发生的免疫变化的靶向血液转录组学检测面板的设计。
Front Immunol. 2024 Jan 30;15:1319949. doi: 10.3389/fimmu.2024.1319949. eCollection 2024.
3
Harnessing large language models (LLMs) for candidate gene prioritization and selection.
利用大型语言模型(LLMs)进行候选基因优先级排序和选择。
J Transl Med. 2023 Oct 16;21(1):728. doi: 10.1186/s12967-023-04576-8.
4
Glutathione Peroxidase to Genes Expression in Experimental Brain Tumors Reveals Gender-Dependent Patterns.谷胱甘肽过氧化物酶对实验性脑肿瘤基因表达的影响揭示了性别依赖性模式。
Genes (Basel). 2023 Aug 24;14(9):1674. doi: 10.3390/genes14091674.
5
High-temporal resolution profiling reveals distinct immune trajectories following the first and second doses of COVID-19 mRNA vaccines.高时间分辨率分析揭示了 COVID-19 mRNA 疫苗接种第一针和第二针后的不同免疫轨迹。
Sci Adv. 2022 Nov 11;8(45):eabp9961. doi: 10.1126/sciadv.abp9961.
6
The Selenoprotein Glutathione Peroxidase 4: From Molecular Mechanisms to Novel Therapeutic Opportunities.硒蛋白谷胱甘肽过氧化物酶4:从分子机制到新的治疗机会
Biomedicines. 2022 Apr 13;10(4):891. doi: 10.3390/biomedicines10040891.
7
Predictive and prognostic impact of ferroptosis-related genes ACSL4 and GPX4 on breast cancer treated with neoadjuvant chemotherapy.铁死亡相关基因 ACSL4 和 GPX4 对新辅助化疗治疗乳腺癌的预测和预后影响。
EBioMedicine. 2021 Sep;71:103560. doi: 10.1016/j.ebiom.2021.103560. Epub 2021 Sep 2.
8
Expression Silencing of Glutathione Peroxidase 4 in Mouse Erythroleukemia Cells Delays In Vitro Erythropoiesis.谷胱甘肽过氧化物酶 4 在小鼠红白血病细胞中的表达沉默可延迟体外红细胞生成。
Int J Mol Sci. 2021 Jul 21;22(15):7795. doi: 10.3390/ijms22157795.
9
Development of a fixed module repertoire for the analysis and interpretation of blood transcriptome data.建立固定模块库,用于分析和解释血液转录组数据。
Nat Commun. 2021 Jul 19;12(1):4385. doi: 10.1038/s41467-021-24584-w.
10
The glutathione peroxidase Gpx4 prevents lipid peroxidation and ferroptosis to sustain Treg cell activation and suppression of antitumor immunity.谷胱甘肽过氧化物酶 Gpx4 可防止脂质过氧化和铁死亡,从而维持 Treg 细胞的激活和抑制抗肿瘤免疫。
Cell Rep. 2021 Jun 15;35(11):109235. doi: 10.1016/j.celrep.2021.109235.