• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DrugBERT:一种基于BERT的方法,集成LDA主题嵌入和疗效感知机制以预测抗肿瘤药物疗效。

DrugBERT: a BERT-based approach integrating LDA topic embedding and efficacy-aware mechanism for predicting anti-tumor drug efficacy.

作者信息

Zhu Weiwei, Jiang Xiaodong, Zhang Lei, Zhou Peng, Xie Xinping, Wang Hongqiang

机构信息

University of Science and Technology of China, Hefei, Anhui, 230026, China.

Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, 230031, China.

出版信息

J Transl Med. 2025 Aug 5;23(1):864. doi: 10.1186/s12967-025-06795-7.

DOI:10.1186/s12967-025-06795-7
PMID:40764962
Abstract

BACKGROUND

Due to the complexity of tumor genetic heterogeneity, personalized medicine has progressively emerged as the central focus of cancer research. However, how to accurately predict the drug response of patients before receiving treatment is the critical challenge to the development of this field.

METHODS

This paper proposes DrugBERT, a BERT-based framework integrated with LDA topic embedding and a drug efficacy-aware mechanism for predicting the efficacy of antitumor drugs. The method incorporates LDA-generated topic embedding as a semantic enhancement module into the BERT language model and introduces a drug efficacy-aware attention mechanism to prioritize drug efficacy-related semantic features. The model is via LSTM to capture long-range dependencies in clinical text data. In addition, the SMOTE algorithm is used to synthesize samples of the minority class to solve the problem of data imbalance.

RESULTS

The proposed method DrugBERT demonstrated remarkable performance on a dataset of 958 patients with non-small cell cancer treated with antitumor drugs. Furthermore, when validated on an independent dataset of 266 bowel cancer patients, the model achieved a 3% improvement in AUC over previous methods, signifying its robust generalization capability.

CONCLUSIONS

DrugBERT can help predict the efficacy of antitumor drugs based on clinical text while exhibiting strong generalization capability. These findings highlight its potential for optimizing personalized therapeutic strategies through language model.

摘要

背景

由于肿瘤基因异质性的复杂性,个性化医疗已逐渐成为癌症研究的核心焦点。然而,如何在患者接受治疗前准确预测其药物反应是该领域发展的关键挑战。

方法

本文提出了DrugBERT,这是一个基于BERT的框架,集成了LDA主题嵌入和药物疗效感知机制,用于预测抗肿瘤药物的疗效。该方法将LDA生成的主题嵌入作为语义增强模块纳入BERT语言模型,并引入药物疗效感知注意力机制,以优先处理与药物疗效相关的语义特征。该模型通过LSTM来捕捉临床文本数据中的长程依赖关系。此外,使用SMOTE算法合成少数类样本以解决数据不平衡问题。

结果

所提出的DrugBERT方法在958例接受抗肿瘤药物治疗的非小细胞癌患者数据集上表现出显著性能。此外,在266例肠癌患者的独立数据集上进行验证时,该模型的AUC比以前的方法提高了3%,表明其具有强大的泛化能力。

结论

DrugBERT可以基于临床文本帮助预测抗肿瘤药物的疗效,同时表现出强大的泛化能力。这些发现突出了其通过语言模型优化个性化治疗策略的潜力。

相似文献

1
DrugBERT: a BERT-based approach integrating LDA topic embedding and efficacy-aware mechanism for predicting anti-tumor drug efficacy.DrugBERT:一种基于BERT的方法,集成LDA主题嵌入和疗效感知机制以预测抗肿瘤药物疗效。
J Transl Med. 2025 Aug 5;23(1):864. doi: 10.1186/s12967-025-06795-7.
2
Predicting Drug-Side Effect Relationships From Parametric Knowledge Embedded in Biomedical BERT Models: Methodological Study With a Natural Language Processing Approach.从生物医学BERT模型中嵌入的参数知识预测药物副作用关系:一种自然语言处理方法的方法学研究
JMIR Med Inform. 2025 Jul 10;13:e67513. doi: 10.2196/67513.
3
Short-Term Memory Impairment短期记忆障碍
4
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
5
Detecting Redundant Health Survey Questions by Using Language-Agnostic Bidirectional Encoder Representations From Transformers Sentence Embedding: Algorithm Development Study.使用来自Transformer句子嵌入的语言无关双向编码器表示法检测冗余健康调查问题:算法开发研究
JMIR Med Inform. 2025 Jun 10;13:e71687. doi: 10.2196/71687.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
8
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
9
Knowledge Graph-Enhanced Deep Learning Model (H-SYSTEM) for Hypertensive Intracerebral Hemorrhage: Model Development and Validation.用于高血压性脑出血的知识图谱增强深度学习模型(H-SYSTEM):模型开发与验证
J Med Internet Res. 2025 Jun 12;27:e66055. doi: 10.2196/66055.
10
The effectiveness of therapeutic patient education on adherence to oral anti-cancer medicines in adult cancer patients in ambulatory care settings: a systematic review.门诊护理环境中成人癌症患者接受治疗性患者教育对口服抗癌药物依从性的有效性:一项系统综述
JBI Database System Rev Implement Rep. 2015 Jun 12;13(5):244-92. doi: 10.11124/jbisrir-2015-2057.

本文引用的文献

1
A systematic review of large language model (LLM) evaluations in clinical medicine.对临床医学中大型语言模型(LLM)评估的系统综述。
BMC Med Inform Decis Mak. 2025 Mar 7;25(1):117. doi: 10.1186/s12911-025-02954-4.
2
DrBioRight 2.0: an LLM-powered bioinformatics chatbot for large-scale cancer functional proteomics analysis.DrBioRight 2.0:一款由大型语言模型驱动的用于大规模癌症功能蛋白质组学分析的生物信息学聊天机器人。
Nat Commun. 2025 Mar 6;16(1):2256. doi: 10.1038/s41467-025-57430-4.
3
Improving large language model applications in biomedicine with retrieval-augmented generation: a systematic review, meta-analysis, and clinical development guidelines.
利用检索增强生成改进生物医学中的大语言模型应用:一项系统综述、荟萃分析和临床开发指南
J Am Med Inform Assoc. 2025 Apr 1;32(4):605-615. doi: 10.1093/jamia/ocaf008.
4
ANI-1ccx-gelu Universal Interatomic Potential and Its Fine-Tuning: Toward Accurate and Efficient Anharmonic Vibrational Frequencies.ANI-1ccx-凝胶通用原子间势及其微调:迈向精确高效的非谐振动频率
J Phys Chem Lett. 2025 Jan 16;16(2):483-493. doi: 10.1021/acs.jpclett.4c03031. Epub 2025 Jan 2.
5
Hierarchical graph representation learning with multi-granularity features for anti-cancer drug response prediction.用于抗癌药物反应预测的具有多粒度特征的层次图表示学习
IEEE J Biomed Health Inform. 2024 Nov 6;PP. doi: 10.1109/JBHI.2024.3492806.
6
A method combining LDA and neural networks for antitumor drug efficacy prediction.一种结合LDA和神经网络的抗肿瘤药物疗效预测方法。
Digit Health. 2024 Sep 9;10:20552076241280103. doi: 10.1177/20552076241280103. eCollection 2024 Jan-Dec.
7
GCN-Based LSTM Autoencoder with Self-Attention for Bearing Fault Diagnosis.基于图卷积网络的带自注意力机制的长短期记忆自动编码器用于轴承故障诊断
Sensors (Basel). 2024 Jul 26;24(15):4855. doi: 10.3390/s24154855.
8
Sigmoid function model of parallel-connected DC-DC converters and analysis of their dynamic characteristics.并联DC-DC变换器的Sigmoid函数模型及其动态特性分析
Chaos. 2024 Jul 1;34(7). doi: 10.1063/5.0201373.
9
Personalized anti-tumor drug efficacy prediction based on clinical data.基于临床数据的个性化抗肿瘤药物疗效预测
Heliyon. 2024 Mar 4;10(6):e27300. doi: 10.1016/j.heliyon.2024.e27300. eCollection 2024 Mar 30.
10
LSTM algorithm optimization for COVID-19 prediction model.用于新冠病毒疾病预测模型的长短期记忆算法优化
Heliyon. 2024 Feb 16;10(4):e26158. doi: 10.1016/j.heliyon.2024.e26158. eCollection 2024 Feb 29.