• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自然语言处理深度学习模型探索复发性脑胶质瘤的生物医学知识。

Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models.

机构信息

Department of Radiation Oncology, Seoul National University Hospital, Seoul, Korea.

Department of Radiation Oncology, College of Medicine, Seoul National University, Seoul, Korea.

出版信息

BMC Med Inform Decis Mak. 2022 Oct 13;22(1):267. doi: 10.1186/s12911-022-02003-4.

DOI:10.1186/s12911-022-02003-4
PMID:36229835
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9559267/
Abstract

BACKGROUND

Efficient exploration of knowledge for the treatment of recurrent glioblastoma (GBM) is critical for both clinicians and researchers. However, due to the large number of clinical trials and published articles, searching for this knowledge is very labor-intensive. In the current study, using natural language processing (NLP), we analyzed medical research corpora related to recurrent glioblastoma to find potential targets and treatments.

METHODS

We fine-tuned the 'SAPBERT', which was pretrained on biomedical ontologies, to perform question/answering (QA) and name entity recognition (NER) tasks for medical corpora. The model was fine-tuned with the SQUAD2 dataset and multiple NER datasets designed for QA task and NER task, respectively. Corpora were collected by searching the terms "recurrent glioblastoma" and "drug target", published from 2000 to 2020 in the Web of science (N = 288 articles). Also, clinical trial corpora were collected from 'clinicaltrial.gov' using the searching term of 'recurrent glioblastoma" (N = 587 studies).

RESULTS

For the QA task, the model showed an F1 score of 0.79. For the NER task, the model showed F1 scores of 0.90 and 0.76 for drug and gene name recognition, respectively. When asked what the molecular targets were promising for recurrent glioblastoma, the model answered that RTK inhibitors or LPA-1 antagonists were promising. From collected clinical trials, the model summarized them in the order of bevacizumab, temozolomide, lomustine, and nivolumab. Based on published articles, the model found the many drug-gene pairs with the NER task, and we presented them with a circus plot and related summarization ( https://github.com/bigwiz83/NLP_rGBM ).

CONCLUSION

Using NLP deep learning models, we could explore potential targets and treatments based on medical research and clinical trial corpora. The knowledge found by the models may be used for treating recurrent glioblastoma.

摘要

背景

高效探索复发性胶质母细胞瘤(GBM)的治疗知识,对临床医生和研究人员都至关重要。然而,由于临床试验和已发表文章数量众多,因此搜索这些知识非常耗费人力。在本研究中,我们使用自然语言处理(NLP)技术,分析与复发性 GBM 相关的医学研究文献,以寻找潜在的治疗靶点和治疗方法。

方法

我们对在生物医学本体上进行预训练的 'SAPBERT' 进行微调,以对医学文献进行问答(QA)和命名实体识别(NER)任务。该模型使用 SQUAD2 数据集和分别为 QA 任务和 NER 任务设计的多个 NER 数据集进行微调。通过在 Web of Science 中搜索“复发性胶质母细胞瘤”和“药物靶点”等术语,从 2000 年至 2020 年收集文献(N=288 篇文章)。此外,还从“clinicaltrial.gov”中使用“复发性胶质母细胞瘤”的搜索词收集临床试验文献(N=587 项研究)。

结果

对于 QA 任务,该模型的 F1 得分为 0.79。对于 NER 任务,该模型在药物和基因名称识别方面的 F1 得分分别为 0.90 和 0.76。当被问及哪些分子靶点对复发性 GBM 有治疗前景时,模型回答称 RTK 抑制剂或 LPA-1 拮抗剂有治疗前景。从收集到的临床试验中,模型按照贝伐单抗、替莫唑胺、洛莫司汀和纳武单抗的顺序对它们进行了总结。基于已发表的文章,该模型使用 NER 任务找到了许多药物-基因对,我们使用 circus plot 并提供了相关总结(https://github.com/bigwiz83/NLP_rGBM)。

结论

我们可以使用 NLP 深度学习模型,基于医学研究和临床试验文献探索潜在的治疗靶点和治疗方法。模型发现的知识可能用于治疗复发性 GBM。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/952f8ef6025a/12911_2022_2003_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/a95fb7cdd484/12911_2022_2003_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/933ec423c3ed/12911_2022_2003_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/952f8ef6025a/12911_2022_2003_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/a95fb7cdd484/12911_2022_2003_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/933ec423c3ed/12911_2022_2003_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ae/9563438/952f8ef6025a/12911_2022_2003_Fig3_HTML.jpg

相似文献

1
Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models.利用自然语言处理深度学习模型探索复发性脑胶质瘤的生物医学知识。
BMC Med Inform Decis Mak. 2022 Oct 13;22(1):267. doi: 10.1186/s12911-022-02003-4.
2
Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.评估医疗保健中的实体识别:实体模型定量研究。
JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.
3
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4
Transformers-sklearn: a toolkit for medical language understanding with transformer-based models.Transformer-sklearn:一个基于 Transformer 的模型的医学语言理解工具包。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):90. doi: 10.1186/s12911-021-01459-0.
5
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
6
A comparative study of pre-trained language models for named entity recognition in clinical trial eligibility criteria from multiple corpora.基于多语料库的临床试验资格标准中命名实体识别的预训练语言模型的比较研究。
BMC Med Inform Decis Mak. 2022 Sep 6;22(Suppl 3):235. doi: 10.1186/s12911-022-01967-7.
7
A Combined Manual Annotation and Deep-Learning Natural Language Processing Study on Accurate Entity Extraction in Hereditary Disease Related Biomedical Literature.一种结合手动标注和深度学习自然语言处理的遗传性疾病相关生物医学文献中精确实体抽取方法的研究。
Interdiscip Sci. 2024 Jun;16(2):333-344. doi: 10.1007/s12539-024-00605-2. Epub 2024 Feb 10.
8
Integrating deep learning architectures for enhanced biomedical relation extraction: a pipeline approach.深度学习架构在增强生物医学关系抽取中的应用:一种流水线方法。
Database (Oxford). 2024 Aug 28;2024. doi: 10.1093/database/baae079.
9
Advancing entity recognition in biomedicine via instruction tuning of large language models.通过指令调整大型语言模型推进生物医学中的实体识别。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae163.
10
Clinical Named Entity Recognition Using Deep Learning Models.使用深度学习模型的临床命名实体识别
AMIA Annu Symp Proc. 2018 Apr 16;2017:1812-1819. eCollection 2017.

引用本文的文献

1
Artificial Intelligence-Assisted Drug and Biomarker Discovery for Glioblastoma: A Scoping Review of the Literature.人工智能辅助胶质母细胞瘤的药物和生物标志物发现:文献综述
Cancers (Basel). 2025 Feb 7;17(4):571. doi: 10.3390/cancers17040571.

本文引用的文献

1
Lysophosphatidic Acid Receptor Antagonists and Cancer: The Current Trends, Clinical Implications, and Trials.溶血磷脂酸受体拮抗剂与癌症:当前趋势、临床意义和试验。
Cells. 2021 Jun 29;10(7):1629. doi: 10.3390/cells10071629.
2
Automated Identification of Patients With Immune-Related Adverse Events From Clinical Notes Using Word Embedding and Machine Learning.使用词嵌入和机器学习技术从临床记录中自动识别免疫相关不良事件患者
JCO Clin Cancer Inform. 2021 May;5:541-549. doi: 10.1200/CCI.20.00109.
3
Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.
自动化自然语言处理提取乳腺癌治疗停药的临床理由。
JCO Clin Cancer Inform. 2021 May;5:550-560. doi: 10.1200/CCI.20.00139.
4
COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization.基于深度学习的语义搜索、问答和摘要生成技术进行的COVID-19信息检索
NPJ Digit Med. 2021 Apr 12;4(1):68. doi: 10.1038/s41746-021-00437-0.
5
Clinical Natural Language Processing for Radiation Oncology: A Review and Practical Primer.临床自然语言处理在放射肿瘤学中的应用:综述与实用入门
Int J Radiat Oncol Biol Phys. 2021 Jul 1;110(3):641-655. doi: 10.1016/j.ijrobp.2021.01.044. Epub 2021 Feb 3.
6
Hypofractionated stereotactic re-irradiation with pembrolizumab and bevacizumab in patients with recurrent high-grade gliomas: results from a phase I study.帕博利珠单抗和贝伐单抗用于复发性高级别胶质瘤患者的低分割立体定向再照射:一项I期研究的结果
Neuro Oncol. 2021 Apr 12;23(4):677-686. doi: 10.1093/neuonc/noaa260.
7
Microglial lysophosphatidic acid promotes glioblastoma proliferation and migration via LPA receptor.小胶质细胞溶血磷脂酸通过 LPA 受体促进神经胶质瘤增殖和迁移。
J Neurochem. 2021 Feb;156(4):499-512. doi: 10.1111/jnc.15097. Epub 2020 Jun 20.
8
Effect of Nivolumab vs Bevacizumab in Patients With Recurrent Glioblastoma: The CheckMate 143 Phase 3 Randomized Clinical Trial.纳武利尤单抗对比贝伐珠单抗治疗复发性胶质母细胞瘤患者的效果:CheckMate 143 期随机临床试验。
JAMA Oncol. 2020 Jul 1;6(7):1003-1010. doi: 10.1001/jamaoncol.2020.1024.
9
Glioblastoma in adults: a Society for Neuro-Oncology (SNO) and European Society of Neuro-Oncology (EANO) consensus review on current management and future directions.成人脑胶质母细胞瘤:神经肿瘤学会(SNO)和欧洲神经肿瘤学会(EANO)关于当前管理和未来方向的共识综述。
Neuro Oncol. 2020 Aug 17;22(8):1073-1113. doi: 10.1093/neuonc/noaa106.
10
Deep learning in clinical natural language processing: a methodical review.深度学习在临床自然语言处理中的应用:系统综述。
J Am Med Inform Assoc. 2020 Mar 1;27(3):457-470. doi: 10.1093/jamia/ocz200.