• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

构建精准医学的药物基因组学知识模型:黑色素瘤案例研究

Building a Pharmacogenomics Knowledge Model Toward Precision Medicine: Case Study in Melanoma.

作者信息

Kang Hongyu, Li Jiao, Wu Meng, Shen Liu, Hou Li

机构信息

Institute of Medical Information &Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China.

Department of Biomedical Engineering, School of Life Science, Beijing Institute of Technology, Beijing, China.

出版信息

JMIR Med Inform. 2020 Oct 21;8(10):e20291. doi: 10.2196/20291.

DOI:10.2196/20291
PMID:33084582
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7641779/
Abstract

BACKGROUND

Many drugs do not work the same way for everyone owing to distinctions in their genes. Pharmacogenomics (PGx) aims to understand how genetic variants influence drug efficacy and toxicity. It is often considered one of the most actionable areas of the personalized medicine paradigm. However, little prior work has included in-depth explorations and descriptions of drug usage, dosage adjustment, and so on.

OBJECTIVE

We present a pharmacogenomics knowledge model to discover the hidden relationships between PGx entities such as drugs, genes, and diseases, especially details in precise medication.

METHODS

PGx open data such as DrugBank and RxNorm were integrated in this study, as well as drug labels published by the US Food and Drug Administration. We annotated 190 drug labels manually for entities and relationships. Based on the annotation results, we trained 3 different natural language processing models to complete entity recognition. Finally, the pharmacogenomics knowledge model was described in detail.

RESULTS

In entity recognition tasks, the Bidirectional Encoder Representations from Transformers-conditional random field model achieved better performance with micro-F1 score of 85.12%. The pharmacogenomics knowledge model in our study included 5 semantic types: drug, gene, disease, precise medication (population, daily dose, dose form, frequency, etc), and adverse reaction. Meanwhile, 26 semantic relationships were defined in detail. Taking melanoma caused by a BRAF gene mutation into consideration, the pharmacogenomics knowledge model covered 7 related drugs and 4846 triples were established in this case. All the corpora, relationship definitions, and triples were made publically available.

CONCLUSIONS

We highlighted the pharmacogenomics knowledge model as a scalable framework for clinicians and clinical pharmacists to adjust drug dosage according to patient-specific genetic variation, and for pharmaceutical researchers to develop new drugs. In the future, a series of other antitumor drugs and automatic relation extractions will be taken into consideration to further enhance our framework with more PGx linked data.

摘要

背景

由于基因差异,许多药物对每个人的作用方式并不相同。药物基因组学(PGx)旨在了解基因变异如何影响药物疗效和毒性。它通常被认为是个性化医疗范式中最具可操作性的领域之一。然而,此前很少有研究对药物使用、剂量调整等进行深入探索和描述。

目的

我们提出一种药物基因组学知识模型,以发现药物、基因和疾病等PGx实体之间隐藏的关系,特别是精准用药方面的细节。

方法

本研究整合了DrugBank和RxNorm等PGx开放数据,以及美国食品药品监督管理局发布的药品标签。我们手动为190个药品标签标注了实体和关系。基于标注结果,我们训练了3种不同的自然语言处理模型来完成实体识别。最后,详细描述了药物基因组学知识模型。

结果

在实体识别任务中,基于变换器的双向编码器表征-条件随机场模型表现更佳,微F1分数达到85.12%。我们研究中的药物基因组学知识模型包括5种语义类型:药物、基因、疾病、精准用药(人群、每日剂量、剂型、用药频率等)和不良反应。同时,详细定义了26种语义关系。以BRAF基因突变引起的黑色素瘤为例,药物基因组学知识模型涵盖了7种相关药物,并在此案例中建立了4846个三元组。所有语料库、关系定义和三元组均已公开。

结论

我们强调药物基因组学知识模型是一个可扩展的框架,可供临床医生和临床药剂师根据患者特定的基因变异调整药物剂量,也可供药物研究人员开发新药。未来,将考虑一系列其他抗肿瘤药物和自动关系提取,以利用更多PGx链接数据进一步增强我们的框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/44e2280d660d/medinform_v8i10e20291_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/a9548a8d3894/medinform_v8i10e20291_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/8bfbfb852731/medinform_v8i10e20291_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/f4ddecb9b5b9/medinform_v8i10e20291_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/090bb2be8e81/medinform_v8i10e20291_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/2e326b00322f/medinform_v8i10e20291_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/2dc0ae7ba01f/medinform_v8i10e20291_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/44e2280d660d/medinform_v8i10e20291_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/a9548a8d3894/medinform_v8i10e20291_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/8bfbfb852731/medinform_v8i10e20291_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/f4ddecb9b5b9/medinform_v8i10e20291_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/090bb2be8e81/medinform_v8i10e20291_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/2e326b00322f/medinform_v8i10e20291_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/2dc0ae7ba01f/medinform_v8i10e20291_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9406/7641779/44e2280d660d/medinform_v8i10e20291_fig7.jpg

相似文献

1
Building a Pharmacogenomics Knowledge Model Toward Precision Medicine: Case Study in Melanoma.构建精准医学的药物基因组学知识模型:黑色素瘤案例研究
JMIR Med Inform. 2020 Oct 21;8(10):e20291. doi: 10.2196/20291.
2
A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation.基于 Transformer 的双向编码器表示模型的精细调整在食品命名实体识别中的应用:算法开发与验证。
J Med Internet Res. 2021 Aug 9;23(8):e28229. doi: 10.2196/28229.
3
Automatic knowledge extraction from Chinese electronic medical records and rheumatoid arthritis knowledge graph construction.从中国电子病历中自动提取知识并构建类风湿性关节炎知识图谱。
Quant Imaging Med Surg. 2023 Jun 1;13(6):3873-3890. doi: 10.21037/qims-22-1158. Epub 2023 May 8.
4
Adversarial active learning for the identification of medical concepts and annotation inconsistency.对抗式主动学习在医学概念识别和标注不一致性中的应用。
J Biomed Inform. 2020 Aug;108:103481. doi: 10.1016/j.jbi.2020.103481. Epub 2020 Jul 18.
5
A semi-supervised approach to extract pharmacogenomics-specific drug-gene pairs from biomedical literature for personalized medicine.一种从生物医学文献中提取用于个性化医疗的药物-基因对的半监督方法。
J Biomed Inform. 2013 Aug;46(4):585-93. doi: 10.1016/j.jbi.2013.04.001. Epub 2013 Apr 6.
6
Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.从药品不良事件记录中识别中文命名实体:基于激进嵌入与动态嵌入相结合的BERT的双向长短期条件随机场(Bi-LSTM-CRF)模型
JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext + CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study.基于RoBERTa-WWM-ext + CNN(带有全词掩码扩展的基于变换器预训练方法的稳健优化双向编码器表示与卷积神经网络相结合)模型的医患对话多标签分类:命名实体研究
JMIR Med Inform. 2022 Apr 21;10(4):e35606. doi: 10.2196/35606.
9
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
10
PGxCorpus, a manually annotated corpus for pharmacogenomics.PGxCorpus,一个用于药物基因组学的人工标注语料库。
Sci Data. 2020 Jan 2;7(1):3. doi: 10.1038/s41597-019-0342-9.

本文引用的文献

1
BERT-based Ranking for Biomedical Entity Normalization.基于BERT的生物医学实体规范化排序
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:269-277. eCollection 2020.
2
Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)-Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study.基于大规模电子健康记录笔记对基于变换器的双向编码器表征(BERT)模型进行微调:一项实证研究。
JMIR Med Inform. 2019 Sep 12;7(3):e14830. doi: 10.2196/14830.
3
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.
BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
4
DigChem: Identification of disease-gene-chemical relationships from Medline abstracts.DigChem:从 Medline 文摘中识别疾病-基因-化学关系。
PLoS Comput Biol. 2019 May 15;15(5):e1007022. doi: 10.1371/journal.pcbi.1007022. eCollection 2019 May.
5
A global network of biomedical relationships derived from text.从文本中提取的生物医学关系的全球网络。
Bioinformatics. 2018 Aug 1;34(15):2614-2624. doi: 10.1093/bioinformatics/bty114.
6
An automatic approach for constructing a knowledge base of symptoms in Chinese.一种构建中文症状知识库的自动方法。
J Biomed Semantics. 2017 Sep 20;8(Suppl 1):33. doi: 10.1186/s13326-017-0145-x.
7
Drug target ontology to classify and integrate drug discovery data.用于对药物发现数据进行分类和整合的药物靶点本体论。
J Biomed Semantics. 2017 Nov 9;8(1):50. doi: 10.1186/s13326-017-0161-x.
8
Durable Response to Afatinib in Lung Adenocarcinoma Harboring NRG1 Gene Fusions.携带NRG1基因融合的肺腺癌对阿法替尼的持久反应
J Thorac Oncol. 2017 Aug;12(8):e107-e110. doi: 10.1016/j.jtho.2017.04.025. Epub 2017 May 10.
9
Alectinib versus crizotinib in patients with ALK-positive non-small-cell lung cancer (J-ALEX): an open-label, randomised phase 3 trial.阿来替尼对比克唑替尼用于治疗 ALK 阳性非小细胞肺癌患者(J-ALEX):一项开放标签、随机、III 期临床试验。
Lancet. 2017 Jul 1;390(10089):29-39. doi: 10.1016/S0140-6736(17)30565-2. Epub 2017 May 10.
10
Learning from biomedical linked data to suggest valid pharmacogenes.从生物医学关联数据中学习以推荐有效的药物基因。
J Biomed Semantics. 2017 Apr 20;8(1):16. doi: 10.1186/s13326-017-0125-1.