利用生物医学文献中的语义谓词构建的联合药物治疗知识图谱：算法开发

Du Jian, Li Xiaoying

National Institute of Health Data Science, Peking University, Beijing, China.

Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing, China.

JMIR Med Inform. 2020 Apr 28;8(4):e18323. doi: 10.2196/18323.

BACKGROUND

Combination therapy plays an important role in the effective treatment of malignant neoplasms and precision medicine. Numerous clinical studies have been carried out to investigate combination drug therapies. Automated knowledge discovery of these combinations and their graphic representation in knowledge graphs will enable pattern recognition and identification of drug combinations used to treat a specific type of cancer, improve drug efficacy and treatment of human disorders.

OBJECTIVE

This paper aims to develop an automated, visual approach to discover knowledge about combination therapies from biomedical literature, especially from those studies with high-level evidence such as clinical trial reports and clinical practice guidelines.

METHODS

Based on semantic predications, which consist of a triple structure of subject-predicate-object (SPO), we proposed an automated algorithm to discover knowledge of combination drug therapies using the following rules: 1) two or more semantic predications (S-P-O and S-P-O, i = 2, 3…) can be extracted from one conclusive claim (sentence) in the abstract of a given publication, and 2) these predications have an identical predicate (that closely relates to human disease treatment, eg, "treat") and object (eg, disease name) but different subjects (eg, drug names). A customized knowledge graph organizes and visualizes these combinations, improving the traditional semantic triples. After automatic filtering of broad concepts such as "pharmacologic actions" and generic disease names, a set of combination drug therapies were identified and characterized through manual interpretation.

RESULTS

We retrieved 22,263 clinical trial reports and 31 clinical practice guidelines from PubMed abstracts by searching "antineoplastic agents" for drug restriction (published between Jan 2009 and Oct 2019). There were 15,603 conclusive claims locally parsed using the search terms "conclusion*" and "conclude*" ready for semantic predications extraction by SemRep, and 325 candidate groups of semantic predications about combined medications were automatically discovered within 316 conclusive claims. Based on manual analysis, we determined that 255/316 claims (78.46%) were accurately identified as describing combination therapies and adopted these to construct the customized knowledge graph. We also identified two categories (and 4 subcategories) to characterize the inaccurate results: limitations of SemRep and limitations of proposal. We further learned the predominant patterns of drug combinations based on mechanism of action for new combined medication studies and discovered 4 obvious markers ("combin*," "coadministration," "co-administered," and "regimen") to identify potential combination therapies to enable development of a machine learning algorithm.

CONCLUSIONS

Semantic predications from conclusive claims in the biomedical literature can be used to support automated knowledge discovery and knowledge graph construction for combination therapies. A machine learning approach is warranted to take full advantage of the identified markers and other contextual features.

背景

联合治疗在恶性肿瘤的有效治疗和精准医学中发挥着重要作用。已经开展了大量临床研究来探究联合药物治疗。对这些联合用药进行自动化知识发现并在知识图谱中进行图形化表示，将有助于模式识别以及识别用于治疗特定类型癌症的药物组合，提高药物疗效并改善人类疾病的治疗效果。

目的

本文旨在开发一种自动化的可视化方法，从生物医学文献，尤其是从那些具有高级别证据的研究（如临床试验报告和临床实践指南）中发现有关联合治疗的知识。

方法

基于由主语 - 谓语 - 宾语（SPO）三元结构组成的语义谓词，我们提出了一种自动化算法，使用以下规则来发现联合药物治疗的知识：1）可以从给定出版物摘要中的一个结论性声明（句子）中提取两个或更多语义谓词（S - P - O和S - P - O，i = 2, 3…），并且2）这些谓词具有相同的谓语（与人类疾病治疗密切相关，例如“治疗”）和宾语（例如疾病名称），但主语不同（例如药物名称）。一个定制的知识图谱对这些组合进行组织和可视化，改进了传统的语义三元组。在自动过滤诸如“药理作用”等宽泛概念和通用疾病名称后，通过人工解读确定并表征了一组联合药物治疗。

结果

我们通过在PubMed摘要中搜索“抗肿瘤药”进行药物限制（发表于2009年1月至2019年10月之间），检索到22,263份临床试验报告和31份临床实践指南。使用搜索词“conclusion*”和“conclude*”对15,603个结论性声明进行了本地解析，准备由SemRep提取语义谓词，并且在316个结论性声明中自动发现了325个关于联合用药的候选语义谓词组。基于人工分析，我们确定316个声明中的255个（78.46%）被准确识别为描述联合治疗，并采用这些声明来构建定制的知识图谱。我们还确定了两类（以及4个子类）来表征不准确的结果：SemRep的局限性和提议的局限性。我们进一步基于新联合用药研究的作用机制了解了药物组合的主要模式，并发现了4个明显的标记（“combin*”、“coadministration”、“co - administered”和“regimen”）来识别潜在的联合治疗，以开发机器学习算法。

结论

生物医学文献中结论性声明的语义谓词可用于支持联合治疗的自动化知识发现和知识图谱构建。有必要采用机器学习方法来充分利用已识别的标记和其他上下文特征。

相似文献

A Knowledge Graph of Combined Drug Therapies Using Semantic Predications From Biomedical Literature: Algorithm Development.

JMIR Med Inform. 2020 Apr 28;8(4):e18323. doi: 10.2196/18323.

Towards a characterization of apparent contradictions in the biomedical literature using context analysis.

J Biomed Inform. 2019 Oct;98:103275. doi: 10.1016/j.jbi.2019.103275. Epub 2019 Aug 29.

Context-driven automatic subgraph creation for literature-based discovery.

J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.

Expanding vocabularies for complementary and alternative medicine therapies.

Int J Med Inform. 2019 Jan;121:64-74. doi: 10.1016/j.ijmedinf.2018.11.009. Epub 2018 Nov 22.

Assigning factuality values to semantic relations extracted from biomedical research literature.

PLoS One. 2017 Jul 5;12(7):e0179926. doi: 10.1371/journal.pone.0179926. eCollection 2017.

Towards medical knowmetrics: representing and computing medical knowledge using semantic predications as the knowledge unit and the uncertainty as the knowledge context.

Scientometrics. 2021;126(7):6225-6251. doi: 10.1007/s11192-021-03880-8. Epub 2021 Feb 14.

A graph-based recovery and decomposition of Swanson's hypothesis using semantic predications.

J Biomed Inform. 2013 Apr;46(2):238-51. doi: 10.1016/j.jbi.2012.09.004. Epub 2012 Sep 28.

Developing a Knowledge Graph for Pharmacokinetic Natural Product-Drug Interactions.

J Biomed Inform. 2023 Apr;140:104341. doi: 10.1016/j.jbi.2023.104341. Epub 2023 Mar 17.

Adverse Drug Event Prediction Using Noisy Literature-Derived Knowledge Graphs: Algorithm Development and Validation.

JMIR Med Inform. 2021 Oct 25;9(10):e32730. doi: 10.2196/32730.

Drug repurposing for COVID-19 via knowledge graph completion.

J Biomed Inform. 2021 Mar;115:103696. doi: 10.1016/j.jbi.2021.103696. Epub 2021 Feb 8.

引用本文的文献

A study on large-scale disease causality discovery from biomedical literature.

BMC Med Inform Decis Mak. 2025 Mar 18;25(1):136. doi: 10.1186/s12911-025-02893-0.

Mining literature and pathway data to explore the relations of ketamine with neurotransmitters and gut microbiota using a knowledge-graph.

Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btad771.

[Overview of the application of knowledge graphs in the medical field].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Oct 25;40(5):1040-1044. doi: 10.7507/1001-5515.202204016.

本文引用的文献

Two-drug combination benefits patients with chronic lymphocytic leukemia.

Cancer. 2020 Jan 1;126(1):13. doi: 10.1002/cncr.32647.

Phase I Study of Combination Therapy With Weekly Nanoparticle Albumin-bound Paclitaxel and Cyclophosphamide in Metastatic Breast Cancer Patients.

Anticancer Res. 2019 Dec;39(12):6903-6907. doi: 10.21873/anticanres.13910.

Combining epigenetic drugs with other therapies for solid tumours - past lessons and future promise.

Nat Rev Clin Oncol. 2020 Feb;17(2):91-107. doi: 10.1038/s41571-019-0267-4. Epub 2019 Sep 30.

Combination Therapies of Artemisinin and its Derivatives as a Viable Approach for Future Cancer Treatment.

Curr Pharm Des. 2019;25(31):3323-3338. doi: 10.2174/1381612825666190902155957.

Towards a characterization of apparent contradictions in the biomedical literature using context analysis.

J Biomed Inform. 2019 Oct;98:103275. doi: 10.1016/j.jbi.2019.103275. Epub 2019 Aug 29.

Knowledge-guided convolutional networks for chemical-disease relation extraction.

BMC Bioinformatics. 2019 May 21;20(1):260. doi: 10.1186/s12859-019-2873-7.

Evaluating active learning methods for annotating semantic predications.

JAMIA Open. 2018 Oct;1(2):275-282. doi: 10.1093/jamiaopen/ooy021. Epub 2018 Jun 27.

Drug Repositioning to Accelerate Drug Development Using Social Media Data: Computational Study on Parkinson Disease.

J Med Internet Res. 2018 Oct 11;20(10):e271. doi: 10.2196/jmir.9646.

Using predicate and provenance information from a knowledge graph for drug efficacy screening.

J Biomed Semantics. 2018 Sep 6;9(1):23. doi: 10.1186/s13326-018-0189-6.

Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration.

BMC Med Genomics. 2017 Dec 28;10(Suppl 5):79. doi: 10.1186/s12920-017-0311-0.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

A Knowledge Graph of Combined Drug Therapies Using Semantic Predications From Biomedical Literature: Algorithm Development.

JMIR Med Inform. 2020 Apr 28;8(4):e18323. doi: 10.2196/18323.

Towards a characterization of apparent contradictions in the biomedical literature using context analysis.

J Biomed Inform. 2019 Oct;98:103275. doi: 10.1016/j.jbi.2019.103275. Epub 2019 Aug 29.

Context-driven automatic subgraph creation for literature-based discovery.

J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.

Expanding vocabularies for complementary and alternative medicine therapies.

Int J Med Inform. 2019 Jan;121:64-74. doi: 10.1016/j.ijmedinf.2018.11.009. Epub 2018 Nov 22.

Assigning factuality values to semantic relations extracted from biomedical research literature.

PLoS One. 2017 Jul 5;12(7):e0179926. doi: 10.1371/journal.pone.0179926. eCollection 2017.

Towards medical knowmetrics: representing and computing medical knowledge using semantic predications as the knowledge unit and the uncertainty as the knowledge context.

Scientometrics. 2021;126(7):6225-6251. doi: 10.1007/s11192-021-03880-8. Epub 2021 Feb 14.

A graph-based recovery and decomposition of Swanson's hypothesis using semantic predications.

J Biomed Inform. 2013 Apr;46(2):238-51. doi: 10.1016/j.jbi.2012.09.004. Epub 2012 Sep 28.

Developing a Knowledge Graph for Pharmacokinetic Natural Product-Drug Interactions.

J Biomed Inform. 2023 Apr;140:104341. doi: 10.1016/j.jbi.2023.104341. Epub 2023 Mar 17.

Adverse Drug Event Prediction Using Noisy Literature-Derived Knowledge Graphs: Algorithm Development and Validation.

JMIR Med Inform. 2021 Oct 25;9(10):e32730. doi: 10.2196/32730.

Drug repurposing for COVID-19 via knowledge graph completion.

J Biomed Inform. 2021 Mar;115:103696. doi: 10.1016/j.jbi.2021.103696. Epub 2021 Feb 8.

引用本文的文献

A study on large-scale disease causality discovery from biomedical literature.

BMC Med Inform Decis Mak. 2025 Mar 18;25(1):136. doi: 10.1186/s12911-025-02893-0.

Mining literature and pathway data to explore the relations of ketamine with neurotransmitters and gut microbiota using a knowledge-graph.

Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btad771.

[Overview of the application of knowledge graphs in the medical field].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Oct 25;40(5):1040-1044. doi: 10.7507/1001-5515.202204016.

本文引用的文献

Two-drug combination benefits patients with chronic lymphocytic leukemia.

Cancer. 2020 Jan 1;126(1):13. doi: 10.1002/cncr.32647.

Phase I Study of Combination Therapy With Weekly Nanoparticle Albumin-bound Paclitaxel and Cyclophosphamide in Metastatic Breast Cancer Patients.

Anticancer Res. 2019 Dec;39(12):6903-6907. doi: 10.21873/anticanres.13910.

Combining epigenetic drugs with other therapies for solid tumours - past lessons and future promise.

Nat Rev Clin Oncol. 2020 Feb;17(2):91-107. doi: 10.1038/s41571-019-0267-4. Epub 2019 Sep 30.

Combination Therapies of Artemisinin and its Derivatives as a Viable Approach for Future Cancer Treatment.

Curr Pharm Des. 2019;25(31):3323-3338. doi: 10.2174/1381612825666190902155957.

Towards a characterization of apparent contradictions in the biomedical literature using context analysis.

J Biomed Inform. 2019 Oct;98:103275. doi: 10.1016/j.jbi.2019.103275. Epub 2019 Aug 29.

Knowledge-guided convolutional networks for chemical-disease relation extraction.

BMC Bioinformatics. 2019 May 21;20(1):260. doi: 10.1186/s12859-019-2873-7.

Evaluating active learning methods for annotating semantic predications.

JAMIA Open. 2018 Oct;1(2):275-282. doi: 10.1093/jamiaopen/ooy021. Epub 2018 Jun 27.

Drug Repositioning to Accelerate Drug Development Using Social Media Data: Computational Study on Parkinson Disease.

J Med Internet Res. 2018 Oct 11;20(10):e271. doi: 10.2196/jmir.9646.

Using predicate and provenance information from a knowledge graph for drug efficacy screening.

J Biomed Semantics. 2018 Sep 6;9(1):23. doi: 10.1186/s13326-018-0189-6.

Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration.

BMC Med Genomics. 2017 Dec 28;10(Suppl 5):79. doi: 10.1186/s12920-017-0311-0.

A Knowledge Graph of Combined Drug Therapies Using Semantic Predications From Biomedical Literature: Algorithm Development.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献