Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature.

Author information

Winnenburg Rainer, Shah Nigam H

Affiliation

Stanford Center for Biomedical Informatics Research, 1265 Welch Road, MSOB, Stanford, CA, 94305, USA.

Publication information

BMC Bioinformatics. 2016 Jun 23;17:250. doi: 10.1186/s12859-016-1080-z.

DOI:10.1186/s12859-016-1080-z

PMID:27333889

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4918084/

Abstract

BACKGROUND

Identification of associations between marketed drugs and adverse events from the biomedical literature assists drug safety monitoring efforts. Assessing the significance of such literature-derived associations and determining the granularity at which they should be captured remain a challenge. Here, we assess how defining a selection of adverse event terms from MeSH, based on information content, can improve the detection of adverse events for drugs and drug classes.
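To make the idea of selecting terms by information content concrete, the following is a minimal sketch, not the authors' implementation. It assumes the common definition IC(t) = -log p(t), where p(t) is the share of annotations that fall on a term or any of its descendants. The toy hierarchy, the counts, and the function names are hypothetical and do not reflect the real MeSH tree or MEDLINE frequencies.

```python
from math import log

# Hypothetical toy hierarchy (child -> parent); not the real MeSH tree.
PARENT = {
    "Cardiovascular Diseases": None,
    "Heart Diseases": "Cardiovascular Diseases",
    "Myocardial Infarction": "Heart Diseases",
    "Arrhythmias, Cardiac": "Heart Diseases",
}

# Hypothetical number of articles indexed directly with each term.
DIRECT_COUNTS = {
    "Cardiovascular Diseases": 10,
    "Heart Diseases": 30,
    "Myocardial Infarction": 40,
    "Arrhythmias, Cardiac": 20,
}


def cumulative_counts(parent, direct):
    """Add each term's direct count to the term itself and all its ancestors."""
    totals = dict.fromkeys(parent, 0)
    for term, n in direct.items():
        node = term
        while node is not None:
            totals[node] += n
            node = parent[node]
    return totals


def information_content(parent, direct):
    """IC(t) = -log(p(t)); assumes every term has a nonzero cumulative count."""
    totals = cumulative_counts(parent, direct)
    grand_total = sum(direct.values())
    return {t: -log(totals[t] / grand_total) for t in totals}


def generalize(term, ic, parent, max_ic):
    """Walk up the hierarchy until the term's IC is at most max_ic (or the root)."""
    node = term
    while parent[node] is not None and ic[node] > max_ic:
        node = parent[node]
    return node


ic = information_content(PARENT, DIRECT_COUNTS)
coarse = generalize("Myocardial Infarction", ic, PARENT, max_ic=0.5)
print(coarse)
```

Lowering `max_ic` pushes specific terms up to broader ancestors, which is one simple way to vary the abstraction level at which adverse events are counted.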

RESULTS

We analyze a set of 105,354 candidate drug-adverse event pairs extracted from article indexes in MEDLINE. First, we harmonize extracted adverse event terms by aggregating them into higher-level MeSH terms based on the terms' information content. Then, we determine statistical enrichment of adverse events associated with drugs and drug classes using a conditional hypergeometric test that adjusts for dependencies among associated terms. We compare our results with methods based on disproportionality analysis (proportional reporting ratio, PRR) and quantify the improvement in signal detection with our generalized enrichment analysis (GEA) approach using a gold standard of drug-adverse event associations spanning 174 drugs and four events. For single drugs, the best GEA method (precision: 0.92, recall: 0.71, F1-measure: 0.80) outperforms the best PRR-based method (0.69/0.69/0.69) on all four adverse event outcomes in our gold standard. For drug classes, our GEA performs similarly (0.85/0.69/0.74) when increasing the level of abstraction for adverse event terms. Finally, on examining the 1,609 individual drugs in our MEDLINE set, which map to chemical substances in ATC, we find signals for 1,379 drugs (10,122 unique adverse event associations) on applying GEA with p < 0.005.
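As a rough illustration of the two kinds of statistics compared here, the sketch below computes a PRR and a one-sided hypergeometric enrichment p-value from a single 2x2 table of article counts. Note that this is a plain, unconditional hypergeometric test; the conditional test used in the study additionally adjusts for dependencies among related terms, which this sketch does not attempt. The function names and the counts are hypothetical.

```python
from math import comb


def prr(a, b, c, d):
    """Proportional reporting ratio from a 2x2 table of article counts.

    a: drug of interest, event present    b: drug of interest, event absent
    c: all other drugs, event present     d: all other drugs, event absent
    """
    return (a / (a + b)) / (c / (c + d))


def hypergeom_enrichment_p(a, b, c, d):
    """One-sided p-value P(X >= a) under the hypergeometric distribution.

    X is the number of event-annotated articles among the (a + b) articles
    for the drug, drawn without replacement from all (a + b + c + d) articles.
    """
    n_total = a + b + c + d
    n_event = a + c
    n_drug = a + b
    upper = min(n_drug, n_event)
    tail = sum(
        comb(n_event, k) * comb(n_total - n_event, n_drug - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(n_total, n_drug)


# Hypothetical counts for one drug-event pair.
ratio = prr(12, 88, 50, 9850)
p_value = hypergeom_enrichment_p(12, 88, 50, 9850)
print(ratio, p_value)
```

A pair would be flagged as a signal when the p-value falls below a chosen cutoff such as the 0.005 used above, whereas PRR-based methods typically threshold the ratio itself.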

CONCLUSIONS

We present an approach based on generalized enrichment analysis that can be used to detect associations between drugs, drug classes and adverse events at a given level of granularity, at the same time correcting for known dependencies among events. Our study demonstrates the use of GEA, and the importance of choosing appropriate abstraction levels to complement current drug safety methods. We provide an R package for exploration of alternative abstraction levels of adverse event terms based on information content.

Figure 1 (image): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1873/4918084/1bcb521849e2/12859_2016_1080_Fig1_HTML.jpg
