Extracting causal relations on HIV drug resistance from literature.

Affiliation

Computational Science, University of Amsterdam, Science Park 107, 1098 XG Amsterdam, The Netherlands.

Publication information

BMC Bioinformatics. 2010 Feb 23;11:101. doi: 10.1186/1471-2105-11-101.

DOI: 10.1186/1471-2105-11-101
PMID: 20178611
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2841207/
Abstract

BACKGROUND

In HIV treatment, it is critical to have up-to-date resistance data for applicable drugs, since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts before virologists and medical doctors can use them. Therefore, there is an urgent need for a tool that partially automates this process and can retrieve relations between drugs and virus mutations from the literature.

RESULTS

In this work, we present a novel method to extract and combine relationships between HIV drugs and mutations in viral genomes. Our extraction method uses natural language processing (NLP) to produce grammatical relations and then applies a set of rules to these relations. We applied our method to a relevant set of PubMed abstracts and obtained 2,434 extracted relations, with an estimated F-score of 84%. We then combined the extracted relations using logistic regression to generate resistance values for each <drug, mutation> pair. The results of this relation combination show more than 85% agreement with the Stanford HIVDB for the ten most frequently occurring mutations. The system is used in five hospitals of the Virolab project (http://www.virolab.org) to preselect the most relevant novel resistance data from the literature and present them to virologists and medical doctors for further evaluation.

CONCLUSIONS

The proposed relation extraction and combination method performs well at extracting HIV drug resistance data. It can be used in large-scale relation extraction experiments. The developed methods can also be applied to extract other types of relations, such as gene-protein, gene-disease, and disease-mutation.
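
As a rough illustration of the extraction and evaluation ideas summarized in the abstract, the sketch below pairs drug names with mutation mentions and computes a balanced F-score. It is a deliberately simplified stand-in, not the authors' method: it uses sentence-level co-occurrence with cue words instead of parser-produced grammatical relations, and it omits the logistic regression combination step. The drug lexicon, cue words, function names, and sample sentences are all assumptions made for this example.

```python
import re

# Hypothetical mini-lexicon of drug names; a real system would use a curated list.
DRUGS = {"lamivudine", "efavirenz", "zidovudine"}

# Point mutations are conventionally written as wild-type residue,
# position, mutant residue (for example M184V).
MUTATION = re.compile(r"\b[ACDEFGHIKLMNPQRSTVWY]\d{1,3}[ACDEFGHIKLMNPQRSTVWY]\b")

# Illustrative cue words only; these are not the rules used in the paper.
CUES = ("resistance", "resistant")


def extract_relations(text):
    """Return (drug, mutation) pairs that share a sentence with a cue word."""
    pairs = []
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        lowered = sentence.lower()
        if not any(cue in lowered for cue in CUES):
            continue
        drugs = sorted(d for d in DRUGS if d in lowered)
        mutations = MUTATION.findall(sentence)
        pairs.extend((d, m) for d in drugs for m in mutations)
    return pairs


def f_score(true_positives, false_positives, false_negatives):
    """Balanced F-score: the harmonic mean of precision and recall."""
    if true_positives == 0:
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)


# Made-up sample text for the example (not taken from the paper's corpus).
sample = (
    "The M184V mutation confers high-level resistance to lamivudine. "
    "K103N was observed in patients resistant to efavirenz. "
    "Zidovudine was well tolerated."
)
relations = extract_relations(sample)
```

Co-occurrence is much noisier than rules over grammatical relations, since it cannot tell which entity is the subject of the resistance claim; it is used here only because it needs no parser.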

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4133/2841207/286d518bfff6/1471-2105-11-101-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4133/2841207/6d55a5cc07bf/1471-2105-11-101-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4133/2841207/f77553390510/1471-2105-11-101-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4133/2841207/8290ae73b929/1471-2105-11-101-4.jpg

Similar articles

1
Extracting causal relations on HIV drug resistance from literature.
BMC Bioinformatics. 2010 Feb 23;11:101. doi: 10.1186/1471-2105-11-101.
2
Text-mining of PubMed abstracts by natural language processing to create a public knowledge base on molecular mechanisms of bacterial enteropathogens.
BMC Bioinformatics. 2009 Jun 10;10:177. doi: 10.1186/1471-2105-10-177.
3
An ontology-driven method for hierarchical mining of temporal patterns: application to HIV drug resistance research.
AMIA Annu Symp Proc. 2007 Oct 11;2007:614-9.
4
Construction, training and clinical validation of an interpretation system for genotypic HIV-1 drug resistance based on fuzzy rules revised by virological outcomes.
Antivir Ther. 2004 Aug;9(4):583-93.
5
Analysis of discrepancies in the interpretation of antiretroviral drug resistance results in HIV-1 infected patients of Basque Country, Spain.
J Clin Virol. 2005 Jul;33(3):224-9. doi: 10.1016/j.jcv.2004.06.014. Epub 2005 Jan 12.
6
Study of the impact of HIV genotypic drug resistance testing on therapy efficacy.
Verh K Acad Geneeskd Belg. 2001;63(5):447-73.
7
Protein structures and information extraction from biological texts: the PASTA system.
Bioinformatics. 2003 Jan;19(1):135-43. doi: 10.1093/bioinformatics/19.1.135.
8
Human immunodeficiency virus type 1 (HIV-1) genotyping in Rio de Janeiro, Brazil: assessing subtype and drug-resistance associated mutations in HIV-1 infected individuals failing highly active antiretroviral therapy.
Mem Inst Oswaldo Cruz. 2005 Feb;100(1):73-8. doi: 10.1590/s0074-02762005000100014. Epub 2005 Apr 12.
9
MedEvi: retrieving textual evidence of relations between biomedical concepts from Medline.
Bioinformatics. 2008 Jun 1;24(11):1410-2. doi: 10.1093/bioinformatics/btn117. Epub 2008 Apr 9.
10
HIV-1C proviral DNA for detection of drug resistance mutations.
PLoS One. 2018 Oct 4;13(10):e0205119. doi: 10.1371/journal.pone.0205119. eCollection 2018.

Cited by

1
Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature.
Heliyon. 2023 Aug 22;9(9):e19265. doi: 10.1016/j.heliyon.2023.e19265. eCollection 2023 Sep.
2
Understanding the genetics of viral drug resistance by integrating clinical data and mining of the scientific literature.
Sci Rep. 2022 Aug 25;12(1):14476. doi: 10.1038/s41598-022-17746-3.
3
Automatic extraction of protein-protein interactions using grammatical relationship graph.
BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):42. doi: 10.1186/s12911-018-0628-4.
4
Use of ontology structure and Bayesian models to aid the crowdsourcing of ICD-11 sanctioning rules.
J Biomed Inform. 2017 Apr;68:20-34. doi: 10.1016/j.jbi.2017.02.004. Epub 2017 Feb 10.
5
Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.
Brief Bioinform. 2017 Jan;18(1):160-178. doi: 10.1093/bib/bbw001. Epub 2016 Feb 5.
6
Survey of Natural Language Processing Techniques in Bioinformatics.
Comput Math Methods Med. 2015;2015:674296. doi: 10.1155/2015/674296. Epub 2015 Oct 7.
7
Domain adaption of parsing for operative notes.
J Biomed Inform. 2015 Apr;54:1-9. doi: 10.1016/j.jbi.2015.01.016. Epub 2015 Feb 7.
8
Integrated bio-entity network: a system for biological knowledge discovery.
PLoS One. 2011;6(6):e21474. doi: 10.1371/journal.pone.0021474. Epub 2011 Jun 27.
9
Semantic relations for problem-oriented medical records.
Artif Intell Med. 2010 Oct;50(2):63-73. doi: 10.1016/j.artmed.2010.05.006. Epub 2010 Jun 19.

References

1
Bayesian inference of protein-protein interactions from biological literature.
Bioinformatics. 2009 Jun 15;25(12):1536-42. doi: 10.1093/bioinformatics/btp245. Epub 2009 Apr 15.
2
The challenge of finding a cure for HIV infection.
Science. 2009 Mar 6;323(5919):1304-7. doi: 10.1126/science.1165706.
3
Evaluating contributions of natural language parsers to protein-protein interaction extraction.
Bioinformatics. 2009 Feb 1;25(3):394-400. doi: 10.1093/bioinformatics/btn631. Epub 2008 Dec 9.
4
Large-scale directional relationship extraction and resolution.
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S11. doi: 10.1186/1471-2105-9-S9-S11.
5
Detection of gene interactions based on syntactic relations.
J Biomed Biotechnol. 2008;2008:371710. doi: 10.1155/2008/371710.
6
A text-mining perspective on the requirements for electronically annotated abstracts.
FEBS Lett. 2008 Apr 9;582(8):1178-81. doi: 10.1016/j.febslet.2008.02.072. Epub 2008 Mar 6.
7
Extracting interactions between proteins from the literature.
J Biomed Inform. 2008 Apr;41(2):393-407. doi: 10.1016/j.jbi.2007.11.008. Epub 2007 Dec 15.
8
Kernel approaches for genic interaction extraction.
Bioinformatics. 2008 Jan 1;24(1):118-26. doi: 10.1093/bioinformatics/btm544. Epub 2007 Nov 14.
9
Frontiers of biomedical text mining: current progress.
Brief Bioinform. 2007 Sep;8(5):358-75. doi: 10.1093/bib/bbm045. Epub 2007 Oct 30.
10
Mining complex genotypic features for predicting HIV-1 drug resistance.
Bioinformatics. 2007 Sep 15;23(18):2455-62. doi: 10.1093/bioinformatics/btm353. Epub 2007 Aug 13.