• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对一项整理任务的疾病、药物和基因之间的排名关系。

Ranking relations between diseases, drugs and genes for a curation task.

作者信息

Clematide Simon, Rinaldi Fabio

机构信息

Institute of Computational Linguistics, University of Zurich, Binzmühlestrasse 14, 8050 Zurich, Switzerland.

出版信息

J Biomed Semantics. 2012 Oct 5;3 Suppl 3(Suppl 3):S5. doi: 10.1186/2041-1480-3-S3-S5.

DOI:10.1186/2041-1480-3-S3-S5
PMID:23046495
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3465213/
Abstract

BACKGROUND

One of the key pieces of information which biomedical text mining systems are expected to extract from the literature are interactions among different types of biomedical entities (proteins, genes, diseases, drugs, etc.). Several large resources of curated relations between biomedical entities are currently available, such as the Pharmacogenomics Knowledge Base (PharmGKB) or the Comparative Toxicogenomics Database (CTD).Biomedical text mining systems, and in particular those which deal with the extraction of relationships among entities, could make better use of the wealth of already curated material.

RESULTS

We propose a simple and effective method based on logistic regression (also known as maximum entropy modeling) for an optimized ranking of relation candidates utilizing curated abstracts. Furthermore, we examine the effects and difficulties of using widely available metadata (i.e. MeSH terms and chemical substance index terms) for relation extraction. Cross-validation experiments result in an improvement of the ranking quality in terms of AUCiP/R by 39% (PharmGKB) and 116% (CTD) against a frequency-based baseline of 0.39 (PharmGKB) and 0.21 (CTD). For the TAP-10 metrics, we achieve an improvement of 53% (PharmGKB) and 134% (CTD) against the same baseline system (0.21 PharmGKB and 0.15 CTD).

CONCLUSIONS

Our experiments with the PharmGKB and the CTD database show a strong positive effect for the ranking of relation candidates utilizing the vast amount of curated relations covered by currently available knowledge databases. The tasks of concept identification and candidate relation generation profit from the adaptation to previously curated material. This presents an effective and practical method suitable for conservative extension and re-validation of biomedical relations from texts that has been successfully used for curation experiments with the PharmGKB and CTD database.

摘要

背景

生物医学文本挖掘系统有望从文献中提取的关键信息之一是不同类型生物医学实体(蛋白质、基因、疾病、药物等)之间的相互作用。目前有几个生物医学实体之间经过整理的关系的大型资源,例如药物基因组学知识库(PharmGKB)或比较毒理基因组学数据库(CTD)。生物医学文本挖掘系统,尤其是那些处理实体间关系提取的系统,可以更好地利用已整理的大量材料。

结果

我们提出了一种基于逻辑回归(也称为最大熵建模)的简单有效方法,用于利用经过整理的摘要对关系候选进行优化排序。此外,我们研究了使用广泛可用的元数据(即医学主题词和化学物质索引词)进行关系提取的效果和困难。交叉验证实验表明,与基于频率的基线(PharmGKB为0.39,CTD为0.21)相比,在AUCiP/R方面,排名质量提高了39%(PharmGKB)和116%(CTD)。对于TAP - 10指标,与相同的基线系统(PharmGKB为0.21,CTD为0.15)相比,我们实现了53%(PharmGKB)和134%(CTD)的提高。

结论

我们使用PharmGKB和CTD数据库进行的实验表明,利用当前可用知识数据库涵盖的大量经过整理的关系对关系候选进行排序具有很强的积极效果。概念识别和候选关系生成任务受益于对先前整理材料的适应。这提出了一种有效且实用的方法,适用于从文本中保守扩展和重新验证生物医学关系,该方法已成功用于PharmGKB和CTD数据库的整理实验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/810d42cc13f6/2041-1480-3-S3-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/d5bd33c16034/2041-1480-3-S3-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/39e965e20c21/2041-1480-3-S3-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/5f76d10d23c7/2041-1480-3-S3-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/810d42cc13f6/2041-1480-3-S3-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/d5bd33c16034/2041-1480-3-S3-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/39e965e20c21/2041-1480-3-S3-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/5f76d10d23c7/2041-1480-3-S3-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/517b/3465213/810d42cc13f6/2041-1480-3-S3-S5-4.jpg

相似文献

1
Ranking relations between diseases, drugs and genes for a curation task.针对一项整理任务的疾病、药物和基因之间的排名关系。
J Biomed Semantics. 2012 Oct 5;3 Suppl 3(Suppl 3):S5. doi: 10.1186/2041-1480-3-S3-S5.
2
Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD).文本挖掘和化学-基因-疾病网络的人工整理用于比较毒理学基因组数据库(CTD)。
BMC Bioinformatics. 2009 Oct 8;10:326. doi: 10.1186/1471-2105-10-326.
3
Relation mining experiments in the pharmacogenomics domain.药物基因组学领域的关系挖掘实验。
J Biomed Inform. 2012 Oct;45(5):851-61. doi: 10.1016/j.jbi.2012.04.014. Epub 2012 May 10.
4
Text mining effectively scores and ranks the literature for improving chemical-gene-disease curation at the comparative toxicogenomics database.文本挖掘有效地对文献进行评分和排序,以提高比较毒理学基因组学数据库中的化学物质-基因-疾病的编纂工作。
PLoS One. 2013 Apr 17;8(4):e58201. doi: 10.1371/journal.pone.0058201. Print 2013.
5
miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.miRiaD:一种用于检测微小RNA与疾病关联的文本挖掘工具。
J Biomed Semantics. 2016 Apr 29;7(1):9. doi: 10.1186/s13326-015-0044-y.
6
PGxMine: Text mining for curation of PharmGKB.PGxMine:用于 PharmGKB 策管的文本挖掘。
Pac Symp Biocomput. 2020;25:611-622.
7
Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research.从文本和大规模数据分析中提取基因与疾病之间的关系:对转化研究的启示。
BMC Bioinformatics. 2015 Feb 21;16:55. doi: 10.1186/s12859-015-0472-9.
8
Using ODIN for a PharmGKB revalidation experiment.使用 ODIN 进行 PharmGKB 再验证实验。
Database (Oxford). 2012 Apr 23;2012:bas021. doi: 10.1093/database/bas021. Print 2012.
9
Targeted journal curation as a method to improve data currency at the Comparative Toxicogenomics Database.靶向期刊策展作为一种提高比较毒理学基因组学数据库数据时效性的方法。
Database (Oxford). 2012 Dec 6;2012:bas051. doi: 10.1093/database/bas051. Print 2012.
10
The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database.比较毒理学基因组学数据库中用于科学文献人工注释的注释范例和应用工具。
Database (Oxford). 2011 Sep 20;2011:bar034. doi: 10.1093/database/bar034. Print 2011.

引用本文的文献

1
qPCR Analysis Reveals Association of Differential Expression of , , and Genes With Type 2 Diabetes Mellitus.qPCR 分析显示 、 和 基因的差异表达与 2 型糖尿病有关。
Front Endocrinol (Lausanne). 2022 Jan 3;12:774696. doi: 10.3389/fendo.2021.774696. eCollection 2021.
2
Systems-level differential gene expression analysis reveals new genetic variants of oral cancer.系统水平的差异基因表达分析揭示了口腔癌的新遗传变异。
Sci Rep. 2020 Sep 4;10(1):14667. doi: 10.1038/s41598-020-71346-7.
3
A systematic simulation-based meta-analytical framework for prediction of physiological biomarkers in alopecia.

本文引用的文献

1
Using ODIN for a PharmGKB revalidation experiment.使用 ODIN 进行 PharmGKB 再验证实验。
Database (Oxford). 2012 Apr 23;2012:bas021. doi: 10.1093/database/bas021. Print 2012.
2
BioCreative III interactive task: an overview.BioCreative III 交互式任务概述。
BMC Bioinformatics. 2011 Oct 3;12 Suppl 8(Suppl 8):S4. doi: 10.1186/1471-2105-12-S8-S4.
3
Terminological resources for text mining over biomedical scientific literature.生物医学文献文本挖掘的术语资源。
一种基于系统模拟的荟萃分析框架,用于预测脱发中的生理生物标志物。
J Biol Res (Thessalon). 2019 Apr 4;26:2. doi: 10.1186/s40709-019-0094-x. eCollection 2019 Dec.
4
LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes.LimTox:一个用于化合物、药物和基因的不良事件和毒性关联的应用文本挖掘的网络工具。
Nucleic Acids Res. 2017 Jul 3;45(W1):W484-W489. doi: 10.1093/nar/gkx462.
5
Cellular Signaling Pathways in Insulin Resistance-Systems Biology Analyses of Microarray Dataset Reveals New Drug Target Gene Signatures of Type 2 Diabetes Mellitus.胰岛素抵抗中的细胞信号通路——微阵列数据集的系统生物学分析揭示2型糖尿病的新药靶基因特征
Front Physiol. 2017 Jan 25;8:13. doi: 10.3389/fphys.2017.00013. eCollection 2017.
6
DTMiner: identification of potential disease targets through biomedical literature mining.DTMiner:通过生物医学文献挖掘识别潜在疾病靶点。
Bioinformatics. 2016 Dec 1;32(23):3619-3626. doi: 10.1093/bioinformatics/btw503. Epub 2016 Aug 9.
7
OntoGene web services for biomedical text mining.OntoGene 生物医学文本挖掘网络服务。
BMC Bioinformatics. 2014;15 Suppl 14(Suppl 14):S6. doi: 10.1186/1471-2105-15-S14-S6. Epub 2014 Nov 27.
8
Using the OntoGene pipeline for the triage task of BioCreative 2012.使用 OntoGene 流水线进行 BioCreative 2012 的分诊任务。
Database (Oxford). 2013 Feb 9;2013:bas053. doi: 10.1093/database/bas053. Print 2013.
9
Topics in machine learning for biomedical literature analysis and text retrieval.用于生物医学文献分析和文本检索的机器学习主题。
J Biomed Semantics. 2012 Oct 5;3 Suppl 3(Suppl 3):S1. doi: 10.1186/2041-1480-3-S3-S1.
10
Using ODIN for a PharmGKB revalidation experiment.使用 ODIN 进行 PharmGKB 再验证实验。
Database (Oxford). 2012 Apr 23;2012:bas021. doi: 10.1093/database/bas021. Print 2012.
Artif Intell Med. 2011 Jun;52(2):107-14. doi: 10.1016/j.artmed.2011.04.011. Epub 2011 Jun 11.
4
The Comparative Toxicogenomics Database: update 2011.比较毒理基因组学数据库:2011年更新版
Nucleic Acids Res. 2011 Jan;39(Database issue):D1067-72. doi: 10.1093/nar/gkq813. Epub 2010 Sep 22.
5
An Overview of BioCreative II.5.BioCreative II.5 概述。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):385-99. doi: 10.1109/tcbb.2010.61.
6
OntoGene in BioCreative II.5.Ontogene 在 BioCreative II.5 中。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):472-80. doi: 10.1109/TCBB.2010.50.
7
Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics.阈平均值精度(TAP-k):一种专为生物信息学设计的检索度量标准。
Bioinformatics. 2010 Jul 15;26(14):1708-13. doi: 10.1093/bioinformatics/btq270. Epub 2010 May 26.
8
An overview of MetaMap: historical perspective and recent advances.MetaMap 概述:历史视角与最新进展。
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
9
BioPortal: ontologies and integrated data resources at the click of a mouse.生物门户:一键点击即可获取本体和集成数据资源。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W170-3. doi: 10.1093/nar/gkp440. Epub 2009 May 29.
10
Evaluating contributions of natural language parsers to protein-protein interaction extraction.评估自然语言解析器对蛋白质-蛋白质相互作用提取的贡献。
Bioinformatics. 2009 Feb 1;25(3):394-400. doi: 10.1093/bioinformatics/btn631. Epub 2008 Dec 9.