• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LAITOR--术语共现和关系识别的文献助手。

LAITOR--Literature Assistant for Identification of Terms co-Occurrences and Relationships.

机构信息

Max-Delbrück Center for Molecular Medicine, Berlin, Germany.

出版信息

BMC Bioinformatics. 2010 Feb 1;11:70. doi: 10.1186/1471-2105-11-70.

DOI:10.1186/1471-2105-11-70
PMID:20122157
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3098111/
Abstract

BACKGROUND

Biological knowledge is represented in scientific literature that often describes the function of genes/proteins (bioentities) in terms of their interactions (biointeractions). Such bioentities are often related to biological concepts of interest that are specific of a determined research field. Therefore, the study of the current literature about a selected topic deposited in public databases, facilitates the generation of novel hypotheses associating a set of bioentities to a common context.

RESULTS

We created a text mining system (LAITOR: Literature Assistant for Identification of Terms co-Occurrences and Relationships) that analyses co-occurrences of bioentities, biointeractions, and other biological terms in MEDLINE abstracts. The method accounts for the position of the co-occurring terms within sentences or abstracts. The system detected abstracts mentioning protein-protein interactions in a standard test (BioCreative II IAS test data) with a precision of 0.82-0.89 and a recall of 0.48-0.70. We illustrate the application of LAITOR to the detection of plant response genes in a dataset of 1000 abstracts relevant to the topic.

CONCLUSIONS

Text mining tools combining the extraction of interacting bioentities and biological concepts with network displays can be helpful in developing reasonable hypotheses in different scientific backgrounds.

摘要

背景

生物知识在科学文献中得到体现,这些文献通常根据基因/蛋白质(生物实体)的相互作用(生物相互作用)来描述其功能。这些生物实体通常与特定研究领域感兴趣的生物概念有关。因此,研究当前存储在公共数据库中的关于选定主题的文献,有助于生成将一组生物实体与共同背景联系起来的新假设。

结果

我们创建了一个文本挖掘系统(LAITOR:用于识别术语共现和关系的文献助手),该系统分析 MEDLINE 摘要中生物实体、生物相互作用和其他生物术语的共现。该方法考虑了共现术语在句子或摘要中的位置。该系统在标准测试(BioCreative II IAS 测试数据)中检测到提及蛋白质-蛋白质相互作用的摘要,精度为 0.82-0.89,召回率为 0.48-0.70。我们说明了将 Laitor 应用于从与主题相关的 1000 个摘要数据集中检测植物响应基因的情况。

结论

将提取相互作用的生物实体和生物概念与网络显示相结合的文本挖掘工具,可有助于在不同的科学背景下提出合理的假设。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed5/3098111/5a05a20cd5c0/1471-2105-11-70-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed5/3098111/e4fd7f3e57f2/1471-2105-11-70-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed5/3098111/5a05a20cd5c0/1471-2105-11-70-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed5/3098111/e4fd7f3e57f2/1471-2105-11-70-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ed5/3098111/5a05a20cd5c0/1471-2105-11-70-2.jpg

相似文献

1
LAITOR--Literature Assistant for Identification of Terms co-Occurrences and Relationships.LAITOR--术语共现和关系识别的文献助手。
BMC Bioinformatics. 2010 Feb 1;11:70. doi: 10.1186/1471-2105-11-70.
2
SciMiner: web-based literature mining tool for target identification and functional enrichment analysis.SciMiner:用于靶点识别和功能富集分析的基于网络的文献挖掘工具。
Bioinformatics. 2009 Mar 15;25(6):838-40. doi: 10.1093/bioinformatics/btp049. Epub 2009 Feb 2.
3
Hierarchical network analysis of co-occurring bioentities in literature.文献中共同出现的生物实体的层次网络分析。
Sci Rep. 2022 May 12;12(1):7885. doi: 10.1038/s41598-022-12093-9.
4
Biological information extraction and co-occurrence analysis.生物信息提取与共现分析。
Methods Mol Biol. 2014;1159:77-92. doi: 10.1007/978-1-4939-0709-0_5.
5
A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts.全面且定量地比较了 1500 万篇全文文章及其相应摘要中的文本挖掘。
PLoS Comput Biol. 2018 Feb 15;14(2):e1005962. doi: 10.1371/journal.pcbi.1005962. eCollection 2018 Feb.
6
An evaluation of GO annotation retrieval for BioCreAtIvE and GOA.对生物创意(BioCreAtIvE)和基因本体注释(GOA)的基因本体(GO)注释检索的评估。
BMC Bioinformatics. 2005;6 Suppl 1(Suppl 1):S17. doi: 10.1186/1471-2105-6-S1-S17. Epub 2005 May 24.
7
Textpresso: an ontology-based information retrieval and extraction system for biological literature.Textpresso:一个基于本体的生物文献信息检索与提取系统。
PLoS Biol. 2004 Nov;2(11):e309. doi: 10.1371/journal.pbio.0020309. Epub 2004 Sep 21.
8
Text mining tools for extracting information about microbial biodiversity in food.用于从食品中提取微生物生物多样性信息的文本挖掘工具。
Food Microbiol. 2019 Aug;81:63-75. doi: 10.1016/j.fm.2018.04.011. Epub 2018 Apr 21.
9
Information content in Medline record fields.医学在线数据库(Medline)记录字段中的信息内容。
Int J Med Inform. 2004 Jun 30;73(6):515-27. doi: 10.1016/j.ijmedinf.2004.02.008.
10
Text-mining approaches in molecular biology and biomedicine.分子生物学和生物医学中的文本挖掘方法。
Drug Discov Today. 2005 Mar 15;10(6):439-45. doi: 10.1016/S1359-6446(05)03376-3.

引用本文的文献

1
Identification of most influential co-occurring gene suites for gastrointestinal cancer using biomedical literature mining and graph-based influence maximization.利用生物医学文献挖掘和基于图的影响力最大化方法识别对胃肠道癌最具影响力的共现基因集
BMC Med Inform Decis Mak. 2020 Sep 3;20(1):208. doi: 10.1186/s12911-020-01227-6.
2
LAITOR4HPC: A text mining pipeline based on HPC for building interaction networks.LAITOR4HPC:一个基于高性能计算的文本挖掘管道,用于构建交互网络。
BMC Bioinformatics. 2020 Aug 24;21(1):365. doi: 10.1186/s12859-020-03620-4.
3
Text Mining for Protein Docking.

本文引用的文献

1
PLAN2L: a web tool for integrated text mining and literature-based bioentity relation extraction.PLAN2L:一个用于集成文本挖掘和基于文献的生物实体关系提取的网络工具。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W160-5. doi: 10.1093/nar/gkp484. Epub 2009 Jun 11.
2
MedlineRanker: flexible ranking of biomedical literature.MedlineRanker:生物医学文献的灵活排序
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W141-6. doi: 10.1093/nar/gkp353. Epub 2009 May 8.
3
Querying parse tree database of Medline text to synthesize user-specific biomolecular networks.
用于蛋白质对接的文本挖掘
PLoS Comput Biol. 2015 Dec 9;11(12):e1004630. doi: 10.1371/journal.pcbi.1004630. eCollection 2015 Dec.
4
Database constraints applied to metabolic pathway reconstruction tools.应用于代谢途径重建工具的数据库约束条件。
ScientificWorldJournal. 2014;2014:967294. doi: 10.1155/2014/967294. Epub 2014 Aug 17.
5
Large-scale structure of a network of co-occurring MeSH terms: statistical analysis of macroscopic properties.共同出现的医学主题词网络的大规模结构:宏观属性的统计分析
PLoS One. 2014 Jul 9;9(7):e102188. doi: 10.1371/journal.pone.0102188. eCollection 2014.
6
A systems biological approach reveals multiple crosstalk mechanism between gram-positive and negative bacterial infections: an insight into core mechanism and unique molecular signatures.一种系统生物学方法揭示了革兰氏阳性菌和阴性菌感染之间的多种相互作用机制:对核心机制和独特分子特征的洞察。
PLoS One. 2014 Feb 28;9(2):e89993. doi: 10.1371/journal.pone.0089993. eCollection 2014.
7
Extracting rate changes in transcriptional regulation from MEDLINE abstracts.从 MEDLINE 摘要中提取转录调控的变化率。
BMC Bioinformatics. 2014;15 Suppl 2(Suppl 2):S4. doi: 10.1186/1471-2105-15-S2-S4. Epub 2014 Jan 24.
8
Systems biology elucidates common pathogenic mechanisms between nonalcoholic and alcoholic-fatty liver disease.系统生物学阐明了非酒精性和酒精性脂肪性肝病之间的共同致病机制。
PLoS One. 2013;8(3):e58895. doi: 10.1371/journal.pone.0058895. Epub 2013 Mar 13.
9
Context-specific protein network miner--an online system for exploring context-specific protein interaction networks from the literature.语境特异蛋白网络挖掘器——一个在线系统,用于从文献中挖掘语境特异蛋白相互作用网络。
PLoS One. 2012;7(4):e34480. doi: 10.1371/journal.pone.0034480. Epub 2012 Apr 6.
10
Preimplantation development regulatory pathway construction through a text-mining approach.通过文本挖掘方法构建胚胎植入前发育调控通路。
BMC Genomics. 2011 Dec 22;12 Suppl 4(Suppl 4):S3. doi: 10.1186/1471-2164-12-S4-S3.
Pac Symp Biocomput. 2009:87-98.
4
Arena3D: visualization of biological networks in 3D.Arena3D:生物网络的三维可视化
BMC Syst Biol. 2008 Nov 28;2:104. doi: 10.1186/1752-0509-2-104.
5
STRING 8--a global view on proteins and their functional interactions in 630 organisms.STRING 8——关于630种生物中蛋白质及其功能相互作用的全局视图。
Nucleic Acids Res. 2009 Jan;37(Database issue):D412-6. doi: 10.1093/nar/gkn760. Epub 2008 Oct 21.
6
MINT and IntAct contribute to the Second BioCreative challenge: serving the text-mining community with high quality molecular interaction data.MINT和IntAct助力第二届生物创意挑战赛:为文本挖掘社区提供高质量分子相互作用数据。
Genome Biol. 2008;9 Suppl 2(Suppl 2):S5. doi: 10.1186/gb-2008-9-s2-s5. Epub 2008 Sep 1.
7
Overview of the protein-protein interaction annotation extraction task of BioCreative II.生物创意II蛋白质-蛋白质相互作用注释提取任务概述。
Genome Biol. 2008;9 Suppl 2(Suppl 2):S4. doi: 10.1186/gb-2008-9-s2-s4. Epub 2008 Sep 1.
8
Salicylic acid in plant defence--the players and protagonists.植物防御中的水杨酸——参与者与主角
Curr Opin Plant Biol. 2007 Oct;10(5):466-72. doi: 10.1016/j.pbi.2007.08.008. Epub 2007 Sep 27.
9
Mechanisms of high salinity tolerance in plants.植物耐高盐机制。
Methods Enzymol. 2007;428:419-38. doi: 10.1016/S0076-6879(07)28024-3.
10
Automatic reconstruction of a bacterial regulatory network using Natural Language Processing.使用自然语言处理自动重建细菌调控网络。
BMC Bioinformatics. 2007 Aug 7;8:293. doi: 10.1186/1471-2105-8-293.