Baltoumas Fotis A, Zafeiropoulou Sofia, Karatzas Evangelos, Paragkamian Savvas, Thanati Foteini, Iliopoulos Ioannis, Eliopoulos Aristides G, Schneider Reinhard, Jensen Lars Juhl, Pafilis Evangelos, Pavlopoulos Georgios A
Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari 16672, Greece.
Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes P.O. Box 2214, 71003 Heraklion, Crete, Greece.
NAR Genom Bioinform. 2021 Oct 6;3(4):lqab090. doi: 10.1093/nargab/lqab090. eCollection 2021 Dec.
Extracting and processing information from documents is of great importance as lots of experimental results and findings are stored in local files. Therefore, extracting and analyzing biomedical terms from such files in an automated way is absolutely necessary. In this article, we present OnTheFly, a web application for extracting biomedical entities from individual files such as plain texts, office documents, PDF files or images. OnTheFly can generate informative summaries in popup windows containing knowledge related to the identified terms along with links to various databases. It uses the EXTRACT tagging service to perform named entity recognition (NER) for genes/proteins, chemical compounds, organisms, tissues, environments, diseases, phenotypes and gene ontology terms. Multiple files can be analyzed, whereas identified terms such as proteins or genes can be explored through functional enrichment analysis or be associated with diseases and PubMed entries. Finally, protein-protein and protein-chemical networks can be generated with the use of STRING and STITCH services. To demonstrate its capacity for knowledge discovery, we interrogated published meta-analyses of clinical biomarkers of severe COVID-19 and uncovered inflammatory and senescence pathways that impact disease pathogenesis. OnTheFly currently supports 197 species and is available at http://bib.fleming.gr:3838/OnTheFly/ and http://onthefly.pavlopouloslab.info.
从文档中提取和处理信息非常重要,因为大量的实验结果和发现都存储在本地文件中。因此,以自动化方式从此类文件中提取和分析生物医学术语绝对必要。在本文中,我们介绍了OnTheFly,这是一个用于从单个文件(如纯文本、办公文档、PDF文件或图像)中提取生物医学实体的Web应用程序。OnTheFly可以在弹出窗口中生成包含与已识别术语相关知识的信息摘要以及指向各种数据库的链接。它使用EXTRACT标记服务对基因/蛋白质、化合物、生物体、组织、环境、疾病、表型和基因本体术语进行命名实体识别(NER)。可以分析多个文件,而诸如蛋白质或基因等已识别术语可以通过功能富集分析进行探索,或者与疾病和PubMed条目相关联。最后,可以使用STRING和STITCH服务生成蛋白质-蛋白质和蛋白质-化学网络。为了证明其知识发现能力,我们查阅了已发表的关于重症COVID-19临床生物标志物的荟萃分析,并发现了影响疾病发病机制的炎症和衰老途径。OnTheFly目前支持197个物种,可在http://bib.fleming.gr:3838/OnTheFly/和http://onthefly.pavlopouloslab.info上获取。
J Cheminform. 2018-12-5
Bioinformatics. 2014-8-6
Bioinformatics. 2011-10-12
Bioinformatics. 2004-5-1
Bioinformatics. 2016-9-15
Comput Struct Biotechnol J. 2025-6-14
Comput Struct Biotechnol J. 2024-8-21
Front Cardiovasc Med. 2023-4-18
Genomics Proteomics Bioinformatics. 2022-6
Science. 2021-7-16
Bioinformatics. 2021-9-9
Nucleic Acids Res. 2021-1-8
Nucleic Acids Res. 2021-1-8
Nucleic Acids Res. 2021-1-8