FACTA：一个用于查找相关生物医学概念的文本搜索引擎。

FACTA: a text search engine for finding associated biomedical concepts.

作者信息

Tsuruoka Yoshimasa, Tsujii Jun'ichi, Ananiadou Sophia

机构信息

School of Computer Science, The University of Manchester, Manchester, UK.

出版信息

Bioinformatics. 2008 Nov 1;24(21):2559-60. doi: 10.1093/bioinformatics/btn469. Epub 2008 Sep 4.

DOI:10.1093/bioinformatics/btn469

PMID:18772154

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2572701/

Abstract

UNLABELLED

FACTA is a text search engine for MEDLINE abstracts, which is designed particularly to help users browse biomedical concepts (e.g. genes/proteins, diseases, enzymes and chemical compounds) appearing in the documents retrieved by the query. The concepts are presented to the user in a tabular format and ranked based on the co-occurrence statistics. Unlike existing systems that provide similar functionality, FACTA pre-indexes not only the words but also the concepts mentioned in the documents, which enables the user to issue a flexible query (e.g. free keywords or Boolean combinations of keywords/concepts) and receive the results immediately even when the number of the documents that match the query is very large. The user can also view snippets from MEDLINE to get textual evidence of associations between the query terms and the concepts. The concept IDs and their names/synonyms for building the indexes were collected from several biomedical databases and thesauri, such as UniProt, BioThesaurus, UMLS, KEGG and DrugBank.

AVAILABILITY

The system is available at http://www.nactem.ac.uk/software/facta/

摘要

未加标注

FACTA是一个用于MEDLINE摘要的文本搜索引擎，其特别设计用于帮助用户浏览查询检索到的文档中出现的生物医学概念（如基因/蛋白质、疾病、酶和化合物）。这些概念以表格形式呈现给用户，并根据共现统计进行排序。与提供类似功能的现有系统不同，FACTA不仅对文档中出现的单词进行预索引，还对概念进行预索引，这使得用户能够发出灵活的查询（如自由关键词或关键词/概念的布尔组合），即使匹配查询的文档数量非常大，也能立即收到结果。用户还可以查看MEDLINE的片段，以获取查询词与概念之间关联的文本证据。用于构建索引的概念ID及其名称/同义词是从几个生物医学数据库和词库中收集的，如UniProt、BioThesaurus、UMLS、KEGG和DrugBank。

可用性

该系统可在http://www.nactem.ac.uk/software/facta/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ac7/2648144/5f242cc5c2e6/btn469f1.jpg

相似文献

FACTA: a text search engine for finding associated biomedical concepts.FACTA：一个用于查找相关生物医学概念的文本搜索引擎。

Bioinformatics. 2008 Nov 1;24(21):2559-60. doi: 10.1093/bioinformatics/btn469. Epub 2008 Sep 4.

Discovering and visualizing indirect associations between biomedical concepts.发现和可视化生物医学概念之间的间接关联。

Bioinformatics. 2011 Jul 1;27(13):i111-9. doi: 10.1093/bioinformatics/btr214.

MedEvi: retrieving textual evidence of relations between biomedical concepts from Medline.MedEvi：从医学在线数据库检索生物医学概念之间关系的文本证据。

Bioinformatics. 2008 Jun 1;24(11):1410-2. doi: 10.1093/bioinformatics/btn117. Epub 2008 Apr 9.

BIOMedical Search Engine Framework: Lightweight and customized implementation of domain-specific biomedical search engines.生物医学搜索引擎框架：特定领域生物医学搜索引擎的轻量级定制实现。

Comput Methods Programs Biomed. 2016 Jul;131:63-77. doi: 10.1016/j.cmpb.2016.03.030. Epub 2016 Apr 8.

G-Bean: an ontology-graph based web tool for biomedical literature retrieval.G-Bean：基于本体图的生物医学文献检索网络工具。

BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-15-S12-S1. Epub 2014 Nov 6.

Bayesian approach to incorporating different types of biomedical knowledge bases into information retrieval systems for clinical decision support in precision medicine.贝叶斯方法在将不同类型的生物医学知识库整合到精准医学临床决策支持信息检索系统中的应用。

J Biomed Inform. 2019 Oct;98:103238. doi: 10.1016/j.jbi.2019.103238. Epub 2019 Jul 10.

Concept-based query expansion for retrieving gene related publications from MEDLINE.基于概念的查询扩展，从 MEDLINE 中检索与基因相关的文献。

BMC Bioinformatics. 2010 Apr 28;11:212. doi: 10.1186/1471-2105-11-212.

MEDRank: using graph-based concept ranking to index biomedical texts.MEDRank：基于图的概念排序在生物医学文本索引中的应用。

Int J Med Inform. 2011 Jun;80(6):431-41. doi: 10.1016/j.ijmedinf.2011.02.008. Epub 2011 Mar 25.

EBIMed--text crunching to gather facts for proteins from Medline.EBIMed——通过文本处理从医学在线数据库中收集蛋白质相关事实。

Bioinformatics. 2007 Jan 15;23(2):e237-44. doi: 10.1093/bioinformatics/btl302.

Meshable: searching PubMed abstracts by utilizing MeSH and MeSH-derived topical terms.可网格化：利用医学主题词表（MeSH）及其衍生主题词搜索PubMed摘要。

Bioinformatics. 2016 Oct 1;32(19):3044-6. doi: 10.1093/bioinformatics/btw331. Epub 2016 Jun 10.

引用本文的文献

Predicting potential target genes in molecular biology experiments using machine learning and multifaceted data sources.利用机器学习和多方面数据源预测分子生物学实验中的潜在靶基因。

iScience. 2024 Feb 23;27(3):109309. doi: 10.1016/j.isci.2024.109309. eCollection 2024 Mar 15.

The Multienzyme Complex Nature of Dehydroepiandrosterone Sulfate Biosynthesis.硫酸脱氢表雄酮生物合成的多酶复合物性质。

Int J Mol Sci. 2024 Feb 8;25(4):2072. doi: 10.3390/ijms25042072.

Optimizing Signal Management in a Vaccine Adverse Event Reporting System: A Proof-of-Concept with COVID-19 Vaccines Using Signs, Symptoms, and Natural Language Processing.优化疫苗不良事件报告系统中的信号管理：使用体征、症状和自然语言处理对 COVID-19 疫苗进行概念验证

Drug Saf. 2024 Feb;47(2):173-182. doi: 10.1007/s40264-023-01381-6. Epub 2023 Dec 7.

Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service.大规模科技文献查询服务的文献聚类算法研究。

Comput Intell Neurosci. 2022 Aug 21;2022:3392489. doi: 10.1155/2022/3392489. eCollection 2022.

Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries.结合文献挖掘和机器学习预测生物医学发现。

Methods Mol Biol. 2022;2496:123-140. doi: 10.1007/978-1-0716-2305-3_7.

Darling: A Web Application for Detecting Disease-Related Biomedical Entity Associations with Literature Mining.亲爱的：一个使用文献挖掘技术检测与疾病相关的生物医学实体关联的网络应用程序。

Biomolecules. 2022 Mar 30;12(4):520. doi: 10.3390/biom12040520.

Diseases 2.0: a weekly updated database of disease-gene associations from text mining and data integration.疾病 2.0：从文本挖掘和数据集成中获取的每周更新的疾病-基因关联数据库。

Database (Oxford). 2022 Mar 28;2022. doi: 10.1093/database/baac019.

DeepEventMine: end-to-end neural nested event extraction from biomedical texts.DeepEventMine：从生物医学文本中进行端到端的神经嵌套事件提取。

Bioinformatics. 2020 Dec 8;36(19):4910-4917. doi: 10.1093/bioinformatics/btaa540.

A New Synuclein-Transgenic Mouse Model for Early Parkinson's Reveals Molecular Features of Preclinical Disease.一种新型的帕金森病转基因小鼠模型揭示了临床前期疾病的分子特征。

Mol Neurobiol. 2021 Feb;58(2):576-602. doi: 10.1007/s12035-020-02085-z. Epub 2020 Sep 30.

Employing computational linguistics techniques to identify limited patient health literacy: Findings from the ECLIPPSE study.运用计算语言学技术识别有限的患者健康素养：来自 ECLIPPSE 研究的发现。

Health Serv Res. 2021 Feb;56(1):132-144. doi: 10.1111/1475-6773.13560. Epub 2020 Sep 23.

本文引用的文献

Anni 2.0: a multipurpose text-mining tool for the life sciences.Anni 2.0：一款用于生命科学的多功能文本挖掘工具。

Genome Biol. 2008;9(6):R96. doi: 10.1186/gb-2008-9-6-r96. Epub 2008 Jun 12.

PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites.PolySearch：一个基于网络的文本挖掘系统，用于提取人类疾病、基因、突变、药物和代谢物之间的关系。

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W399-405. doi: 10.1093/nar/gkn296. Epub 2008 May 16.

EBIMed--text crunching to gather facts for proteins from Medline.EBIMed——通过文本处理从医学在线数据库中收集蛋白质相关事实。

Bioinformatics. 2007 Jan 15;23(2):e237-44. doi: 10.1093/bioinformatics/btl302.

BioThesaurus: a web-based thesaurus of protein and gene names.生物词库：一个基于网络的蛋白质和基因名称词库。

Bioinformatics. 2006 Jan 1;22(1):103-5. doi: 10.1093/bioinformatics/bti749. Epub 2005 Nov 2.

LitMiner and WikiGene: identifying problem-related key players of gene regulation using publication abstracts.LitMiner和WikiGene：利用出版物摘要识别基因调控中与问题相关的关键参与者。

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W779-82. doi: 10.1093/nar/gki417.

MedlineR: an open source library in R for Medline literature data mining.MedlineR：R语言中用于Medline文献数据挖掘的开源库。

Bioinformatics. 2004 Dec 12;20(18):3659-61. doi: 10.1093/bioinformatics/bth404. Epub 2004 Jul 29.

Update on XplorMed: A web server for exploring scientific literature.XplorMed最新进展：用于探索科学文献的网络服务器

Nucleic Acids Res. 2003 Jul 1;31(13):3866-8. doi: 10.1093/nar/gkg538.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

FACTA：一个用于查找相关生物医学概念的文本搜索引擎。

FACTA: a text search engine for finding associated biomedical concepts.

作者信息

机构信息

出版信息

UNLABELLED

AVAILABILITY

未加标注

可用性

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献