Gyori Benjamin M, Hoyt Charles Tapley, Steppi Albert
Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Bioinform Adv. 2022 May 11;2(1):vbac034. doi: 10.1093/bioadv/vbac034. eCollection 2022.
Gilda is a software tool and web service that implements a scored string matching algorithm for names and synonyms across entries in biomedical ontologies covering genes, proteins (and their families and complexes), small molecules, biological processes and diseases. Gilda integrates machine-learned disambiguation models to choose between ambiguous strings given relevant surrounding text as context, and supports species-prioritization in case of ambiguity.
The Gilda web service is available at http://grounding.indra.bio with source code, documentation and tutorials available via https://github.com/indralab/gilda.
Supplementary data are available at online.
吉尔达(Gilda)是一种软件工具和网络服务,它针对生物医学本体中涵盖基因、蛋白质(及其家族和复合物)、小分子、生物过程和疾病的条目,实现了一种带评分的字符串匹配算法。吉尔达集成了机器学习消歧模型,以便在给定相关上下文文本的情况下,在模糊字符串之间进行选择,并在出现歧义时支持物种优先级排序。
吉尔达网络服务可通过http://grounding.indra.bio获取,其源代码、文档和教程可通过https://github.com/indralab/gilda获取。
补充数据可在网上获取。