Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA 30322, USA.
Bioinformatics. 2021 Aug 25;37(16):2499-2501. doi: 10.1093/bioinformatics/btaa995.
LexExp is an open-source, data-centric lexicon expansion system that generates spelling variants of lexical expressions in a lexicon using a phrase embedding model, lexical similarity-based natural language processing methods and a set of tunable threshold decay functions. The system is customizable, can be optimized for recall or precision and can generate variants for multi-word expressions.
Code available at: https://bitbucket.org/asarker/lexexp; data and resources available at: https://sarkerlab.org/lexexp.
Supplementary data are available at Bioinformatics online.
LexExp 是一个开源的、以数据为中心的词典扩展系统,它使用短语嵌入模型、基于词汇相似度的自然语言处理方法和一组可调的阈值衰减函数,根据词典中的词汇表达式生成拼写变体。该系统具有可定制性,可以针对召回率或准确率进行优化,并且可以为多词表达式生成变体。
代码可在:https://bitbucket.org/asarker/lexexp 获得;数据和资源可在:https://sarkerlab.org/lexexp 获得。
补充数据可在生物信息学在线获得。