Department of Pharmaceutical Industry, Industrial Technology Center of Wakayama Prefecture, Wakayama, Japan.
Department of Digital Manufacturing, Industrial Technology Center of Wakayama Prefecture, Wakayama, Japan.
PLoS One. 2023 May 15;18(5):e0285716. doi: 10.1371/journal.pone.0285716. eCollection 2023.
Plant extract is a mixture of diverse phytochemicals, and considered as an important resource for drug discovery. However, large-scale exploration of the bioactive extracts has been hindered by various obstacles until now. In this research, we have introduced and evaluated a new computational screening strategy that classifies bioactive compounds and plants in semantic space generated by word embedding algorithm. The classifier showed good performance in binary (presence/absence of bioactivity) classification for both compounds and plant genera. Furthermore, the strategy led to the discovery of antimicrobial activity of essential oils from Lindera triloba and Cinnamomum sieboldii against Staphylococcus aureus. The results of this study indicate that machine-learning classification in semantic space can be a highly efficient approach for exploring bioactive plant extracts.
植物提取物是多种植物化学物质的混合物,被认为是药物发现的重要资源。然而,直到现在,大规模探索具有生物活性的提取物一直受到各种障碍的阻碍。在这项研究中,我们引入并评估了一种新的计算筛选策略,该策略通过词嵌入算法在语义空间中对生物活性化合物和植物进行分类。该分类器在化合物和植物属的二元(存在/不存在生物活性)分类中表现出良好的性能。此外,该策略还发现了月桂叶和肉桂的精油对金黄色葡萄球菌的抗菌活性。本研究结果表明,语义空间中的机器学习分类可以成为探索具有生物活性的植物提取物的一种高效方法。