Faculty of Pharmaceutical Sciences, Tokyo University of Science, Noda, Yamazaki, Chiba, 2641, Japan.
Center for Kampo Medicine, Keio University School of Medicine, 35, Shinanomachi, Shinjuku-ku, Tokyo, Japan.
BMC Med Inform Decis Mak. 2023 Jul 13;23(1):119. doi: 10.1186/s12911-023-02230-3.
Kampo medicine is widely used in Japan; however, most physicians and pharmacists have insufficient knowledge and experience in it. Although a chatbot-style system using machine learning and natural language processing has been used in some clinical settings and proven useful, the system developed specifically for the Japanese language using this method has not been validated by research. The purpose of this study is to develop a novel drug information provision system for Kampo medicines using a natural language classifier® (NLC®) based on IBM Watson.
The target Kampo formulas were 33 formulas listed in the 17th revision of the Japanese Pharmacopoeia. The information included in the system comes from the package inserts of Kampo medicines, Manuals for Management of Individual Serious Adverse Drug Reactions, and data on off-label usage. The system developed in this study classifies questions about the drug information of Kampo formulas input by natural language into preset questions and outputs preset answers for the questions. The system uses morphological analysis, synonym conversion by thesaurus, and NLC®. We fine-tuned the information registered into NLC® and increased the thesaurus. To validate the system, 900 validation questions were provided by six pharmacists who were classified into high or low levels of knowledge and experience of Kampo medicines and three pharmacy students.
The precision, recall, and F-measure of the system performance were 0.986, 0.915, and 0.949, respectively. The results were stable even with differences in the amount of expertise of the question authors.
We developed a system using natural language classification that can give appropriate answers to most of the validation questions.
汉方药在日本被广泛应用;然而,大多数医生和药剂师对其了解和经验不足。虽然在一些临床环境中已经使用了基于机器学习和自然语言处理的聊天机器人式系统,并被证明是有用的,但使用这种方法专门为日语开发的系统尚未通过研究验证。本研究的目的是使用基于 IBM Watson 的自然语言分类器(NLC®)为汉方药开发一种新的药物信息提供系统。
目标汉方制剂是日本药典第 17 版中列出的 33 种配方。系统中包含的信息来自汉方药的包装说明书、个别严重药物不良反应管理手册以及标签外使用的数据。本研究开发的系统将输入的关于汉方制剂药物信息的自然语言问题分类为预设问题,并为问题输出预设答案。该系统使用形态分析、同义词转换词库和 NLC®。我们对注册到 NLC®中的信息进行了微调,并增加了词库。为了验证系统,我们向六名药剂师和三名药学学生提供了 900 个验证问题,这些药剂师被分为对汉方药知识和经验水平较高和较低的两类。
系统性能的精度、召回率和 F 度量分别为 0.986、0.915 和 0.949。即使问题作者的专业知识数量存在差异,结果也很稳定。
我们开发了一种使用自然语言分类的系统,能够对大多数验证问题给出适当的答案。