UFR SMBH Léonard de Vinci, Université Paris 13, 93017 Bobigny Cedex, France.
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):549-54. doi: 10.1136/jamia.2010.004036.
Pharmacotherapy is an integral part of any medical care process and plays an important role in the medical history of most patients. Information on medication is crucial for several tasks such as pharmacovigilance, medical decision or biomedical research.
Within a narrative text, medication-related information can be buried within other non-relevant data. Specific methods, such as those provided by text mining, must be designed for accessing them, and this is the objective of this study.
The authors designed a system for analyzing narrative clinical documents to extract from them medication occurrences and medication-related information. The system also attempts to deduce medications not covered by the dictionaries used.
Results provided by the system were evaluated within the framework of the I2B2 NLP challenge held in 2009. The system achieved an F-measure of 0.78 and ranked 7th out of 20 participating teams (the highest F-measure was 0.86). The system provided good results for the annotation and extraction of medication names, their frequency, dosage and mode of administration (F-measure over 0.81), while information on duration and reasons is poorly annotated and extracted (F-measure 0.36 and 0.29, respectively). The performance of the system was stable between the training and test sets.
药物疗法是任何医疗护理过程的一个组成部分,在大多数患者的医疗史中发挥着重要作用。药物信息对于药物警戒、医疗决策或生物医学研究等多项任务至关重要。
在叙述性文本中,与药物相关的信息可能隐藏在其他不相关的数据中。必须设计特定的方法(如文本挖掘提供的方法)来访问这些信息,这就是本研究的目的。
作者设计了一个用于分析叙述性临床文档的系统,以从文档中提取药物出现和与药物相关的信息。该系统还试图推断出字典中未涵盖的药物。
系统的结果在 2009 年 I2B2 NLP 挑战赛的框架内进行了评估。该系统的 F 度量值为 0.78,在 20 个参赛团队中排名第 7(最高 F 度量值为 0.86)。该系统在药物名称的注释和提取、其频率、剂量和给药方式(F 度量值均超过 0.81)方面提供了良好的结果,而关于持续时间和原因的信息注释和提取效果较差(F 度量值分别为 0.36 和 0.29)。系统在训练集和测试集之间的性能表现稳定。