Medical Physics Unit, McGill University Health Centre, Montreal, QC, Canada; Physics Department, McGill University, Montreal, QC, Canada.
Imagia, 6650 Saint-Urbain Street, Suite 100, Montreal, QC, Canada.
J Biomed Inform. 2021 Aug;120:103864. doi: 10.1016/j.jbi.2021.103864. Epub 2021 Jul 12.
The majority of cancer patients suffer from severe pain at the advanced stage of their illness. In most cases, cancer pain is underestimated by clinical staff and is not properly managed until it reaches a critical stage. Therefore, detecting and addressing cancer pain early can potentially improve the quality of life of cancer patients. The objective of this research project was to develop a generalizable Natural Language Processing (NLP) pipeline to find and classify physician-reported pain in the radiation oncology consultation notes of cancer patients with bone metastases.
The texts of 1249 publicly-available hospital discharge notes in the i2b2 database were used as a training and validation set. The MetaMap and NegEx algorithms were implemented for medical terms extraction. Sets of NLP rules were developed to score pain terms in each note. By averaging pain scores, each note was assigned to one of the three verbally-declared pain (VDP) labels, including no pain, pain, and no mention of pain. Without further training, the generalizability of our pipeline in scoring individual pain terms was tested independently using 30 hospital discharge notes from the MIMIC-III database and 30 consultation notes of cancer patients with bone metastasis from our institution's radiation oncology electronic health record. Finally, 150 notes from our institution were used to assess the pipeline's performance at assigning VDP.
Our NLP pipeline successfully detected and quantified pain in the i2b2 summary notes with 93% overall precision and 92% overall recall. Testing on the MIMIC-III database achieved precision and recall of 91% and 86% respectively. The pipeline successfully detected pain with 89% precision and 82% recall on our institutional radiation oncology corpus. Finally, our pipeline assigned a VDP to each note in our institutional corpus with 84% and 82% precision and recall, respectively.
Our NLP pipeline enables the detection and classification of physician-reported pain in our radiation oncology corpus. This portable and ready-to-use pipeline can be used to automatically extract and classify physician-reported pain from clinical notes where the pain is not otherwise documented through structured data entry.
大多数癌症患者在疾病晚期都会遭受剧烈疼痛。在大多数情况下,临床医务人员会低估癌症疼痛,直到其发展到危急阶段才进行适当的治疗。因此,早期发现和处理癌症疼痛可能会提高癌症患者的生活质量。本研究项目的目的是开发一种可推广的自然语言处理 (NLP) 管道,以发现和分类癌症骨转移患者放射肿瘤学咨询记录中医生报告的疼痛。
使用 i2b2 数据库中 1249 份公开的住院记录文本作为训练和验证集。实现了 MetaMap 和 NegEx 算法来提取医学术语。开发了一系列 NLP 规则来对每个记录中的疼痛术语进行评分。通过平均疼痛评分,将每个记录分配到三个口头声明的疼痛 (VDP) 标签之一,包括无疼痛、疼痛和无疼痛提及。无需进一步培训,我们的管道在使用 MIMIC-III 数据库中的 30 份住院记录和我们机构放射肿瘤学电子健康记录中的 30 份癌症骨转移患者咨询记录独立测试单个疼痛术语的可推广性时,分别取得了 91%和 86%的精度和 82%和 76%的召回率。最后,我们使用 150 份来自我们机构的记录来评估管道在分配 VDP 时的性能。
我们的 NLP 管道成功地在 i2b2 摘要记录中检测和量化了疼痛,总体精度为 93%,总体召回率为 92%。在 MIMIC-III 数据库上的测试分别取得了 91%和 86%的精度和 82%和 76%的召回率。该管道成功地在我们机构的放射肿瘤学语料库中以 89%的精度和 82%的召回率检测到疼痛。最后,我们的管道在我们机构的语料库中为每个记录分配了一个 VDP,精度和召回率分别为 84%和 82%。
我们的 NLP 管道能够在我们的放射肿瘤学语料库中检测和分类医生报告的疼痛。这种便携式、即用型管道可用于从临床记录中自动提取和分类医生报告的疼痛,而这些记录没有通过结构化数据输入来记录疼痛。