Schroeck Florian R, Patterson Olga V, Alba Patrick R, Pattison Erik A, Seigne John D, DuVall Scott L, Robertson Douglas J, Sirovich Brenda, Goodney Philip P
VA Outcomes Group, White River Junction VA Medical Center, White River Junction, VT; Section of Urology, Dartmouth Hitchcock Medical Center, Lebanon, NH; Norris Cotton Cancer Center, Dartmouth Hitchcock Medical Center, Lebanon, NH; The Dartmouth Institute for Health Policy and Clinical Practice, Geisel School of Medicine at Dartmouth College, Hanover, NH.
Department of Internal Medicine, VA Salt Lake City Health Care System and University of Utah, Salt Lake City, UT.
Urology. 2017 Dec;110:84-91. doi: 10.1016/j.urology.2017.07.056. Epub 2017 Sep 12.
To take the first step toward assembling population-based cohorts of patients with bladder cancer with longitudinal pathology data, we developed and validated a natural language processing (NLP) engine that abstracts pathology data from full-text pathology reports.
Using 600 bladder pathology reports randomly selected from the Department of Veterans Affairs, we developed and validated an NLP engine to abstract data on histology, invasion (presence vs absence and depth), grade, the presence of muscularis propria, and the presence of carcinoma in situ. Our gold standard was based on an independent review of reports by 2 urologists, followed by adjudication. We assessed the NLP performance by calculating the accuracy, the positive predictive value, and the sensitivity. We subsequently applied the NLP engine to pathology reports from 10,725 patients with bladder cancer.
When comparing the NLP output to the gold standard, NLP achieved the highest accuracy (0.98) for the presence vs the absence of carcinoma in situ. Accuracy for histology, invasion (presence vs absence), grade, and the presence of muscularis propria ranged from 0.83 to 0.96. The most challenging variable was depth of invasion (accuracy 0.68), with an acceptable positive predictive value for lamina propria (0.82) and for muscularis propria (0.87) invasion. The validated engine was capable of abstracting pathologic characteristics for 99% of the patients with bladder cancer.
NLP had high accuracy for 5 of 6 variables and abstracted data for the vast majority of the patients. This now allows for the assembly of population-based cohorts with longitudinal pathology data.
为朝着建立具有纵向病理数据的膀胱癌患者人群队列迈出第一步,我们开发并验证了一种自然语言处理(NLP)引擎,该引擎可从全文病理报告中提取病理数据。
我们从退伍军人事务部随机选取600份膀胱病理报告,开发并验证了一个NLP引擎,以提取有关组织学、浸润(存在与否及深度)、分级、固有肌层的存在以及原位癌的存在等数据。我们的金标准基于两名泌尿科医生对报告的独立审查,随后进行裁定。我们通过计算准确率、阳性预测值和敏感性来评估NLP的性能。随后,我们将该NLP引擎应用于10725例膀胱癌患者的病理报告。
将NLP输出与金标准进行比较时,NLP在原位癌存在与否方面的准确率最高(0.98)。组织学、浸润(存在与否)、分级以及固有肌层存在情况的准确率在0.83至0.96之间。最具挑战性的变量是浸润深度(准确率0.68),对于固有层浸润(0.82)和固有肌层浸润(0.87),其阳性预测值尚可接受。经过验证的引擎能够提取99%膀胱癌患者的病理特征。
NLP在6个变量中的5个方面具有较高准确率,并且为绝大多数患者提取了数据。这现在使得能够建立具有纵向病理数据的人群队列。