Wadia Roxanne, Akgun Kathleen, Brandt Cynthia, Fenton Brenda T, Levin Woody, Marple Andrew H, Garla Vijay, Rose Michal G, Taddei Tamar, Taylor Caroline
Roxanne Wadia, Kathleen Akgun, Cynthia Brandt, Brenda T. Fenton, Andrew H. Marple, Vijay Garla, Michal G. Rose, and Tamar Taddei, Yale University School of Medicine, New Haven; and Roxanne Wadia, Kathleen Akgun, Cynthia Brandt, Brenda T. Fenton, Woody Levin, Michal G. Rose, Tamar Taddei, and Caroline Taylor, Veterans Affairs Connecticut Healthcare System, West Haven, CT.
JCO Clin Cancer Inform. 2018 Dec;2:1-7. doi: 10.1200/CCI.17.00069.
To compare the accuracy and reliability of a natural language processing (NLP) algorithm with manual coding by radiologists, and the combination of the two methods, for the identification of patients whose computed tomography (CT) reports raised the concern for lung cancer.
An NLP algorithm was developed using Clinical Text Analysis and Knowledge Extraction System (cTAKES) with the Yale cTAKES Extensions and trained to differentiate between language indicating benign lesions and lesions concerning for lung cancer. A random sample of 450 chest CT reports performed at Veterans Affairs Connecticut Healthcare System between January 2014 and July 2015 was selected. A reference standard was created by the manual review of reports to determine if the text stated that follow-up was needed for concern for cancer. The NLP algorithm was applied to all reports and compared with case identification using the manual coding by the radiologists.
A total of 450 reports representing 428 patients were analyzed. NLP had higher sensitivity and lower specificity than manual coding (77.3% v 51.5% and 72.5% v 82.5%, respectively). NLP and manual coding had similar positive predictive values (88.4% v 88.9%), and NLP had a higher negative predictive value than manual coding (54% v 38.5%). When NLP and manual coding were combined, sensitivity increased to 92.3%, with a decrease in specificity to 62.85%. Combined NLP and manual coding had a positive predictive value of 87.0% and a negative predictive value of 75.2%.
Our NLP algorithm was more sensitive than manual coding of CT chest reports for the identification of patients who required follow-up for suspicion of lung cancer. The combination of NLP and manual coding is a sensitive way to identify patients who need further workup for lung cancer.
比较自然语言处理(NLP)算法与放射科医生手动编码以及两种方法相结合,在识别计算机断层扫描(CT)报告提示肺癌可能性的患者时的准确性和可靠性。
使用临床文本分析与知识提取系统(cTAKES)及耶鲁cTAKES扩展开发了一种NLP算法,并进行训练以区分表示良性病变和提示肺癌病变的语言。选取了2014年1月至2015年7月在康涅狄格州退伍军人事务医疗系统进行的450份胸部CT报告的随机样本。通过对报告进行人工审核创建参考标准,以确定文本中是否表明因怀疑癌症需要进行随访。将NLP算法应用于所有报告,并与放射科医生的手动编码进行病例识别比较。
共分析了代表428例患者的450份报告。NLP的敏感性高于手动编码,特异性低于手动编码(分别为77.3%对51.5%和72.5%对82.5%)。NLP和手动编码的阳性预测值相似(88.4%对88.9%),且NLP的阴性预测值高于手动编码(54%对38.5%)。当NLP和手动编码相结合时,敏感性提高到92.3%,特异性降至62.85%。NLP与手动编码相结合的阳性预测值为87.0%,阴性预测值为75.2%。
我们的NLP算法在识别因怀疑肺癌需要随访的患者方面比CT胸部报告的手动编码更敏感。NLP与手动编码相结合是识别需要进一步检查肺癌患者的一种敏感方法。