Hripcsak George, Austin John H M, Alderson Philip O, Friedman Carol
Department of Medical Informatics, Columbia University, 622 W 168th St, VC-5, New York, NY 10032, USA.
Radiology. 2002 Jul;224(1):157-63. doi: 10.1148/radiol.2241011118.
To evaluate translation of chest radiographic reports by using natural language processing and to compare the findings with those in the literature.
A natural language processor coded 10 years of narrative chest radiographic reports from an urban academic medical center. Coding for 150 reports was compared with manual coding. Frequencies and co-occurrences of 24 clinical conditions (diseases, abnormalities, and clinical states) were estimated. The ratio of right to left lung mass, association of pleural effusion with other conditions, and frequency of bullet and stab wounds were compared with independent observations. The sensitivity and specificity of the system's pneumothorax coding were compared with those of manual financial coding.
The system coded 889,921 reports on 251,186 patients. On the basis of manual coding of 150 reports, the processor's sensitivity (0.81) and specificity (0.99) were comparable to those previously reported for natural language processing and for expert coders. The frequencies of the selected conditions ranged from 0.22 for pleural effusion to 0.0004 for tension pneumothorax. The database confirmed earlier observations that lung cancer occurs in a 3:2 right-to-left ratio. The association of pleural effusion with other conditions mirrored that in the literature. Bullet and stab wounds decreased during 10 years at a rate consistent with crime statistics. A review of pneumothorax cases showed that the database (sensitivity, 1.00; specificity, 0.996) was more accurate than financial discharge coding (sensitivity, 0.17; P =.002; specificity, 0.996; not significant).
Internal and external validation in this study confirmed the accuracy of natural language processing for translating chest radiographic narrative reports into a large database of information.
利用自然语言处理技术评估胸部X光报告的翻译情况,并将结果与文献中的发现进行比较。
一个自然语言处理器对一家城市学术医疗中心10年的叙述性胸部X光报告进行编码。将150份报告的编码与人工编码进行比较。估计了24种临床情况(疾病、异常和临床状态)的频率和共现情况。将右肺与左肺肿块的比例、胸腔积液与其他情况的关联以及枪伤和刺伤的频率与独立观察结果进行比较。将该系统气胸编码的敏感性和特异性与人工财务编码的敏感性和特异性进行比较。
该系统对251,186名患者的889,921份报告进行了编码。基于对150份报告的人工编码,该处理器的敏感性(0.81)和特异性(0.99)与先前报道的自然语言处理和专家编码的敏感性和特异性相当。所选情况的频率范围从胸腔积液的0.22到张力性气胸的0.0004。该数据库证实了早期的观察结果,即肺癌的发生比例为右肺与左肺3:2。胸腔积液与其他情况的关联与文献中的情况相符。在10年期间,枪伤和刺伤的数量以与犯罪统计数据一致的速度下降。对气胸病例的审查表明,该数据库(敏感性为1.00;特异性为0.996)比财务出院编码(敏感性为0.17;P = 0.002;特异性为0.996;无显著性差异)更准确。
本研究中的内部和外部验证证实了自然语言处理技术在将胸部X光叙述性报告翻译成大型信息数据库方面的准确性。