Hripcsak G, Kuperman G J, Friedman C
Department of Medical Informatics, Columbia University, New York, USA.
Methods Inf Med. 1998 Jan;37(1):1-7.
While natural language processing systems are beginning to see clinical use, it remains unclear whether they can be disseminated effectively through the health care community. MedLEE, a general-purpose natural language processor developed for Columbia-Presbyterian Medical Center, was compared to physicians' ability to detect seven clinical conditions in 200 Brigham and Women's Hospital chest radiograph reports. Using the system on the new institution's reports resulted in a small but measurable drop in performance (it was distinguishable from physicians at p = 0.011). By making adjustments to the interpretation of the processor's coded output (without changing the processor itself), local behavior was better accommodated, and performance improved so that it was indistinguishable from the physicians. Pairs of physicians disagreed on at least one condition for 22% of reports; the source of disagreement appeared to be interpretation of findings, gauging likelihood and degree of disease, and coding errors.
虽然自然语言处理系统已开始在临床中使用,但它们能否在医疗保健领域有效推广仍不明确。MedLEE是为哥伦比亚长老会医学中心开发的通用自然语言处理器,研究人员将其与医生在200份布莱根妇女医院胸部X光报告中检测七种临床病症的能力进行了比较。在新机构的报告中使用该系统导致性能出现了虽小但可测量的下降(在p = 0.011时与医生的表现有显著差异)。通过调整对处理器编码输出的解释(不改变处理器本身),更好地适应了本地情况,性能得到改善,使其与医生的表现难以区分。在22%的报告中,医生对至少一种病症的诊断存在分歧;分歧的根源似乎在于对检查结果的解读、对疾病可能性和程度的判断以及编码错误。