Sager N, Bross I D, Story G, Bastedo P, Marsh E, Shedd D
Comput Biol Med. 1982;12(1):43-56. doi: 10.1016/0010-4825(82)90011-7.
An experiment in the automatic encoding of English-language medical data is described. The encoding program has two stages. First, the free-text input is parsed and the information is arranged in a tabular format by a general-purpose natural language processor developed at New York University. Then a simple code-dependent subprogram assigns numerical values to the entries on the basis of the positions the input words occupy in the information format. Results of a blind test of the encoding program using the code employed at Roswell Park Memorial Institute for earliest symptoms of head-neck cancer are presented.
本文描述了一项关于英语医学数据自动编码的实验。编码程序有两个阶段。首先,由纽约大学开发的通用自然语言处理器对自由文本输入进行解析,并将信息整理成表格形式。然后,一个简单的代码相关子程序根据输入词在信息格式中所占的位置为条目赋予数值。文中展示了使用罗斯韦尔公园纪念研究所用于头颈癌最早症状的编码对该编码程序进行盲测的结果。