Institute of Information Science, Academia Sinica, Nankang, Taipei, Taiwan, ROC; Department of Information Management, National Taiwan University, Taipei, Taiwan, ROC.
Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan, ROC.
J Biomed Inform. 2013 Dec;46 Suppl:S54-S62. doi: 10.1016/j.jbi.2013.09.007. Epub 2013 Sep 20.
Patient discharge summaries provide detailed medical information about individuals who have been hospitalized. To make a precise and legitimate assessment of the abundant data, a proper time layout of the sequence of relevant events should be compiled and used to drive a patient-specific timeline, which could further assist medical personnel in making clinical decisions. The process of identifying the chronological order of entities is called temporal relation extraction. In this paper, we propose a hybrid method to identify appropriate temporal links between a pair of entities. The method combines two approaches: one is rule-based and the other is based on the maximum entropy model. We develop an integration algorithm to fuse the results of the two approaches. All rules and the integration algorithm are formally stated so that one can easily reproduce the system and results. To optimize the system's configuration, we used the 2012 i2b2 challenge TLINK track dataset and applied threefold cross validation to the training set. Then, we evaluated its performance on the training and test datasets. The experiment results show that the proposed TEMPTING (TEMPoral relaTion extractING) system (ranked seventh) achieved an F-score of 0.563, which was at least 30% better than that of the baseline system, which randomly selects TLINK candidates from all pairs and assigns the TLINK types. The TEMPTING system using the hybrid method also outperformed the stage-based TEMPTING system. Its F-scores were 3.51% and 0.97% better than those of the stage-based system on the training set and test set, respectively.
患者出院小结提供了有关住院患者的详细医疗信息。为了对丰富的数据进行精确和合法的评估,应该编制适当的时间布局,以编译相关事件的顺序,并使用该时间布局来驱动特定于患者的时间线,这可以进一步帮助医务人员做出临床决策。确定实体时间顺序的过程称为时间关系提取。在本文中,我们提出了一种混合方法来识别一对实体之间的适当时间链接。该方法结合了两种方法:一种是基于规则的方法,另一种是基于最大熵模型的方法。我们开发了一种集成算法来融合这两种方法的结果。所有规则和集成算法都进行了正式说明,以便人们可以轻松复制系统和结果。为了优化系统配置,我们使用了 2012 年 i2b2 挑战赛 TLINK 轨道数据集,并对训练集进行了三折交叉验证。然后,我们在训练集和测试集上评估了其性能。实验结果表明,所提出的 TEMPTING(TEMPoral relaTion extractING)系统(排名第七)的 F 分数为 0.563,比随机从所有对中选择 TLINK 候选并分配 TLINK 类型的基线系统至少提高了 30%。基于混合方法的 TEMPTING 系统也优于基于阶段的 TEMPTING 系统。在训练集和测试集上,其 F 分数分别比基于阶段的系统高 3.51%和 0.97%。