Tak Yae Won, Han Jeong Hyun, Park Yu Jin, Kim Do-Hoon, Oh Ji Seon, Lee Yura
Department of Information Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul 05505, Republic of Korea.
Medical Information-Management Team, Asan Medical Center, Seoul 05505, Republic of Korea.
Cancers (Basel). 2023 Jun 27;15(13):3371. doi: 10.3390/cancers15133371.
Death is a crucial outcome in retrospective cohort studies, serving as a criterion for analyzing mortality in a database. This study aimed to assess the quality of extracted death data and to investigate whether the final-administered medication can serve as a variable for quantifying the accuracy of a validation dataset. Electronic health records from a hospital and records from the Korean Central Cancer Registry were used. The gold standard was established by examining the differences between the dates of in-hospital deaths and cancer-registered deaths, and was defined as patients who died in the hospital after 2006 and whose final hospital visit/discharge date and death date differed by 0 or 1 day. Cosine similarity was used to quantify how similar the final-administered medication was between the gold standard and other cohorts. Under each of three criteria, namely (a) cancer stage, (b) cancer type, and (c) type of final visit, mortality rates correlated positively with the similarity of the final-administered medication. This study introduces a measure that provides additional information on the accuracy of death data and differentiates the reliability of a dataset.
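The two steps named in the abstract, selecting the gold standard and comparing final-administered medication by cosine similarity, can be illustrated with a minimal sketch. The abstract does not say how medication vectors were built, so the count-based profile below is an assumption, as are the function names, the drug names, and the reading of "after 2006" as a death year later than 2006.

```python
from collections import Counter
from datetime import date
from math import sqrt


def is_gold_standard(died_in_hospital: bool, last_visit: date, death: date) -> bool:
    """Criterion from the abstract: in-hospital death after 2006, with the
    final visit/discharge date and death date differing by 0 or 1 day.
    Assumption: "after 2006" is read as death.year > 2006."""
    day_gap = abs((death - last_visit).days)
    return died_in_hospital and death.year > 2006 and day_gap <= 1


def medication_profile(final_medications: list[str]) -> Counter:
    """Assumed representation: how often each drug appears as a
    final-administered medication within a cohort."""
    return Counter(final_medications)


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[drug] * b[drug] for drug in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty cohort has no defined direction
    return dot / (norm_a * norm_b)


# Hypothetical drug names, for illustration only.
gold = medication_profile(["morphine", "morphine", "midazolam"])
other = medication_profile(["morphine", "aspirin"])
similarity = cosine_similarity(gold, other)
```

A value near 1 means a cohort's final-medication profile resembles that of the gold standard, and a value near 0 means the two share few drugs.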